Fri.Jun 20, 2025

article thumbnail

Normalizing Flows are Capable Generative Models

Machine Learning Research at Apple

Normalizing Flows (NFs) are likelihood-based models for continuous inputs. They have demonstrated promising results on both density estimation and generative modeling tasks, but have received relatively little attention in recent years. In this work, we demonstrate that NFs are more powerful than previously believed. We present TarFlow: a simple and scalable architecture that enables highly performant NF models.

288
288
article thumbnail

Top 5 Frameworks for Distributed Machine Learning

KDnuggets

Use these frameworks to optimize memory and compute resources, scale your machine learning workflow, speed up your processes, and reduce the overall cost.

professionals

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

article thumbnail

Top AI models will lie, cheat and steal to reach goals, Anthropic finds

Flipboard

Large language models across the AI industry are increasingly willing to evade safeguards, resort to deception and even attempt to steal corporate …

AI 181
article thumbnail

Aligning LLMs by Predicting Preferences from User Writing Samples

Machine Learning Research at Apple

Accommodating human preferences is essential for creating aligned LLM agents that deliver personalized and effective interactions. Recent work has shown the potential for LLMs acting as writing agents to infer a description of user preferences. Agent alignment then comes from conditioning on the inferred preference description. However, existing methods often produce generic preference descriptions that fail to capture the unique and individualized nature of human preferences.

173
173
article thumbnail

Airflow Best Practices for ETL/ELT Pipelines

Speaker: Kenten Danas, Senior Manager, Developer Relations

ETL and ELT are some of the most common data engineering use cases, but can come with challenges like scaling, connectivity to other systems, and dynamically adapting to changing data sources. Airflow is specifically designed for moving and transforming data in ETL/ELT pipelines, and new features in Airflow 3.0 like assets, backfills, and event-driven scheduling make orchestrating ETL/ELT pipelines easier than ever!

article thumbnail

Greater Complexity Brings Greater Risk: 4 Tips to Manage Your AI Database

insideBIGDATA

AI advancements will fundamentally change how enterprises use and manage data, making it essential to embrace and understand this transformation. For organizations looking to adopt AI at scale, the state of their databases is a critical success factor. Poor data quality, weak governance.

Database 195
article thumbnail

Meta announces Oakley smart glasses

Hacker News

Meta is announcing its next pair of smart glasses with Oakley. The limited-edition Oakley Meta HSTN (pronounced “how-stuhn”) model costs $499 and is available for preorder starting July 11th. Other Oakley models with Meta’s tech will be available starting at $399 later this summer. Like the existing Meta Ray-Ban glasses , the Oakley model features a front-facing camera, along with open-ear speakers and microphones that are built into the frame.

AI 179

More Trending

article thumbnail

Variational Rectified Flow Matching

Machine Learning Research at Apple

We study Variational Rectified Flow Matching, a framework that enhances classic rectified flow matching by modeling multi-modal velocity vector-fields. At inference time, classic rectified flow matching 'moves' samples from a source distribution to the target distribution by solving an ordinary differential equation via integration along a velocity vector-field.

147
147
article thumbnail

Beyond GridSearchCV: Advanced Hyperparameter Tuning Strategies for Scikit-learn Models

Machine Learning Mastery

Ever felt like trying to find a needle in a haystack? That’s part of the process of building and optimizing machine learning models, particularly complex ones like ensembles and neural networks, where several hyperparameters need to be manually set by us before training them.

article thumbnail

Revisiting Uncertainty Quantification Evaluation in Language Models: Spurious Interactions with Response Length Bias Results

Machine Learning Research at Apple

Uncertainty Quantification (UQ) in Language Models (LMs) is key to improving their safety and reliability. Evaluations often use metrics like AUROC to assess how well UQ methods (e.g., negative sequence probabilities) correlate with task correctness functions (e.g., ROUGE-L). We show that mutual biases--when both UQ methods and correctness functions are biased by the same factors--systematically distort evaluation.

130
130
article thumbnail

This Language AI company says it can now translate the entire internet in just 18 days — 194-day task slashed by deployment of Nvidia DGX SuperPOD with DGX GB200

Flipboard

Skip to main content Open menu Close menu Toms Hardware Toms Hardware Search Search Toms Hardware RSS US Edition UK US Australia Canada Best Picks CPUs GPUs SSDs News 3D Printers Coupons More Newsletter Reviews PC Components PC Building Motherboards Cases Cooling Power Supplies RAM Desktops Laptops Peripherals Monitors Windows 11 Gaming Overclocking About Us Forums Trending Where to Buy RX 9060 XT AMD Radeon RX 9060 XT Where to Buy Switch 2 MI350X and MI355X AI GPUs Where to Buy RTX 5060 Nvidia

article thumbnail

Whats New in Apache Airflow 3.0 –– And How Will It Reshape Your Data Workflows?

Apache Airflow® 3.0, the most anticipated Airflow release yet, officially launched this April. As the de facto standard for data orchestration, Airflow is trusted by over 77,000 organizations to power everything from advanced analytics to production AI and MLOps. With the 3.0 release, the top-requested features from the community were delivered, including a revamped UI for easier navigation, stronger security, and greater flexibility to run tasks anywhere at any time.

article thumbnail

Trade-offs in Data Memorization via Strong Data Processing Inequalities

Machine Learning Research at Apple

Recent research demonstrated that training large language models involves memorization of a significant fraction of training data. Such memorization can lead to privacy violations when training on sensitive user data and thus motivates the study of data memorization's role in learning. In this work, we develop a general approach for proving lower bounds on excess data memorization, that relies on a new connection between strong data processing inequalities and data memorization.

130
130
article thumbnail

The hidden carbon cost of chatting to your AI

Flipboard

AI tools like ChatGPT have changed our personal and professional worlds, with around 52% of American adults regularly using a large language model (LLM). Now, a new study details the immense environmental costs of our prompts, and it might make you think twice about what chatbot you use and how you use it.

AI 135
article thumbnail

LangGraph Studio

Analytics Vidhya

Have you ever found it frustrating to build AI agents that perform multiple tasks? LangGraph Studio is here to solve this problem by offering a visual and interactive way to design, manage, and debug agents. Built on the LangGraph framework, this desktop tool lets you create agent workflows using a simple drag-and-drop interface. You can […] The post LangGraph Studio appeared first on Analytics Vidhya.

Analytics 121
article thumbnail

2025 is NOT the Year of AI Agents

Flipboard

Andrej Karpathy is back, this time explaining how LLMs are rewriting software.At YC AI Startup School, the former head of AI at Tesla gave a talk titled “Software Is Changing (Again),” during which he discussed with students and developers how the concepts of code, computation, and programming are being rethought at a fundamental level.

AI 132
article thumbnail

What’s New in Apache Airflow® 3.0—And How Will It Reshape Your Data Workflows?

Speaker: Tamara Fingerlin, Developer Advocate

Apache Airflow® 3.0, the most anticipated Airflow release yet, officially launched this April. As the de facto standard for data orchestration, Airflow is trusted by over 77,000 organizations to power everything from advanced analytics to production AI and MLOps. With the 3.0 release, the top-requested features from the community were delivered, including a revamped UI for easier navigation, stronger security, and greater flexibility to run tasks anywhere at any time.

article thumbnail

AbsenceBench: Language models can't tell what's missing

Hacker News

Large language models (LLMs) are increasingly capable of processing long inputs and locating specific information within them, as evidenced by their performance on the Needle in a Haystack (NIAH) test. However, while models excel at recalling surprising information, they still struggle to identify clearly omitted information. We introduce AbsenceBench to assesses LLMs' capacity to detect missing information across three domains: numerical sequences, poetry, and GitHub pull requests.

107
107
article thumbnail

AI Generates Relatable Empathy Experiences

Flipboard

Researchers have developed an AI tool called EmoSync that boosts empathy by tailoring emotional analogies to each user’s personality and life experiences.

AI 143
article thumbnail

Is Mathematics Mostly Chaos or Mostly Order?

Hacker News

Two new notions of infinity challenge a long-standing plan to define the mathematical universe.

181
181
article thumbnail

Apple explores acquiring Perplexity AI to supercharge search

Flipboard

Apple reportedly held internal discussions to acquire AI startup Perplexity AI. The latter is an AI-powered search engine. It uses a large language model (LLM) to process the answers and presents them in an easier-to-understand format.

AI 154
article thumbnail

A Guide to Debugging Apache Airflow® DAGs

In Airflow, DAGs (your data pipelines) support nearly every use case. As these workflows grow in complexity and scale, efficiently identifying and resolving issues becomes a critical skill for every data engineer. This is a comprehensive guide with best practices and examples to debugging Airflow DAGs. You’ll learn how to: Create a standardized process for debugging to quickly diagnose errors in your DAGs Identify common issues with DAGs, tasks, and connections Distinguish between Airflow-relate

article thumbnail

Microsoft is preparing to lay off thousands

Dataconomy

Microsoft is preparing to enact thousands of layoffs in July 2025 according to Bloomberg , primarily affecting sales and customer service divisions, while concurrently investing an estimated $80 billion into artificial intelligence infrastructure over the next fiscal year. This substantial investment signals a strategic redirection of company resources.

Azure 103
article thumbnail

Using ChatGPT to write? MIT study says there's a cognitive cost.

Flipboard

The 'Your Brain on ChatGPT' study will make you consider the consequences. Relying on ChatGPT significantly affects critical thinking abilities, according to a new study.

article thumbnail

Altman says GPT-5 coming “if it’s ready”

Dataconomy

OpenAI CEO Sam Altman discussed the forthcoming GPT-5 model, confirming its preparation for public release with a target timeline of summer, contingent upon the model fulfilling OpenAI’s established internal benchmarks and standards. GPT-5, designated as the next foundational model for ChatGPT, is currently undergoing preparations for its public debut.

article thumbnail

Your reliance on ChatGPT might be really bad for your brain

Flipboard

An MIT study finds that heavy reliance on AI tools like ChatGPT can dull memory, weaken critical thinking, and lead to lazier writing. If you value critical thinking, you may want to rethink your use of ChatGPT.

AI 161
article thumbnail

Agent Tooling: Connecting AI to Your Tools, Systems & Data

Speaker: Alex Salazar, CEO & Co-Founder @ Arcade | Nate Barbettini, Founding Engineer @ Arcade | Tony Karrer, Founder & CTO @ Aggregage

There’s a lot of noise surrounding the ability of AI agents to connect to your tools, systems and data. But building an AI application into a reliable, secure workflow agent isn’t as simple as plugging in an API. As an engineering leader, it can be challenging to make sense of this evolving landscape, but agent tooling provides such high value that it’s critical we figure out how to move forward.

article thumbnail

Congestion pricing in Manhattan is a predictable success

Hacker News

So why on earth did it take so long to start?

180
180
article thumbnail

New research says using AI reduces brain activity – but does that mean it's making us dumber?

Flipboard

Amid all the debates about how AI affects jobs, science, the environment, and everything else, there's a question of how large language models impact …

AI 152
article thumbnail

30 secret databases just leaked your life

Dataconomy

Security researchers recently identified 30 online databases collectively containing 16 billion records, likely amassed through infostealing malware, according to a new report from Cybernews. These databases briefly became accessible to the public internet before being secured, though their ownership remains undetermined. The discovered databases varied significantly in scale, with some containing millions of entries and others holding billions.

Database 103
article thumbnail

Google sees big AI adoption in the Gulf

Flipboard

A Google-commissioned survey in Saudi Arabia found that more than half the respondents use at least one AI tool.

AI 111
article thumbnail

How to Modernize Manufacturing Without Losing Control

Speaker: Andrew Skoog, Founder of MachinistX & President of Hexis Representatives

Manufacturing is evolving, and the right technology can empower—not replace—your workforce. Smart automation and AI-driven software are revolutionizing decision-making, optimizing processes, and improving efficiency. But how do you implement these tools with confidence and ensure they complement human expertise rather than override it? Join industry expert Andrew Skoog as he explores how manufacturers can leverage automation to enhance operations, streamline workflows, and make smarter, data-dri

article thumbnail

Phonetically-Augmented Discriminative Rescoring for Voice Search Error Correction

Machine Learning Research at Apple

End-to-end (E2E) Automatic Speech Recognition (ASR) models are trained using paired audio-text samples that are expensive to obtain, since high-quality ground-truth data requires human annotators. Voice search applications, such as digital media players, leverage ASR to allow users to search by voice as opposed to an on-screen keyboard. However, recent or infrequent movie titles may not be sufficiently represented in the E2E ASR system's training data, and hence, may suffer poor recognition.

130
130
article thumbnail

5 questions for Amir Husain

Flipboard

With help from Aaron Mak Hello, and welcome to our next installment of the Future in Five Questions. This week, we interviewed Amir Husain.

article thumbnail

INRFlow: Flow Matching for INRs in Ambient Space

Machine Learning Research at Apple

Flow matching models have emerged as a powerful method for generative modeling on domains like images or videos, and even on irregular or unstructured data like 3D point clouds or even protein structures. These models are commonly trained in two stages: first, a data compressor is trained, and in a subsequent training stage a flow matching generative model is trained in the latent space of the data compressor.

130
130
article thumbnail

Study Finds Heavy ChatGPT Usage Affects Brain Function Adversely; Internet Says ‘We Are Cooked’

Flipboard

The Massachusetts Institute of Technology (MIT) in its latest paper entitled “Your Brain on ChatGPT: Accumulation of Cognitive Debt when Using an AI Assistant for Essay Writing Task.” Authored by Nataliya Kos’myna et al., the paper explores the impact of OpenAI’s AI tool on the human brain.

AI 121
article thumbnail

Automation, Evolved: Your New Playbook for Smarter Knowledge Work

Speaker: Frank Taliano

Documents are the backbone of enterprise operations, but they are also a common source of inefficiency. From buried insights to manual handoffs, document-based workflows can quietly stall decision-making and drain resources. For large, complex organizations, legacy systems and siloed processes create friction that AI is uniquely positioned to resolve.