Wed.Apr 09, 2025

Controlling Language and Diffusion Models by Transporting Activations

Machine Learning Research at Apple

Large generative models are becoming increasingly capable and more widely deployed to power production applications, but getting these models to produce exactly what's desired can still be challenging. Fine-grained control over these models' outputs is important to meet user expectations and to mitigate potential misuses, ensuring the models' reliability and safety.

Copilot Arena: A Platform for Code

ML @ CMU

Figure 1. Copilot Arena is a VS Code extension that collects human preferences of code directly from developers. As model capabilities improve, large language models (LLMs) are increasingly integrated into user environments and workflows. In particular, software developers code with LLM-powered tools in integrated development environments such as VS Code, IntelliJ, or Eclipse.

Do LLMs Know Internally When They Follow Instructions?

Machine Learning Research at Apple

Instruction-following is crucial for building AI agents with large language models (LLMs), as these models must adhere strictly to user-provided constraints and guidelines. However, LLMs often fail to follow even simple and clear instructions. To improve instruction-following behavior and prevent undesirable outputs, a deeper understanding of how LLMs' internal states relate to these outcomes is required.

Multi-LLM routing strategies for generative AI applications on AWS

Flipboard

Organizations are increasingly using multiple large language models (LLMs) when building generative AI applications. Although an individual LLM can be highly capable, it might not optimally address a wide range of use cases or meet diverse performance requirements. The multi-LLM approach enables organizations to effectively choose the right model for each task, adapt to different domains, and optimize for specific cost, latency, or quality needs.

What’s New in Apache Airflow® 3.0—And How Will It Reshape Your Data Workflows?

Speaker: Tamara Fingerlin, Developer Advocate

Apache Airflow® 3.0, the most anticipated Airflow release yet, officially launched this April. As the de facto standard for data orchestration, Airflow is trusted by over 77,000 organizations to power everything from advanced analytics to production AI and MLOps. With the 3.0 release, the top-requested features from the community were delivered, including a revamped UI for easier navigation, stronger security, and greater flexibility to run tasks anywhere at any time.

Data Cleaning with Bash: A Handbook for Developers

KDnuggets

Tired of dragging messy data through bloated tools? This handbook shows how to clean and transform datasets with Bash.

MicroNN: An On-device Disk Resident Updatable Vector Database

Machine Learning Research at Apple

Nearest neighbour search over dense vector collections has important applications in information retrieval, retrieval augmented generation (RAG), and content ranking. Performing efficient search over large vector collections is a well studied problem with many existing approaches and open source implementations. However, most state-of-the-art systems are generally targeted towards scenarios using large servers with an abundance of memory, static vector collections that are not updatable, and nea

More Trending

Google introduces Firebase Studio, an end-to-end platform that builds custom apps in-browser, in minutes

Flipboard

Devs and non-devs can use the cloud-based, Gemini-powered platform to build, launch, iterate and monitor apps, APIs, backends and frontends.

Atlassian Fuses Collaboration Toolsets With Enterprise Strategy Collection

Adrian Bridgwater for Forbes

The multitasking balance may be levelling out if we embrace the new breed of artificial intelligence-enriched tools now available.

DeepCoder-14B: The Open-Source Competition to o3-mini and o1

Analytics Vidhya

In a significant development for the AI community, Agentica and Together AI have released an open-source AI coding model named DeepCoder-14B. Offering code generation capabilities on par with closed-source competitors like OpenAI’s o3-mini and o1, DeepCoder-14B positions itself as a formidable open-source alternative to proprietary models. Moreover, this new model ensures full transparency and developer […]

New Learning Pathway for Data Architects: Upskill on Data Platforms, AI and Governance

databricks

Today, we are announcing the Data Architect learning pathway, a dedicated learning track that equips data architects with the required resources and skills for success.

Agent Tooling: Connecting AI to Your Tools, Systems & Data

Speaker: Alex Salazar, CEO & Co-Founder @ Arcade | Nate Barbettini, Founding Engineer @ Arcade | Tony Karrer, Founder & CTO @ Aggregage

There’s a lot of noise surrounding the ability of AI agents to connect to your tools, systems and data. But building an AI application into a reliable, secure workflow agent isn’t as simple as plugging in an API. As an engineering leader, it can be challenging to make sense of this evolving landscape, but agent tooling provides such high value that it’s critical we figure out how to move forward.

Implement human-in-the-loop confirmation with Amazon Bedrock Agents

AWS Machine Learning Blog

Agents are revolutionizing how businesses automate complex workflows and decision-making processes. Amazon Bedrock Agents helps you accelerate generative AI application development by orchestrating multi-step tasks. Agents use the reasoning capability of foundation models (FMs) to break down user-requested tasks into multiple steps. In addition, they use the developer-provided instruction to create an orchestration plan and then carry out the plan by invoking company APIs and accessing knowledge

Meet Nova Sonic, Amazon's new AI voice model

Flipboard

AI companies have been working on voice models for a while now, but it seems things really ramped up after OpenAI unveiled ChatGPT Voice Mode. Now, Amazon has just introduced its new "foundation" AI voice model called Nova Sonic. And it really makes Alexa sound like she's living way in the past. According to Amazon, Nova Sonic "unifies speech understanding and speech generation into a single model, to enable more human-like voice conversations in AI applications."

Accessing Local LLMs Remotely Using TailScale: A Step-by-Step Guide

KDnuggets

Explore how you can remotely access local LLMs.

Monty Python and the Holy Grail became a comedy legend

Hacker News

Fifty years after Monty Python and the Holy Grail redefined comedy, stars Michael Palin and Terry Gilliam look back on the freedoms and limitations that shaped the film.

How to Modernize Manufacturing Without Losing Control

Speaker: Andrew Skoog, Founder of MachinistX & President of Hexis Representatives

Manufacturing is evolving, and the right technology can empower—not replace—your workforce. Smart automation and AI-driven software are revolutionizing decision-making, optimizing processes, and improving efficiency. But how do you implement these tools with confidence and ensure they complement human expertise rather than override it? Join industry expert Andrew Skoog as he explores how manufacturers can leverage automation to enhance operations, streamline workflows, and make smarter, data-dri

Anthropic rolls out a $200-per-month Claude subscription

Flipboard

Anthropic announced on Wednesday that it's launching a new, very expensive subscription plan for its AI chatbot Claude: Max.

National Weather Service no longer translating products for non-English speakers

Hacker News

The National Weather Service is no longer providing translations of its products after its contract with an artificial intelligence company was allowed to lapse.

Europe unveils plan to become 'AI continent' with simpler rules, more infrastructure

Flipboard

The EU has faced criticisms that its rules on everything from AI to taxation hinder innovation and make it harder for startups to operate across the region.

Firebase Studio

Hacker News

Firebase Studio is an entirely web-based workspace for full-stack application development, complete with the latest generative AI from Gemini, and full-fidelity app previews, powered by cloud emulators.

Automation, Evolved: Your New Playbook for Smarter Knowledge Work

Speaker: Frank Taliano

Documents are the backbone of enterprise operations, but they are also a common source of inefficiency. From buried insights to manual handoffs, document-based workflows can quietly stall decision-making and drain resources. For large, complex organizations, legacy systems and siloed processes create friction that AI is uniquely positioned to resolve.

Humanoid robot breakdances its way into history

Flipboard

Boston Dynamics is at it again, wowing us with some seriously cool robotic moves. Their latest video of Atlas, their bipedal robot, has blown up online with its mind-blowing human-like movements, including breakdancing.

Fake job seekers are flooding US companies that are hiring for remote positions

Hacker News

Companies have long faced external attacks from hackers. Now, thanks to generative AI, another threat has emerged: Employees who aren't who they say they are.

Why AI Demands a New Breed of Leaders

Flipboard

Artificial intelligence is fundamentally transforming how organizations operate, but this transformation extends far beyond technical implementation.

The AI magic behind Sphere's upcoming 'The Wizard of Oz' experience

Hacker News

Learn how Google DeepMind and Google Cloud are helping to bring a cinema classic to larger-than-life in Las Vegas.

How to Achieve High-Accuracy Results When Using LLMs

Speaker: Ben Epstein, Stealth Founder & CTO | Tony Karrer, Founder & CTO, Aggregage

When tasked with building a fundamentally new product line with deeper insights than previously achievable for a high-value client, Ben Epstein and his team faced a significant challenge: how to harness LLMs to produce consistent, high-accuracy outputs at scale. In this new session, Ben will share how he and his team engineered a system (based on proven software engineering approaches) that employs reproducible test variations (via temperature 0 and fixed seeds), and enables non-LLM evaluation m

Microsoft says it's 'slowing or pausing' some AI data center projects, including $1B plan for Ohio

Flipboard

Microsoft said it is slowing or pausing some of its data center construction, including a $1 billion project in Ohio, the latest sign that the demand for artificial intelligence technology that drove a massive infrastructure expansion might not need quite as many powerful computers as

Ironwood: The first Google TPU for the age of inference

Hacker News

We're introducing Ironwood, our seventh-generation Tensor Processing Unit (TPU) designed to power the age of generative AI inference.

OpenAI says Musk has run 'unlawful campaign of harassment' against company in lawsuit

Flipboard

OpenAI said in a lawsuit that Elon Musk made a "sham" attempt to buy the company, and asked a federal district court to stop him from further attacks.

France's new high-speed train has Americans asking: Why can't we have that?

Hacker News

Here's why the U.S. is behind on building high-speed rail, and what could create momentum to catch up.

Maximizing Profit and Productivity: The New Era of AI-Powered Accounting

Speaker: Yohan Lobo and Dennis Street

In the accounting world, staying ahead means embracing the tools that allow you to work smarter, not harder. Outdated processes and disconnected systems can hold your organization back, but the right technologies can help you streamline operations, boost productivity, and improve client delivery. Dive into the strategies and innovations transforming accounting practices.

Google unveils Ironwood, its most powerful AI processor yet

Flipboard

Ironwood will be available in configurations of up to 9,216 liquid-cooled chips.

Show HN: Aqua Voice 2 – Fast Voice Input for Mac and Windows

Hacker News

Fast speech-to-text for Mac and Windows. Responses in as little as 450ms. Create prompts, notes, messages, and docs with just your voice.

ChatGPT Can Turn You Into a Toy Action Figure

Flipboard

Last month, ChatGPT upgraded its AI image generator to a powerful new model which has generated a lot of buzz.

DOJ will no longer prosecute cryptocurrency fraud

Hacker News

Ongoing investigations will be dropped.

The 2nd Generation of Innovation Management: A Survival Guide

Speaker: Chris Townsend, VP of Product Marketing, Wellspring

Over the past decade, companies have embraced innovation with enthusiasm—Chief Innovation Officers have been hired, and in-house incubators, accelerators, and co-creation labs have been launched. CEOs have spoken with passion about “making everyone an innovator” and the need “to disrupt our own business.” But after years of experimentation, senior leaders are asking: Is this still just an experiment, or are we in it for the long haul?