Thu.Mar 13, 2025

article thumbnail

@HPCpodcast: Dr. Ian Cutress on the State of Advanced Chips, the GPU Landscape and AI Compute, Global Chip Manufacturing and GTC Expectations

insideBIGDATA

[link] Just before GTC (and for the 100th episode of the @HPCpodcast and this one sponsored by liquid cooling company CoolIT), we welcome special guest and high-powered chip industry analyst Dr.

AI 329
article thumbnail

Exploring Prediction Targets in Masked Pre-Training for Speech Foundation Models

Machine Learning Research at Apple

Speech foundation models, such as HuBERT and its variants, are pre-trained on large amounts of unlabeled speech data and then used for a range of downstream tasks. These models use a masked prediction objective, where the model learns to predict information about masked input segments from the unmasked context. The choice of prediction targets in this framework impacts their performance on downstream tasks.

178
178
professionals

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

article thumbnail

Palantir and Databricks Announce AI Product Partnership

insideBIGDATA

SAN FRANCISCO,March 13, 2025 —DatabricksandPalantir Technologies Inc.(NASDAQ:PLTR), provider of enterprise operating systems, today announced a strategic product partnership that combines Palantir’s AI operating system and Databricks’ platform for AI, data warehousing and data engineering.

article thumbnail

The Hundred-Page Language Models Book: A Great Technical Intro to LLMs

KDnuggets

The Hundred-Page Language Models Book is the LLM book you shouldn't miss.

333
333
article thumbnail

What’s New in Apache Airflow® 3.0—And How Will It Reshape Your Data Workflows?

Speaker: Tamara Fingerlin, Developer Advocate

Apache Airflow® 3.0, the most anticipated Airflow release yet, officially launched this April. As the de facto standard for data orchestration, Airflow is trusted by over 77,000 organizations to power everything from advanced analytics to production AI and MLOps. With the 3.0 release, the top-requested features from the community were delivered, including a revamped UI for easier navigation, stronger security, and greater flexibility to run tasks anywhere at any time.

article thumbnail

AI is coming for the laptop class

Flipboard

Enjoy the laptop lifestyle while it lasts, folks. | Smith Collection/Gado/Getty Images My entire job takes place on my laptop. I write stories like this in Google Docs on my laptop. I coordinate with my editor in Slack on my laptop. I reach out to sources with Gmail and then interview them over Zoom, on my laptop. This isnt true of all journalists some go to war zones but its true of many of us, and for accountants, tax preparers, software engineers, and many more workers, maybe over one in 10

AI 178
article thumbnail

Google’s Gemma 3: Features, Benchmarks, Performance and Implementation

Analytics Vidhya

Googles commitment to making AI accessible leaps forward with Gemma 3, the latest addition to the Gemma family of open models. After an impressive first yearmarked by over 100 million downloads and more than 60,000 community-created variantsthe Gemmaverse continues to expand. With Gemma 3, developers gain access to state-of-the-art, lightweight AI models that run efficiently […] The post Google’s Gemma 3: Features, Benchmarks, Performance and Implementation appeared first on Analytic

Analytics 201

More Trending

article thumbnail

In Praise of “Normal” Engineers

Hacker News

A version of this post originally appeared in Refactoring , a Substack offering advice for software engineers. Most of us have encountered a few software engineers who seem practically magician-like, a class apart from the rest of us in their ability to reason about complex mental models, leap to nonobvious yet elegant solutions, or emit waves of high-quality code at unreal velocity.

Database 180
article thumbnail

AI coding assistant refuses to write code, tells user to learn programming instead

Flipboard

On Saturday, a developer using Cursor AI for a racing game project hit an unexpected roadblock when the programming assistant abruptly refused to continue generating code, instead offering some unsolicited career advice. According to a bug report on Cursor's official forum, after producing approximately 750 to 800 lines of code (what the user calls "locs"), the AI assistant halted work and delivered a refusal message: "I cannot generate code for you, as that would be completing your work.

AI 182
article thumbnail

Introducing Serverless Batch Inference

databricks

Generative AI is transforming how organizations interact with their data, and batch LLM processing has quickly become one of Databricks' most popular use cases. Last.

AI 251
article thumbnail

Anthropic’s CEO wonders if future AI should have option to quit “unpleasant” tasks

Flipboard

Anthropic CEO Dario Amodei raised a few eyebrows on Monday after suggesting that advanced AI models might someday be provided with the ability to push a "button" to quit tasks they might find unpleasant. Amodei made the provocative remarks during an interview at the Council on Foreign Relations, acknowledging that the idea "sounds crazy." "So this isthis is another one of those topics thats going to make me sound completely insane," Amodei said during the interview.

article thumbnail

Agent Tooling: Connecting AI to Your Tools, Systems & Data

Speaker: Alex Salazar, CEO & Co-Founder @ Arcade | Nate Barbettini, Founding Engineer @ Arcade | Tony Karrer, Founder & CTO @ Aggregage

There’s a lot of noise surrounding the ability of AI agents to connect to your tools, systems and data. But building an AI application into a reliable, secure workflow agent isn’t as simple as plugging in an API. As an engineering leader, it can be challenging to make sense of this evolving landscape, but agent tooling provides such high value that it’s critical we figure out how to move forward.

article thumbnail

Middle Layers Excel: New Research Challenges Final-Layer Focus in Language Models

NYU Center for Data Science

The intermediate layers of large language models (LLMs) contain surprisingly rich representations that often outperform the final layer on downstream tasks, according to new research from CDS Research Scientist Ravid Shwartz-Ziv , CDS Professor Yann LeCun , and their collaborators. Their paper, Layer by Layer: Uncovering Hidden Representations in Language Models , led by Oscar Skean and Md Rifat Arefin , reveals that the conventional wisdom of using final-layer outputs for embeddings may be sub

article thumbnail

LaTeXify in Python: No Need to Write LaTeX Equations Manually

Analytics Vidhya

In mathematical computing and scientific programming, clear and precise representation of functions is essential. While LaTeX is widely used for formatting mathematical expressions, manually writing equations can be time-consuming. The latexify-py library offers a solution by automatically converting Python functions into LaTeX-formatted expressions.

Python 140
article thumbnail

Amazon, Google and Meta commit to tripling nuclear energy by 2050

Dataconomy

Amazon, Google, and Meta Platforms have announced their support for tripling nuclear energy capacity worldwide by 2050, signing a pledge at the CERAWeek energy conference in Houston, Texas, on March 12, 2025. This pledge, nonbinding in nature, follows an earlier commitment adopted in December 2023 by over 20 countries, including the United States, at the U.N.

article thumbnail

Benchmarking customized models on Amazon Bedrock using LLMPerf and LiteLLM

AWS Machine Learning Blog

Open foundation models (FMs) allow organizations to build customized AI applications by fine-tuning for their specific domains or tasks, while retaining control over costs and deployments. However, deployment can be a significant portion of the effort, often requiring 30% of project time because engineers must carefully optimize instance types and configure serving parameters through careful testing.

AWS 117
article thumbnail

How to Modernize Manufacturing Without Losing Control

Speaker: Andrew Skoog, Founder of MachinistX & President of Hexis Representatives

Manufacturing is evolving, and the right technology can empower—not replace—your workforce. Smart automation and AI-driven software are revolutionizing decision-making, optimizing processes, and improving efficiency. But how do you implement these tools with confidence and ensure they complement human expertise rather than override it? Join industry expert Andrew Skoog as he explores how manufacturers can leverage automation to enhance operations, streamline workflows, and make smarter, data-dri

article thumbnail

Conversations with Trailblazing Women: Sarah Armstrong-Smith, Chief Security Advisor of Microsoft

Dataconomy

Sarah Armstrong-Smith has been Microsofts Chief Security Advisor for Europe since 2020. In this role, she offers strategic guidance to major customers, helping them navigate cloud adoption, digital transformation, and the ever-evolving landscape of cyber threats. We spoke with Sarah about her career journey, her perspectives on diversity in technology, and her advice for those looking to break into the tech industry.

91
article thumbnail

Revolutionizing customer service: MaestroQA’s integration with Amazon Bedrock for actionable insight

AWS Machine Learning Blog

This post is cowritten with Harrison Hunter is the CTO and co-founder of MaestroQA. MaestroQA augments call center operations by empowering the quality assurance (QA) process and customer feedback analysis to increase customer satisfaction and drive operational efficiencies. They assist with operations such as QA reporting, coaching, workflow automations, and root cause analysis.

AWS 115
article thumbnail

D-Wave claims quantum supremacy: Experts are not convinced

Dataconomy

D-Wave Quantum announced on Wednesday that it has achieved “quantum supremacy” using a practical problem, a first in the industry. The company’s Advantage2 system solved a simulation problem in 20 minutes, a task that would take one of the world’s most powerful supercomputers over 1 million years. A peer-reviewed paper detailing these findings was published in the journal Science.

article thumbnail

Tesla Cybertruck deliveries are on hold as trims are flying off the ‘bulletproof’ truck

Hacker News

According to Tesla delivery agents, Cybertruck deliveries are on hold. Theres a containment hold as many owners are reporting trims flying off the supposedly bulletproof electric truck.

181
181
article thumbnail

Automation, Evolved: Your New Playbook for Smarter Knowledge Work

Speaker: Frank Taliano

Documents are the backbone of enterprise operations, but they are also a common source of inefficiency. From buried insights to manual handoffs, document-based workflows can quietly stall decision-making and drain resources. For large, complex organizations, legacy systems and siloed processes create friction that AI is uniquely positioned to resolve.

article thumbnail

Elon Musk wants to use AI to run US gov’t, but experts say ‘very bad’ idea

Flipboard

Govt use of AI can go wrong in many ways, experts say, including lack of transparency on how tools are trained. Is Elon Musk planning to use artificial intelligence to run the US government? That seems to be his plan, but experts say it is a very bad idea.

article thumbnail

Huawei targeted in new European Parliament corruption probe

Hacker News

Chinese tech giant Huawei is at the centre of a new corruption case in Europes capital. On Thursday, Belgian police raided the homes of its lobbyists, Follow the Money and its media partners Le Soir and Knack can reveal.

181
181
article thumbnail

10 startups to watch from Y Combinator’s W25 Demo Day

Flipboard

One of Silicon Valleys most storied startup accelerators, Y Combinator, held its Winter 2025 Demo Day on Wednesday, showcasing what its latest batch of 160 startups are cooking up.

AI 180
article thumbnail

IO Devices and Latency

Hacker News

Take an interactive journey through the history of IO devices, and learn how IO device latency affects performance.

180
180
article thumbnail

How to Achieve High-Accuracy Results When Using LLMs

Speaker: Ben Epstein, Stealth Founder & CTO | Tony Karrer, Founder & CTO, Aggregage

When tasked with building a fundamentally new product line with deeper insights than previously achievable for a high-value client, Ben Epstein and his team faced a significant challenge: how to harness LLMs to produce consistent, high-accuracy outputs at scale. In this new session, Ben will share how he and his team engineered a system (based on proven software engineering approaches) that employs reproducible test variations (via temperature 0 and fixed seeds), and enables non-LLM evaluation m

article thumbnail

Robots, drones and AI: How next-generation tech is changing the global supply chain

Flipboard

Logistics companies globally are leaning into emerging technologies such as robots, drones and AI to speed up the supply chain and serve customers faster.

AI 179
article thumbnail

Athena landed in a dark crater where the temperature was minus 280° F

Hacker News

The Athena spacecraft was not exactly flying blind as it approached the lunar surface one week ago. The software on board did a credible job of recognizing nearby craters, even with elongated shadows over the terrain. However, the lander's altimeter had failed. So while Athena knew where it was relative to the surface of the Moon, the lander did not know how far it was above the surface.

179
179
article thumbnail

Why AI can’t (yet) decide your cancer treatment

Dataconomy

Artificial intelligence is making its way into oncology, but before AI-driven Clinical Decision Support Systems (CDSS) can transform treatment planning, theres a fundamental problem to solve data readiness. A new study by researchers from University Hospital Mnster and the German Research Center for Artificial Intelligence (DFKI) examines whether existing medical data is good enough for AI to make meaningful treatment recommendations for skin cancer patients.

AI 103
article thumbnail

Did the Particle Go Through the Two Slits, or Did the Wave Function?

Hacker News

In the quantum double-slit experiment, did the particle go through the slits or did the wave function?

178
178
article thumbnail

Maximizing Profit and Productivity: The New Era of AI-Powered Accounting

Speaker: Yohan Lobo and Dennis Street

In the accounting world, staying ahead means embracing the tools that allow you to work smarter, not harder. Outdated processes and disconnected systems can hold your organization back, but the right technologies can help you streamline operations, boost productivity, and improve client delivery. Dive into the strategies and innovations transforming accounting practices.

article thumbnail

Sesame, the startup behind the viral virtual assistant Maya, open-sources its base AI model

Flipboard

Sesame, the AI company behind the impressively realistic voice assistant Maya, has released the base AI model powering Maya, as it recently promised. The model, which is 1 billion parameters in size (parameters referring to individual components of the model), is under an Apache 2.

AI 177
article thumbnail

The Lost Art of Logarithms

Hacker News

A book about the history, use, and importance of logarithms

178
178
article thumbnail

4 Ways Marketers Can Use AI Agents Now

Flipboard

The so-called AI arms race is well underway, and its fascinating to watch: Tech giants from Meta to Amazon to OpenAI are all releasing their own powerful contenders in the agentic AI space, and its happening at what feels like light-speed.

AI 176
article thumbnail

Optimize hosting DeepSeek-R1 distilled models with Hugging Face TGI on Amazon SageMaker AI

AWS Machine Learning Blog

DeepSeek-R1 , developed by AI startup DeepSeek AI , is an advanced large language model (LLM) distinguished by its innovative, multi-stage training process. Instead of relying solely on traditional pre-training and fine-tuning, DeepSeek-R1 integrates reinforcement learning to achieve more refined outputs. The model employs a chain-of-thought (CoT) approach that systematically breaks down complex queries into clear, logical steps.

AI 108
article thumbnail

The 2nd Generation of Innovation Management: A Survival Guide

Speaker: Chris Townsend, VP of Product Marketing, Wellspring

Over the past decade, companies have embraced innovation with enthusiasm—Chief Innovation Officers have been hired, and in-house incubators, accelerators, and co-creation labs have been launched. CEOs have spoken with passion about “making everyone an innovator” and the need “to disrupt our own business.” But after years of experimentation, senior leaders are asking: Is this still just an experiment, or are we in it for the long haul?