Tue.Jan 28, 2025

article thumbnail

Parameters vs FLOPs: Scaling Laws for Optimal Sparsity for Mixture-of-Experts Language Models

Machine Learning Research at Apple

Scaling the capacity of language models has consistently proven to be a reliable approach for improving performance and unlocking new capabilities. Capacity can be primarily defined by two dimensions: the number of model parameters and the compute per example. While scaling typically involves increasing both, the precise interplay between these factors and their combined contribution to overall capacity remains not fully understood.

242
242
article thumbnail

Don’t Manage Your Python Environments, Just Use Docker Containers

KDnuggets

Python environment management can sometimes give you that awful feeling in the pit of your stomach. So don't do it: just use Docker containers.

Python 347
professionals

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

article thumbnail

Bluwhale Secures $100M for Web3 Layer across L1 and L2 Blockchains 

insideBIGDATA

SAN FRANCISCO Jan. 28, 2025Bluwhale, an AI Web3 start-up, today announces that it has topped its funding to $100 million. This includes its Seed/Series A round as well as a $75 million token purchase commitment, grants, and node sale proceeds.

AI 317
article thumbnail

The Role of AI in Shaping the Future of Work

KDnuggets

Rather than fearing AI, we should see it as a tool that complements human skills, helping professionals focus on high-value work and enhancing job roles.

AI 333
article thumbnail

What’s New in Apache Airflow® 3.0—And How Will It Reshape Your Data Workflows?

Speaker: Tamara Fingerlin, Developer Advocate

Apache Airflow® 3.0, the most anticipated Airflow release yet, officially launched this April. As the de facto standard for data orchestration, Airflow is trusted by over 77,000 organizations to power everything from advanced analytics to production AI and MLOps. With the 3.0 release, the top-requested features from the community were delivered, including a revamped UI for easier navigation, stronger security, and greater flexibility to run tasks anywhere at any time.

article thumbnail

Provable Uncertainty Decomposition via Higher-Order Calibration

Machine Learning Research at Apple

We give a principled method for decomposing the predictive uncertainty of a model into aleatoric and epistemic components with explicit semantics relating them to the real-world data distribution. While many works in the literature have proposed such decompositions, they lack the type of formal guarantees we provide. Our method is based on the new notion of higher-order calibration, which generalizes ordinary calibration to the setting of higher-order predictors that predict mixtures over label

130
130
article thumbnail

Empowering Personalized Banking Experiences

databricks

At Zafin , our mission is to help banks modernize their core infrastructure to deliver exceptional, personalized experiences to their customers. To determine.

278
278

More Trending

article thumbnail

Zencoder: Coding Assistants Can Make Us One With Every System

Adrian Bridgwater for Forbes

Coding assistants are reshaping data engineering; these advanced tools can now connect directly to databases and understand database schemas and data types.

Database 244
article thumbnail

Quibim: $50M Series A for Precision Medicine with AI-Powered Imaging Biomarkers

insideBIGDATA

Valencia, Spain and New York, January 28, 2025 Quibim, a healthtech company focused on the use of imaging biomarkers for precision medicine, announced today the close of its $50 million Series A financing. The company said it has experienced growth in the number of patients analyzed by its products over the past year.

AI 221
article thumbnail

Creating Powerful Ensemble Models with PyCaret

Machine Learning Mastery

In this article, we will explore how to create ensemble models with PyCaret.

208
208
article thumbnail

Microsoft Probing If DeepSeek-Linked Group Improperly Obtained OpenAI Data

Hacker News

Microsoft Corp. and OpenAI are investigating whether data output from OpenAIs technology was obtained in an unauthorized manner by a group linked to Chinese artificial intelligence startup DeepSeek, according to people familiar with the matter.

article thumbnail

Agent Tooling: Connecting AI to Your Tools, Systems & Data

Speaker: Alex Salazar, CEO & Co-Founder @ Arcade | Nate Barbettini, Founding Engineer @ Arcade | Tony Karrer, Founder & CTO @ Aggregage

There’s a lot of noise surrounding the ability of AI agents to connect to your tools, systems and data. But building an AI application into a reliable, secure workflow agent isn’t as simple as plugging in an API. As an engineering leader, it can be challenging to make sense of this evolving landscape, but agent tooling provides such high value that it’s critical we figure out how to move forward.

article thumbnail

DeepSeek's AI breakthrough bypasses Nvidia's industry-standard CUDA, uses assembly-like PTX programming instead

Flipboard

DeepSeek used PTX ISA to fine-tune its AI model training.

AI 182
article thumbnail

Black Swan's Taleb Says Nvidia Rout Is Hint of What's Coming

Hacker News

The Black Swan author Nassim Taleb is warning that Mondays brutal selloff in Nvidia Corp. is just a taste of whats in store for investors who blindly piled into Wall Streets AI-driven stock rally.

AI 182
article thumbnail

How China’s DeepSeek Outsmarted America

Flipboard

AI startup developed a top system by relying on inexperienced engineers and a loophole in U.S. export controls SINGAPORETake a team of young Chinese engineers, hired by a boss with disdain for experience.

AI 181
article thumbnail

US pauses all Federal aid and grants

Hacker News

Democrats warn the move could have "devastating consequences" on programmes that people across the US rely on.

181
181
article thumbnail

How to Modernize Manufacturing Without Losing Control

Speaker: Andrew Skoog, Founder of MachinistX & President of Hexis Representatives

Manufacturing is evolving, and the right technology can empower—not replace—your workforce. Smart automation and AI-driven software are revolutionizing decision-making, optimizing processes, and improving efficiency. But how do you implement these tools with confidence and ensure they complement human expertise rather than override it? Join industry expert Andrew Skoog as he explores how manufacturers can leverage automation to enhance operations, streamline workflows, and make smarter, data-dri

article thumbnail

AI haters build tarpits to trap and trick AI scrapers that ignore robots.txt

Flipboard

Attackers explain how an anti-spam defense became an AI weapon.

AI 181
article thumbnail

Departing the New York Times

Hacker News

I left to stay true to my byline

181
181
article thumbnail

DeepSeek Chief’s Journey From Math Geek to Global Disruptor

Flipboard

Chinese engineer Liang Wenfeng built the AI company after founding a successful hedge fund Some call him Chinas Sam Altman. Others compare him to Jim Simons, the pioneer of quantitative investing.

AI 181
article thumbnail

It's official: Research has found that libraries make everything better

Hacker News

Science has backed up what many of us have long been saying: the library rocks. A study from the New York Public Library surveyed 1,974 users on how the library makes them feel and how it affects their lives, and the results are overwhelmingly positive.

181
181
article thumbnail

Automation, Evolved: Your New Playbook for Smarter Knowledge Work

Speaker: Frank Taliano

Documents are the backbone of enterprise operations, but they are also a common source of inefficiency. From buried insights to manual handoffs, document-based workflows can quietly stall decision-making and drain resources. For large, complex organizations, legacy systems and siloed processes create friction that AI is uniquely positioned to resolve.

article thumbnail

New glowing molecule, invented by AI, would have taken 500 million years to evolve in nature, scientists say

Flipboard

An artificial intelligence model has created a new protein that researchers say would have taken 500 million years to evolve in nature — if nature were capable of producing such a thing.

article thumbnail

Parkinsons patient "feels cured" with new adaptive deep brain stimulation device

Hacker News

Kevin Hill, who has a computer in his chest linked to a brain implant, says he has his life back.

181
181
article thumbnail

Apple researchers reveal the secret sauce behind DeepSeek AI

Flipboard

The AI model that shook the world is part of a broad trend to squeeze more out of chips using what's called sparsity.

AI 181
article thumbnail

Almost one in 10 people use the same four-digit PIN

Hacker News

The ABC analysed 29 million stolen codes to help you avoid using an insecure one.

181
181
article thumbnail

How to Achieve High-Accuracy Results When Using LLMs

Speaker: Ben Epstein, Stealth Founder & CTO | Tony Karrer, Founder & CTO, Aggregage

When tasked with building a fundamentally new product line with deeper insights than previously achievable for a high-value client, Ben Epstein and his team faced a significant challenge: how to harness LLMs to produce consistent, high-accuracy outputs at scale. In this new session, Ben will share how he and his team engineered a system (based on proven software engineering approaches) that employs reproducible test variations (via temperature 0 and fixed seeds), and enables non-LLM evaluation m

article thumbnail

Why DeepSeek Could Change What Silicon Valley Believes About A.I.

Flipboard

A new A.I. model, released by a scrappy Chinese upstart, has rocked Silicon Valley and upended several fundamental assumptions about A.I. progress.

article thumbnail

Berkeley Researchers Replicate DeepSeek R1's Core Tech for Just $30: A Small Mod

Hacker News

A Berkeley AI Research team led by PhD candidate Jiayi Pan has achieved what many thought impossible: reproducing DeepSeek R1-Zero's key technologies for less than the cost of a dinner for two.

AI 181
article thumbnail

Why everyone is freaking out about DeepSeek

Flipboard

It took about a month for the finance world to start freaking out about DeepSeek, but when it did, it took more than half a trillion dollars or one entire Stargate off Nvidias market cap. It wasnt just Nvidia, either: Tesla, Google, Amazon, and Microsoft tanked.

AI 181
article thumbnail

Boom's XB-1 becomes first civil aircraft to go supersonic

Hacker News

Boom Supersonic's XB-1 demonstrator plane just went supersonic in the skies over California's Mojave desert, making it the first civil aircraft to break

181
181
article thumbnail

Maximizing Profit and Productivity: The New Era of AI-Powered Accounting

Speaker: Yohan Lobo and Dennis Street

In the accounting world, staying ahead means embracing the tools that allow you to work smarter, not harder. Outdated processes and disconnected systems can hold your organization back, but the right technologies can help you streamline operations, boost productivity, and improve client delivery. Dive into the strategies and innovations transforming accounting practices.

article thumbnail

Chinese-made DeepSeek AI model records extensive online user data, stores it in China-based servers

Flipboard

From personal information to hardware specifications to even how you type, DeepSeek says it’s collecting swathes of data from its online users.

AI 181
article thumbnail

Confusion, uncertainty in industry as Army contracts seemingly halted

Hacker News

The fear among industry now is that this move is just the first in what would amount to a Pentagon-wide halt on new awards, for an indefinite period of time.

181
181
article thumbnail

California’s AG Tells AI Companies Practically Everything They’re Doing Might Be Illegal

Flipboard

According to a recent legal memo, Silicon Valley's hottest business may be entirely based around criminal activity.

AI 181
article thumbnail

Has DeepSeek improved the Transformer architecture

Hacker News

This Gradient Updates issue goes over the major changes that went into DeepSeeks most recent model.

180
180
article thumbnail

The 2nd Generation of Innovation Management: A Survival Guide

Speaker: Chris Townsend, VP of Product Marketing, Wellspring

Over the past decade, companies have embraced innovation with enthusiasm—Chief Innovation Officers have been hired, and in-house incubators, accelerators, and co-creation labs have been launched. CEOs have spoken with passion about “making everyone an innovator” and the need “to disrupt our own business.” But after years of experimentation, senior leaders are asking: Is this still just an experiment, or are we in it for the long haul?