Sat.Nov 02, 2024

article thumbnail

Jamba 1.5: Hybrid Mamba-Transformer Model for Advanced NLP

Analytics Vidhya

Jamba 1.5 is an instruction-tuned large language model that comes in two versions: Jamba 1.5 Large with 94 billion active parameters and Jamba 1.5 Mini with 12 billion active parameters. It combines the Mamba Structured State Space Model (SSM) with the traditional Transformer architecture. This model, developed by AI21 Labs, can process a 256K effective […] The post Jamba 1.5: Hybrid Mamba-Transformer Model for Advanced NLP appeared first on Analytics Vidhya.

Analytics 244
article thumbnail

Deploying Custom Detectron2 Models with a REST API: A Step-by-Step Guide.

Towards AI

Author(s): Gennaro Daniele Acciaro Originally published on Towards AI. An image generated using Midjourney In the life of a Machine Learning Engineer, training a model is only half the battle. Indeed, after obtaining a neural network that accurately predicts all the test data, it remains useless unless it’s made accessible to the world. Model deployment is the process of making a model accessible and usable in production environments, where it can generate predictions and provide real-time insig

professionals

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

article thumbnail

Spann: Highly-Efficient Billion-Scale Approximate Nearest Neighbor Search (2021)

Hacker News

The in-memory algorithms for approximate nearest neighbor search (ANNS) have achieved great success for fast high-recall search, but are extremely expensive when handling very large scale database. Thus, there is an increasing request for the hybrid ANNS solutions with small memory and inexpensive solid-state drive (SSD). In this paper, we present a simple but efficient memory-disk hybrid indexing and search system, named SPANN, that follows the inverted index methodology.

article thumbnail

OpenAI Launches ChatGPT Search

Towards AI

Last Updated on November 2, 2024 by Editorial Team Author(s): Get The Gist Originally published on Towards AI. Plus: Claude AI Gets Desktop App This member-only story is on us. Upgrade to access all of Medium. Welcome to Get The Gist, where every weekday we share an easy-to-read summary of the latest and greatest developments in AI — news, innovations, and trends — all delivered in under 5 minutes!

AI 102
article thumbnail

What’s New in Apache Airflow® 3.0—And How Will It Reshape Your Data Workflows?

Speaker: Tamara Fingerlin, Developer Advocate

Apache Airflow® 3.0, the most anticipated Airflow release yet, officially launched this April. As the de facto standard for data orchestration, Airflow is trusted by over 77,000 organizations to power everything from advanced analytics to production AI and MLOps. With the 3.0 release, the top-requested features from the community were delivered, including a revamped UI for easier navigation, stronger security, and greater flexibility to run tasks anywhere at any time.

article thumbnail

WEKA Introduces New WEKApod Appliances to Accelerate Enterprise AI Deployments

insideBIGDATA

WekaIO (WEKA), the AI-native data platform company, unveiled two new WEKApod™ data platform applianced: the WEKApod Nitro for large-scale enterprise AI deployments and the WEKApod Prime for smaller-scale AI deployments and multi-purpose high-performance data use cases.

AI 221
article thumbnail

Support Vector Machines Math Intuitions

Towards AI

Last Updated on November 3, 2024 by Editorial Team Author(s): Fernando Guzman Originally published on Towards AI. Support Vector Machines, or SVM, is a machine learning algorithm that, in its original form, is utilized for binary classification. The SVM model seeks to determine the optimal separation line between two classes, understood as the best margin between these classes, as demonstrated in the following example: SVM Example by OSCAR CONTRERAS CARRASCO As shown in the image, we have a sepa

More Trending

article thumbnail

Security flaws found in all Nvidia GeForce GPUs. Update drivers ASAP

Hacker News

If you have an Nvidia GeForce graphics card, you need to download the latest driver updates now.

181
181
article thumbnail

25 Simple Concepts We’re Tired of Explaining Again and Again

Flipboard

25 Simple Concepts We’re Tired of Explaining Again and Again

article thumbnail

Rivian's chief software officer says in-car buttons are 'an anomaly'

Hacker News

The trend of big touchscreens in cars has left many yearning for the not-so-distant days when most user interactions happened with physical buttons.

180
180
article thumbnail

Speed, scale and reliability: 25 years of Google datacenter networking evolution

Hacker News

Google networking leaders reflect on the milestones that led to Jupiter supporting 13 petabits per second bandwidth, and what comes next.

177
177
article thumbnail

Agent Tooling: Connecting AI to Your Tools, Systems & Data

Speaker: Alex Salazar, CEO & Co-Founder @ Arcade | Nate Barbettini, Founding Engineer @ Arcade | Tony Karrer, Founder & CTO @ Aggregage

There’s a lot of noise surrounding the ability of AI agents to connect to your tools, systems and data. But building an AI application into a reliable, secure workflow agent isn’t as simple as plugging in an API. As an engineering leader, it can be challenging to make sense of this evolving landscape, but agent tooling provides such high value that it’s critical we figure out how to move forward.

article thumbnail

Next Generation Out of Band Garbage Collection

Hacker News

In this post we look at the impact of a new feature of Ruby 3.4 on Shopify’s monolith.

173
173
article thumbnail

Neurotechnology boosts memory without surgery

Hacker News

EPFL researchers have combined virtual reality, non-invasive brain stimulation and advanced brain imaging techniques to improve spatial navigation in healthy participants. The study is a first step in addressing dementia in an aging population without medication or surgery.

161
161
article thumbnail

Solid-state batteries enter pilot production, costs expected to drastically drop

Hacker News

The latest findings from Taipei-based intelligence provider TrendForce show that all-solid-state battery production volumes could have GWh levels by 2027. The rapid expansion will lead to cell price declines, reaching CNY 0.6-0.7/Wh ($0,084-$0,098) level by 2035.

152
152
article thumbnail

Solving the Siberian Crater Mystery

Hacker News

Why permafrost in the tundra has begun to explode.

141
141
article thumbnail

How to Modernize Manufacturing Without Losing Control

Speaker: Andrew Skoog, Founder of MachinistX & President of Hexis Representatives

Manufacturing is evolving, and the right technology can empower—not replace—your workforce. Smart automation and AI-driven software are revolutionizing decision-making, optimizing processes, and improving efficiency. But how do you implement these tools with confidence and ensure they complement human expertise rather than override it? Join industry expert Andrew Skoog as he explores how manufacturers can leverage automation to enhance operations, streamline workflows, and make smarter, data-dri

article thumbnail

Hacking cars in JavaScript (Replay attacks in the browser with the HackRF)

Hacker News

Collection of side projects, conference talks and blog posts experimenting with frontend technologies and human-computer interaction

140
140
article thumbnail

Britain's postwar sugar craze confirms harms of sweet diets in early life

Hacker News

Comments

136
136
article thumbnail

The Language of Faces

Hacker News

How Culture, Psychology, and Social Context Shape Our Expressions

135
135
article thumbnail

Weird Lexical Syntax

Hacker News

Let's explore the most unusual lexical syntax of popular programming languages.

132
132
article thumbnail

Automation, Evolved: Your New Playbook for Smarter Knowledge Work

Speaker: Frank Taliano

Documents are the backbone of enterprise operations, but they are also a common source of inefficiency. From buried insights to manual handoffs, document-based workflows can quietly stall decision-making and drain resources. For large, complex organizations, legacy systems and siloed processes create friction that AI is uniquely positioned to resolve.

article thumbnail

Show HN: A minimalist (brutalist?) website for sharing all your links

Hacker News

Lynx.Boo lets you create a compact, fast-loading links page. A minimalist approach to sharing all your important links in one place.

129
129
article thumbnail

The evolutionary mystery of the German cockroach

Hacker News

The species evolved to exploit human-built environments and exists nowhere else. So where did it come from?

129
129
article thumbnail

Ratting on wildlife crime: training rats to detect illegally trafficked wildlife

Hacker News

The illegal wildlife trade (IWT) is one of the largest global crime economies, directly threatening species and their habitats, and biodiversity, and indirec.

128
128
article thumbnail

Don't return named tuples in new APIs

Hacker News

In my opinion, you should only introduce a named tuple to your code when you're updating a preexisting API that was already returning a tuple or you are wrapping a tuple return value from another API. Let's start with when you should use named tuples.

127
127
article thumbnail

How to Achieve High-Accuracy Results When Using LLMs

Speaker: Ben Epstein, Stealth Founder & CTO | Tony Karrer, Founder & CTO, Aggregage

When tasked with building a fundamentally new product line with deeper insights than previously achievable for a high-value client, Ben Epstein and his team faced a significant challenge: how to harness LLMs to produce consistent, high-accuracy outputs at scale. In this new session, Ben will share how he and his team engineered a system (based on proven software engineering approaches) that employs reproducible test variations (via temperature 0 and fixed seeds), and enables non-LLM evaluation m

article thumbnail

Listening in on the Mysterious Marbled Murrelet

Hacker News

Applying machine learning to forest soundscapes helps researchers pinpoint rare and threatened birds.

article thumbnail

Chi-Fi Tuning – Why It Sounds So Damn Piercing to Western Ears (2020)

Hacker News

Our opinion why Chi-Fi tuning can hurt your ears!

123
123
article thumbnail

Killing the Command message: should we use Events or Documents? (2007)

Hacker News

Comments

117
117
article thumbnail

Static Basic Block Versioning

Hacker News

Comments

116
116
article thumbnail

Maximizing Profit and Productivity: The New Era of AI-Powered Accounting

Speaker: Yohan Lobo and Dennis Street

In the accounting world, staying ahead means embracing the tools that allow you to work smarter, not harder. Outdated processes and disconnected systems can hold your organization back, but the right technologies can help you streamline operations, boost productivity, and improve client delivery. Dive into the strategies and innovations transforming accounting practices.

article thumbnail

Colossus AI Supercluster with over 100k Nvidia H100 GPUs

Hacker News

Comments

AI 113
article thumbnail

Cash: An absurdly small jQuery alternative for modern browsers

Hacker News

An absurdly small jQuery alternative for modern browsers.

111
111
article thumbnail

Show HN: Someday, Open-Source Calendly Alternative for Gmail / Google App Script

Hacker News

Free and open-source cal.com / calendly alternative built on Google-App-Script for Gmail users. Built with modern technologies like React, TypeScript, Shadcn/UI, and Vite.

109
109
article thumbnail

131M American Buildings

Hacker News

Benchmarks & Tips for Big Data, Hadoop, AWS, Google Cloud, PostgreSQL, Spark, Python & More.

Hadoop 104
article thumbnail

The 2nd Generation of Innovation Management: A Survival Guide

Speaker: Chris Townsend, VP of Product Marketing, Wellspring

Over the past decade, companies have embraced innovation with enthusiasm—Chief Innovation Officers have been hired, and in-house incubators, accelerators, and co-creation labs have been launched. CEOs have spoken with passion about “making everyone an innovator” and the need “to disrupt our own business.” But after years of experimentation, senior leaders are asking: Is this still just an experiment, or are we in it for the long haul?