Blog - Data Science Current

Build Audio LLM Apps with AssemblyAI

AssemblyAI

APRIL 5, 2024

Hey 👋, this weekly update contains the latest info on our new product features, tutorials, and our community. LeMUR Cookbooks: Build Audio LLM Apps. LeMUR is the easiest way to code applications that apply LLMs to speech. Processing Speaker Labels with LeMUR. Check our blog for full details.

Python, AI

Improved Hold Music Detection + Build LLM Audio Apps with LeMUR

AssemblyAI

DECEMBER 1, 2023

LeMUR: Build LLM apps on voice data. LeMUR is the easiest way to code applications that apply LLMs to speech. Processing Speaker Labels with LeMUR. Processing Edited Transcripts with LeMUR. Creating Chapter Summaries with LeMUR.

Python, AI

Lower latency, reduced prices, and our Java SDK release

AssemblyAI

JANUARY 12, 2024

You can now access our Speech AI models with the below pricing: Async Speech-to-Text for $0.37 per hour (previously $0.65); Real-time Speech-to-Text for $0.47. Our Trending YouTube Tutorials: Build Talking AI ChatBot with Text-to-Speech using Python! speakerLabels(true).build();

AI, Python

Webinars

How to Optimize the Developer Experience for Monumental Impact

Generative AI Deep Dive: Advancing from Proof of Concept to Production

Understanding User Needs and Satisfying Them

Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You Need to Know

The Project Clinic: Assessing Project Health, Planning, and Execution

4 ways generative AI addresses manufacturing challenges

IBM Journey to AI blog

APRIL 15, 2024

The industry must continually optimize process, improve efficiency, and improve overall equipment effectiveness. If the machine or equipment fails, the maintenance engineers can use gen AI to quickly diagnose problems based on the maintenance manual and an analysis of the process parameters.

AI, Data Lakes, Analytics

Universal Speech Model (USM): State-of-the-art speech AI for 100+ languages

Google Research AI blog

MARCH 6, 2023

Today, we are excited to share more about the Universal Speech Model (USM), a critical first step towards supporting 1,000 languages. USM is a family of state-of-the-art speech models with 2B parameters trained on 12 million hours of speech and 28 billion sentences of text, spanning 300+ languages.

Supervised Learning, AI, Algorithm

2023 at AssemblyAI - A Year in Review

AssemblyAI

DECEMBER 20, 2023

Here are some of the new products and features we've launched for customers in 2023: Conformer-1 and Conformer-2 AI Models Released: The year saw the launch of Conformer-2, our enhanced AI model for automatic speech recognition. Processing Speaker Labels with LeMUR.

Python, AWS, Database, AI

Conformer-1: A robust speech recognition model trained on 650K hours of data

AssemblyAI

MARCH 15, 2023

We determined that for a 300 million parameter language model, we'd need roughly 6 billion tokens of text, which corresponds to about 625K hours of speech [II]. "Conformer: Convolution-augmented transformer for speech recognition." each node is connected to every other node).

Larger language models do in-context learning differently

Google Research AI blog

MAY 15, 2023

In general, models’ success at in-context learning is enabled by: Their use of semantic prior knowledge from pre-training to predict labels while following the format of in-context examples (e.g., Learning the input-label mappings in context from the presented examples (e.g.,

Natural Language Processing, Machine Learning

Beginner’s Guide to ML-001: Introducing the Wonderful World of Machine Learning: An Introduction

Towards AI

FEBRUARY 20, 2024

I am starting a series with this blog, which will guide a beginner to get the hang of the ‘Machine learning world’. labeled as Cat. The computer model analyses different features with the label. Supervised learning: This involves learning from labeled data, where each data point has a known outcome.

Machine Learning, ML

Community Spotlight: Dr. Helen Yannakoudakis

DrivenData Labs

MAY 18, 2023

I work on machine learning for natural language processing, and I’m particularly interested in few-shot learning, lifelong learning, and societal and health applications such as abuse detection, misinformation, mental ill-health detection, and language assessment. Data science is a broad field. What areas are you particularly interested in?

Natural Language Processing, Machine Learning, Data Science

A journey from hieroglyphs to chatbots: Understanding NLP over Google’s USM updates

Dataconomy

MARCH 14, 2023

In recent years, natural language processing and conversational AI have gained significant attention as technologies that are transforming the way we interact with machines and each other. The model’s automatic speech recognition (ASR) capabilities are not limited to commonly spoken languages like English and Mandarin.

Natural Language Processing, Machine Learning, Supervised Learning

Google at Interspeech 2023

Google Research AI blog

AUGUST 21, 2023

Posted by Catherine Armato, Program Manager, Google. This week, the 24th Annual Conference of the International Speech Communication Association (INTERSPEECH 2023) is being held in Dublin, Ireland, representing one of the world’s most extensive conferences on research and technology of spoken language understanding and processing.

Clustering, AI

Generative AI Terminology — An Evolving Taxonomy To Get You Started

Towards AI

JANUARY 30, 2024

While I post the list in this blog, I’m also maintaining a “live” list that is detailed and will keep on updating. Example: TinyLlama, Pythia. Large Multimodal Models (LMMs): Multimodal refers to the ability of the model to process and generate not just text but also other data modalities like image, video, speech, audio, etc.

AI, Natural Language Processing, Supervised Learning

How to Use Hugging Face Pipelines?

Towards AI

FEBRUARY 13, 2023

This blog will walk you through how to perform NLP tasks with Hugging Face Pipelines. Here are topics we’ll discuss in this blog. It also overcomes complex challenges in speech recognition and computer vision, such as creating a transcript of a sound sample or a description of an image. It helps you label text.

Python, Deep Learning, Natural Language Processing

Natural Language Processing (NLP) Concepts With NLTK

Heartbeat

MARCH 22, 2023

Learn NLP data processing operations with NLTK, visualize data with Kangas, build a spam classifier, and track it with Comet Machine Learning Platform. At its core, the discipline of Natural Language Processing (NLP) tries to make the human language “palatable” to computers.

Natural Language Processing, Deep Learning, Machine Learning

The Ultimate Guide to Data Preparation for Machine Learning

DagsHub

FEBRUARY 29, 2024

Improved data preparation techniques focused on efficient data labeling, management, augmentation and curation, while keeping the model relatively fixed in its architecture, have led to significantly better model outcomes. In this blog, I will describe how to prepare data for machine learning in depth. million per year.

Data Preparation, Machine Learning, Data Governance

Who Said What? Recorder's On-device Solution for Labeling Speakers

Google Research AI blog

DECEMBER 14, 2022

It leverages recent developments in on-device machine learning to transcribe speech, recognize audio events, suggest tags for titles, and help users navigate transcripts. During the Made By Google event this year, we announced the "speaker labels" feature for the Recorder app.

Clustering, Algorithm, Machine Learning

Streamline diarization using AI as an assistive technology: ZOO Digital’s story

AWS Machine Learning Blog

FEBRUARY 20, 2024

This time-consuming process must be completed before content can be dubbed into another language. The engagement focused on delivering a functional solution for the localization process, while providing hands-on training to ZOO Digital developers on SageMaker, Amazon Transcribe, and Amazon Translate. in a code subdirectory.

AWS, AI, Machine Learning

Data-centric ML benchmarking: Announcing DataPerf’s 2023 challenges

Google Research AI blog

MARCH 30, 2023

The process of creating high quality datasets is complicated and error-prone, from the initial selection and cleaning of raw data, to labeling the data and splitting it into training and test sets. For instance: Data selection: Often, we have a larger pool of available data than we can label or train on effectively.

ML, Algorithm, Natural Language Processing

Whisper models for automatic speech recognition now available in Amazon SageMaker JumpStart

AWS Machine Learning Blog

OCTOBER 10, 2023

Whisper is a pre-trained model for automatic speech recognition (ASR) and speech translation. Trained on 680 thousand hours of labelled data, Whisper models demonstrate a strong ability to generalize to many datasets and domains without the need for fine-tuning. The original code can be found in this GitHub repository.

Machine Learning, ML

Supervised learning vs Unsupervised learning

Pickl AI

APRIL 3, 2023

Significantly, one of the most prominent examples of a Machine Learning model is Siri by Apple, which can recognise speech, converting audio into textual form. Supervised Learning is the type of Machine Learning where the training of an algorithm takes place using labelled data.

Supervised Learning, Machine Learning, Clustering

Best practices for building secure applications with Amazon Transcribe

AWS Machine Learning Blog

MARCH 25, 2024

Amazon Transcribe is an AWS service that allows customers to convert speech to text in either batch or streaming mode. It uses machine learning–powered automatic speech recognition (ASR), automatic language identification, and post-processing technologies. The customer data is cleaned up for both complete and failure cases.

AWS, Natural Language Processing, Machine Learning

ODSC’s AI Weekly Recap: Week of March 8th

ODSC - Open Data Science

MARCH 8, 2024

In a blog post released today, OpenAI fired back at Elon Musk’s lawsuit and moved to dismiss his claims about the company’s motives. MetaVoice is a text-to-speech foundational model for human-like expression. parameter base model trained on 100K hours of speech for TTS (text-to-speech). Blogs or News to Share?

Artificial Intelligence, AI

SNOMED CT Entity Linking Challenge - Benchmark

DrivenData Labs

JANUARY 22, 2024

This guest blog post from our partners at Veratai contains code for training the benchmark entity linking model for the SNOMED CT Entity Linking Challenge. One way to analyze clinical notes is to identify and label the portions of each note that correspond to specific medical concepts. We define the following token labels: O.

Named Entity Recognition With SpaCy

Heartbeat

APRIL 17, 2023

One of the goals of ML is to enable computers to process and analyze data in a way that is similar to how humans process information. Human brains are capable of processing vast amounts of information from the environment and making complex decisions based on that information. What is Named Entity Recognition (NER)?

Natural Language Processing, Support Vector Machines, Machine Learning

Beyond prompting: getting production quality LLM performance with Snorkel Flow

Snorkel AI

AUGUST 9, 2023

In this blog post, we walk through an anonymized real-world use case comparing a variety of state-of-the-art LLMs (GPT 4, GPT 3.5, In this blog post, we’ll look specifically at the task of extracting “product resistances” for rugs. We corrected these via prompt where possible and then later via additional labeled data for fine-tuning.

Artificial Intelligence, AI

Converting data into SQuAD format for fine-tuning LLM models

Mlearning.ai

APRIL 21, 2023

Text annotation is the process of adding structured information to unstructured text data in order to make it more understandable and useful for downstream applications. The resulting annotated text data can then be used to train and improve the accuracy of LLM models for specific natural language processing tasks.

Natural Language Processing, Supervised Learning, Machine Learning

Top 10 Deep Learning Algorithms in Machine Learning

Pickl AI

AUGUST 3, 2023

These algorithms have shown remarkable success in solving a wide range of complex tasks, such as image recognition, natural language processing, speech recognition, and more. This process is known as training, and it relies on large amounts of labeled data. Read Blog: How to build a Machine Learning Model?

Deep Learning, Machine Learning

PRESTO – A multilingual dataset for parsing realistic task-oriented dialogues

Google Research AI blog

MARCH 27, 2023

In the natural language processing (NLP) literature, this is mainly framed as a task-oriented dialogue parsing task, where a given dialogue needs to be parsed by a system to understand the user intent and carry out the operation to fulfill that intent. We’d also like to thank Tom Small for the animations in this blog post.

Natural Language Processing, Data Quality

Understanding Natural Language Processing — Sentiment Analysis

Mlearning.ai

JANUARY 26, 2023

Introduction: Natural language processing (NLP) sentiment analysis is a powerful tool for understanding people’s opinions and feelings toward specific topics. NLP sentiment analysis uses natural language processing (NLP) to identify, extract, and analyze sentiment from text data. What is NLP Sentiment Analysis?

Natural Language Processing, Deep Learning, Algorithm

Machine Learning vs. Deep Learning - A Comparison

Heartbeat

OCTOBER 11, 2023

This process is known as machine learning or deep learning. Supervised learning uses labeled data with input-output pairs, unsupervised learning discovers hidden patterns and structures in unlabeled data, and reinforcement learning learns through interactions with an environment and rewards or punishments.

Deep Learning, Machine Learning

What Is Media Monitoring? (Definition, Benefits, and AI)

AssemblyAI

SEPTEMBER 12, 2023

Media monitoring is the process of tracking, collecting, and analyzing mentions of brands, topics, or keywords across platforms. Here's how brands can leverage AI to supercharge their media monitoring activities: Speech and video recognition: AI models can process and transcribe video and audio content.

AI

How Image Annotation Teaches Machines to See

Defined.ai blog

JULY 31, 2022

Everything you need to know about image data labelling and image annotation. Computer vision technology has massive potential. However, precise data labelling is crucial to training accurate computer vision models. Machine learning models learn in the same way, which is why accuracy in data labelling is key to success.

Deep Learning, Natural Language Processing, Machine Learning

What Is a Transformer Model?

Hacker News

MARCH 25, 2022

Transformers are translating text and speech in near real-time, opening meetings and classrooms to diverse and hearing-impaired attendees. No Labels, More Performance. Before transformers arrived, users had to train neural networks with large, labeled datasets that were costly and time-consuming to produce.

Machine Learning, AI

Seattle Police Department using AI software to analyze body cam footage and officer behavior

Flipboard

FEBRUARY 3, 2023

A spokeswoman said “it’s too early in the process to speak to measurable outcomes.” “The final product of Truleo’s Audio Analysis is a rich analysis of thousands of conversations, enabling departments to quickly identify at-risk incidents and department-wide trends,” the company said in a blog post explaining how the AI platform works.

AI, Natural Language Processing, Artificial Intelligence

Unlocking the Potential of LLMs: From MLOps to LLMOps

Heartbeat

OCTOBER 5, 2023

These LLMs can generate human-like text, understand context, and perform various Natural Language Processing (NLP) tasks. Let's delve into some key distinctions: Data Collection and Labeling. MLOps: Focuses on sourcing, wrangling, cleaning, and labeling data. Clean and preprocess data to enhance quality.

ML, Machine Learning

Seven Reasons why Human Evaluation of your Text-to-Speech Projects is Important

Defined.ai blog

SEPTEMBER 20, 2022

To claim that Text-to-Speech (TTS) technology is transforming the way users communicate with mobile-enabled handheld devices and smart assistants is more of an understatement than an assertion these days. Here are the top seven reasons why human evaluation of your speech models is important: 1.

AI, Artificial Intelligence

How to build a Machine Learning Model?

Pickl AI

AUGUST 1, 2023

Machine Learning models play a crucial role in this process, serving as the backbone for various applications, from image recognition to natural language processing. In this blog, we will delve into the fundamental concepts of data model for Machine Learning, exploring their types. What is Machine Learning?

Machine Learning, Support Vector Machines, Decision Trees

The most valuable AI use cases for business

IBM Journey to AI blog

FEBRUARY 14, 2024

Voice-based queries use natural language processing (NLP) and sentiment analysis for speech recognition so their conversations can begin immediately. With text to speech and NLP, AI can respond immediately to texted queries and instructions. AIOps is one of the fastest ways to boost ROI from digital transformation investments.

AI, ML

Live Meeting Assistant with Amazon Transcribe, Amazon Bedrock, and Knowledge Bases for Amazon Bedrock

AWS Machine Learning Blog

APRIL 18, 2024

It uses Amazon Transcribe for speech to text, Knowledge Bases for Amazon Bedrock for contextual queries against your company’s documents and knowledge sources, and Amazon Bedrock models for customizable transcription insights and summaries. Processing flow overview: How did LMA transcribe and analyze your meeting?

AWS, Analytics, AI

Improving ALBERT’s Efficiency with Knowledge Distillation

Heartbeat

JUNE 28, 2023

These models are typically designed to perform specific tasks, such as image or speech recognition, with high accuracy while minimizing power consumption and memory usage. The idea is to use these soft targets instead of the hard labels that are typically used to train models. to(device) labels = batch[2].to(device)

Machine Learning, Deep Learning

The business value of operating core insurance solutions on the cloud

IBM Journey to AI blog

JUNE 23, 2023

Core modernization (processes and technology) is a top priority for every insurer. The supervised learning that is used to train AI requires a lot of human effort, is difficult, requires intensive labeling and takes months of effort. Insurers want to shift from fixed to variable, “pay-as-you-go” operating costs.

Supervised Learning, AI, Artificial Intelligence
