Data Science Current

Consistency LLM: converting LLMs to parallel decoders accelerates inference 3.5x

Hacker News

MAY 8, 2024

In this blog, we show pretrained LLMs can be easily taught to operate as efficient parallel decoders. We introduce Consistency Large Language Models (CLLMs), a new family of parallel decoders capable of reducing inference latency by efficiently decoding an $n$-token sequence per inference step.

Community Spotlight: Dr. Helen Yannakoudakis

DrivenData Labs

MAY 18, 2023

In this post we sit down with Dr. Helen Yannakoudakis, a winner of the Hateful Memes competition and an Assistant Professor at King’s College London, Visiting Researcher at the University of Cambridge, and co-founder and Chief Scientific Officer at Kinhub. Data science is a broad field. What areas are you particularly interested in?

Natural Language Processing, Machine Learning, Data Science

Large language models: A beginner’s guide to 2023’s top technology

Data Science Dojo

JUNE 20, 2023

The buzz surrounding large language models is hard to miss, and for good reason! What are large language models? These expansive language models represent a highly effective utilization of transformer models. Translation: LLMs can translate text from one language to another.

Natural Language Processing, Data Science, AI

GPT 3.5 and GPT 4 comparative analysis

Data Science Dojo

NOVEMBER 30, 2023

GPT stands for Generative Pretrained Transformer, a family of large language models (LLMs) developed by OpenAI. These models can be used for a variety of tasks, including generating text, translating languages, and writing different kinds of creative content.

Algorithm, AI

Top Important LLM Papers for the Week from 18/03 to 24/03

Towards AI

MARCH 28, 2024

Stay updated with recent large language model research. Large language models (LLMs) have advanced rapidly in recent years. As new generations of models are developed, researchers and engineers need to stay informed on the latest progress.

Machine Learning, Data Science, AI

How Phenomenal Is Sophia from the Stanford Team for Training LLMs?

Flipboard

JULY 3, 2023

A team of researchers at Stanford University has introduced a groundbreaking optimization technique called Sophia, designed to revolutionize the pretraining process for large language models (LLMs).

Computer Science, Artificial Intelligence

CodeTF: One-Stop Transformer Library for State-of-the-Art Code LLM

Hacker News

JUNE 7, 2023

Code intelligence plays a key role in transforming modern software engineering. Recently, deep learning-based models, especially Transformer-based large language models (LLMs), have demonstrated remarkable potential in tackling these tasks by leveraging massive open-source code data and programming language features.

Machine Learning, Deep Learning

Reinforcement Learning from Human Feedback (RLHF)

Towards AI

OCTOBER 31, 2023

We will focus on text-to-text language models, such as GPT-3, BLOOM, and T5. Models like BERT, which are encoder-only, are not addressed. Once you have that pretrained language model, you can also do an extra optional step, called Supervised Fine-Tuning (SFT). This will be our reward model.

Algorithm, AI, Artificial Intelligence

Revolutionize LLM with Llama 2 fine-tuning

Data Science Dojo

OCTOBER 1, 2023

With the introduction of LLaMA v1, we witnessed a surge in customized models like Alpaca, Vicuna, and WizardLM. This surge motivated various businesses to launch their own foundational models, such as OpenLLaMA, Falcon, and XGen, with licenses suitable for commercial purposes.

Machine Learning, AI

NLP News Cypher | 07.26.20

Towards AI

JULY 21, 2023

Salesforce Research has recently released a unidirectional language model called SimpleTOD that attempts to solve all the sub-tasks in an end-to-end manner.

Natural Language Processing, ML, Python

Researchers Introduce Proxy-Tuning: An Efficient Alternative to Finetuning Large Language Models

ODSC - Open Data Science

JANUARY 25, 2024

Researchers from the University of Washington and the Allen Institute for AI have set a new precedent in the work of fine-tuning LLMs. Their work introduces a concept known as “proxy-tuning,” a method that promises to streamline the adaptation of large pretrained LMs. This is where proxy-tuning comes into play.

Data Science, Algorithm, AI

AI Technology NYUTron Accurately Predicts Health Outcomes

NYU Center for Data Science

JUNE 30, 2023

NYUTron, the large language model (LLM), is able to read physicians’ notes and estimate patients’ risk of death, length of hospital stays, and other health factors. Physicians often write in individualized language, and the data reorganization required to compile the information into neat tables is time-consuming.

AI, Natural Language Processing, Artificial Intelligence

SQuARE: Towards Multi-Domain and Few-Shot Collaborating Question Answering Agents

ODSC - Open Data Science

APRIL 3, 2023

Are you fascinated by the power of Question Answering (QA) models but find yourself intimidated by technical challenges? Do you yearn to compare different QA models but dread the time-consuming process of setting them up? Or do you want to compare the capabilities of ChatGPT against regular fine-tuned QA models?

Natural Language Processing, Machine Learning, Data Science

The NLP Cypher | 03.07.21

Towards AI

JULY 20, 2023

OpenChat is an awesome repo where one can interact with top-tier dialogue models with just one line of code. The hyunwoongko/openchat repository describes OpenChat as an open-source chatting framework for generative models.

Natural Language Processing, Machine Learning, Python

Inside RAFT: UC Berkeley’s Method to Improve RAG for Domain Specific Scenarios

Towards AI

MARCH 26, 2024

The goal is to keep you up to date with machine learning projects, research papers, and concepts. When these LLMs are applied to specific tasks, it’s often necessary to integrate additional information, such as the latest news or specialized knowledge, into the already trained model.

Machine Learning, Artificial Intelligence

What Is a Transformer Model?

Hacker News

MARCH 25, 2022

So, What’s a Transformer Model? A transformer model is a neural network that learns context and thus meaning by tracking relationships in sequential data like the words in this sentence. First described in a 2017 paper from Google, transformers are among the newest and one of the most powerful classes of models invented to date.

Machine Learning, AI

Meet BLIP-2: Salesforce New Open Source Visual-Language Model that is Faster and Simpler than GPT-4

Towards AI

APRIL 1, 2023

The model radically improves the cost and efficiency of pretraining for visual-language models. The goal is to keep you up to date with machine learning projects, research papers and concepts.

Machine Learning, Artificial Intelligence

Inside XGen-Image-1: How Salesforce Research Built, Trained, and Evaluated a Massive Text-to-Image Model

Towards AI

AUGUST 14, 2023

One of the most efficient training processes for text-to-image models ever implemented. I recently started an AI-focused educational newsletter that already has over 160,000 subscribers. The goal is to keep you up to date with machine learning projects, research papers, and concepts.

Machine Learning, Artificial Intelligence

Microsoft Researchers Propose a Novel Framework for LLM Calibration Using Pareto Optimal Self-Supervision without Using Labeled Training Data

Flipboard

JULY 3, 2023

Recent developments have seen a remarkable increase in the capability of large language models (LLMs), with generative pretrained transformer (GPT) models showing significant promise. Additionally, generative models are frequently used in a variety of sectors to generate data for different applications.

AI, ML

6 Examples of Domain-Specific Large Language Models

ODSC - Open Data Science

SEPTEMBER 6, 2023

Most people who have experience working with large language models such as Google’s Bard or OpenAI’s ChatGPT have worked with an LLM that is general, and not industry-specific. But as time has gone on, many industries have realized the power of these models. This is where BioBERT comes in.

Data Science, Supervised Learning, Python, AI

Google Research, 2022 & beyond: Robotics

Google Research AI blog

FEBRUARY 14, 2023

Posted by Kendra Byrne, Senior Product Manager, and Jie Tan, Staff Research Scientist, Robotics at Google. (This is Part 6 in our series of posts covering different topical areas of research at Google.) When applied to robotics, LLMs let people task robots more easily, just by asking, with natural language.

Algorithm, System Architecture, Deep Learning

This AI newsletter is all you need #83

Towards AI

JANUARY 24, 2024

As Zuckerberg explains, Meta’s new, broader focus on AGI was influenced by the release of Llama 2, its latest large language model, last year. In his vision, he also highlights the need to lean towards open-source and more transparent models for as long as it makes sense and is the safe and responsible thing to do.

AI, Algorithm, Analytics

A comprehensive guide to learning LLMs (Foundational Models)

Mlearning.ai

JUNE 14, 2023

The LLM (Foundational Models) space has seen tremendous and rapid growth. Here is a print-friendly view of all the resources.

Natural Language Processing, ML, Support Vector Machines

Google Introduces Imagen: A Text-to-Image Diffusion Model With a Focus on Photorealism

ODSC - Open Data Science

JULY 7, 2023

Google has unveiled Imagen, a groundbreaking text-to-image diffusion model that pushes the boundaries of photorealism while demonstrating an advanced level of language understanding. According to the post by Google Research, Imagen combines the power of large transformer language models with the capabilities of diffusion models.

Data Science, AI, Artificial Intelligence

This AI newsletter is all you need #61

Towards AI

AUGUST 22, 2023

What happened this week in AI, by Louie: In recent months we have continued to see large language model (LLM) advancements and a gradual introduction of novel techniques, but we haven’t yet seen competition directly aiming to displace GPT-4 as the most advanced (and training compute-intensive) model.

AI, Azure, ML

ODSC’s AI Weekly Recap: Week of January 12th

ODSC - Open Data Science

JANUARY 12, 2024

The New York Times reports on how AI-powered robots and chatbots are poised to transform 2024. New research has found a way to use artificial intelligence to bring efficiency to the crowdsourcing process by targeting ideas. … represents a significant advancement in Multimodal Large Language Models (MLLMs).

AI, Artificial Intelligence

Efficiently fine-tune the ESM-2 protein language model with Amazon SageMaker

AWS Machine Learning Blog

MARCH 6, 2024

In this post, we demonstrate how to efficiently fine-tune a state-of-the-art protein language model (pLM) to predict protein subcellular localization using Amazon SageMaker. Because of this, many life science researchers need to answer questions about proteins faster, cheaper, and more accurately.

AWS, Machine Learning, ML

Our Approach to Alignment Research

OpenAI

AUGUST 24, 2022

Our alignment research aims to make artificial general intelligence (AGI) aligned with human values and follow human intent. We believe that even without fundamentally new alignment ideas, we can likely build sufficiently aligned AI systems to substantially advance alignment research itself.

AI

Emily Webber of AWS on Pretraining Large Language Models

ODSC - Open Data Science

AUGUST 4, 2023

As newer fields emerge within data science and the research is still hard to grasp, sometimes it’s best to talk to the experts and pioneers of the field. Emily Webber is the author of “Pretrain Vision and Large Language Models in Python: End-to-end techniques for building and deploying foundation models on AWS.”

AWS, Machine Learning, Data Science

Unlocking the Potential: The Fascinating World of Language Model Optimization with ChatGPT

Pickl AI

SEPTEMBER 21, 2023

In recent years, we’ve witnessed an unprecedented surge in the capabilities of Artificial Intelligence, and at the forefront of this revolution are language models. The rapid advancement of language models has revolutionized the way we interact with technology.

Natural Language Processing, AI, Algorithm

Trends in AI - 2023 Round-up

Towards AI

JANUARY 25, 2023

What’s next for Language Models, Reinforcement Learning, Computer Vision, and leading AI companies like OpenAI and Google? Language Models research is far from over. Twitter has long been the biggest online space where AI researchers share and discuss their work publicly.

AI, Deep Learning

4 new papers show foundation models can build on themselves

Snorkel AI

AUGUST 31, 2023

While the surest way to improve the performance of foundation models (FMs) is through more and better data, Snorkel researchers have explored how FMs can learn from themselves. Foundation models contain a great deal of additional information that can be relied on for further benefit. Let’s dive in.

Data Scientist, Artificial Intelligence, Supervised Learning

Unlocking Document Intelligence: E2E Azure-Powered Chatbot with Vector-Based Search (Part 2 — Q&A)

Towards AI

FEBRUARY 28, 2024

Let’s continue our exploration by querying this vector store with the natural language questions that drive our document processing pipeline.

Azure, AI, Python

Debugging Object Detection Models, 8 Trending LLMs, New AI Tools, and Generative AI as a Must-Have…

ODSC - Open Data Science

JUNE 29, 2023

Debug Object Detection Models with the Responsible AI Dashboard: This blog will focus on the Azure Machine Learning Responsible AI Dashboard’s new vision insights capabilities, which support debugging of object detection models.

AI, Predictive Analytics, Azure

Who is Harry Potter? Inside Microsoft Research’s Fine-Tuning Method for Unlearning Concepts in LLMs

Towards AI

OCTOBER 16, 2023

The goal is to keep you up to date with machine learning projects, research papers, and concepts. The datasets used in the pretraining of LLMs often include copyrighted material, triggering both legal and ethical concerns for developers, users, and original content creators.

Machine Learning, Artificial Intelligence

Generative AI that’s tailored for your business needs with watsonx.ai

IBM Journey to AI blog

SEPTEMBER 28, 2023

An AI and data platform, such as watsonx, can help empower businesses to leverage foundation models and accelerate the pace of generative AI adoption across their organization. Business-targeted, IBM-developed foundation models built from sound data: Business leaders charged with adopting generative AI need model flexibility and choice.

AI, Algorithm, Artificial Intelligence

New AI classifier for indicating AI-written text

Hacker News

JANUARY 31, 2023

It performs significantly worse in other languages and it is unreliable on code. Training the classifier Our classifier is a language model fine-tuned on a dataset of pairs of human-written text and AI-written text on the same topic. We recommend using the classifier only for English text.

AI

Evolving Trends in Prompt Engineering for Large Language Models (LLMs) with Built-in Responsible AI…

ODSC - Open Data Science

AUGUST 24, 2023

Editor’s note: Jayachandran Ramachandran and Rohit Sroch are speakers for ODSC APAC this August 22–23.

Artificial Intelligence, Natural Language Processing, AI

Demystifying AI for everyone: Part 1 - NLP Basics

Towards AI

JULY 19, 2023

We speak with each other using various languages, for example English, German, French, and Hindi. Natural Language Processing (NLP) is just one part of Artificial Intelligence (AI) that helps computers understand and process human language.

Natural Language Processing, AI, Artificial Intelligence

How ChatGPT actually works

AssemblyAI

DECEMBER 23, 2022

ChatGPT is the latest language model from OpenAI and represents a significant improvement over its predecessor GPT-3. Similarly to many Large Language Models, ChatGPT is capable of generating text in a wide range of styles and for different purposes, but with remarkably greater precision, detail, and coherence.

Supervised Learning, Algorithm, Deep Learning

5 Papers You Can’t Miss: Large Language Models

Mlearning.ai

MAY 13, 2023

Explore the 5 most impactful large language model papers of 2023. Language models have revolutionized the field of natural language processing (NLP), allowing for unprecedented advances in applications such as chatbots, virtual assistants, and text generation.

Natural Language Processing, AI, ML

New research expands limitations of weak supervision, foundation models

Snorkel AI

MARCH 24, 2023

Snorkel AI researchers continue to push the frontier of machine learning, as demonstrated by the 18 research papers recently added to our website. This batch of research papers, all published in 2022, presents new developments in weak supervision and foundation models. Dataset Debt in Biomedical Language Modeling J.

Natural Language Processing, Machine Learning, Supervised Learning
