Data Science Current

research latent-language-diffusion-model

Understanding Sora: An OpenAI model for video generation

Data Science Dojo

FEBRUARY 16, 2024

It is a new generative AI Text-to-Video model that can create minute-long videos from a textual prompt. Moreover, the model can express emotions in its visual characters. While it is a Text-to-Video generative model, OpenAI highlights that Sora can work with a diverse range of prompts, including existing images and videos.

Artificial Intelligence

Artificial Intelligence Artificial Intelligence AI AI

AI Painting: Release of the Stable Diffusion 3 Model

Towards AI

MARCH 6, 2024

The recent publication of the Stable Diffusion 3 paper has brought exciting news! Upon evaluation, Stable Diffusion 3 has surpassed other leading systems in text-to-image generation, including DALL·E 3, Midjourney v6, and Ideogram v1. To achieve this, we’ve utilized some pre-trained models to assist AI in “translating”.

AI AI Machine Learning Machine Learning

Join 20,000+

professionals

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Webinars

How to Optimize the Developer Experience for Monumental Impact

Generative AI Deep Dive: Advancing from Proof of Concept to Production

Understanding User Needs and Satisfying Them

Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You Need to Know

Leading the Development of Profitable and Sustainable Products

MORE WEBINARS

Trending Sources

Diffusion Models vs. GANs vs. VAEs: Comparison of Deep Generative Models

Towards AI

MAY 11, 2023

Diffusion Models vs. GANs vs. VAEs: Comparison of Deep Generative Models Deep generative models are applied to diverse domains such as image, audio, video synthesis, and natural language processing. Overview of different types of generative models. Figure created by the author. Training by adversarial loss.

Natural Language Processing

Natural Language Processing Deep Learning Deep Learning AI

Webinars

How to Optimize the Developer Experience for Monumental Impact

Generative AI Deep Dive: Advancing from Proof of Concept to Production

Understanding User Needs and Satisfying Them

Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You Need to Know

Leading the Development of Profitable and Sustainable Products

MORE WEBINARS

Sora AI: Unraveling Sora’s Architecture and Working Intuitively!

Towards AI

APRIL 1, 2024

A new text-to-video generative AI model has gained lots of interest and focus during the past few days. Although the model and its implementation have not been released to the public yet, don’t worry, my fellow enthusiasts! Before we dive into the details, let’s talk about existing research. Here comes a new AI again.

AI AI Natural Language Processing Artificial Intelligence

How DALL-E 2 Actually Works

AssemblyAI

SEPTEMBER 29, 2023

OpenAI's groundbreaking model DALL-E 2 hit the scene at the beginning of the month, setting a new bar for image generation and manipulation. DALL-E 2's impressive results have many wondering exactly how such a powerful model works under the hood. DALL-E 3 OpenAI has recently announced DALL-E 3, the successor to DALL-E 2.

Deep Learning

Deep Learning Deep Learning Machine Learning Machine Learning

Generating Images from Audio with Machine Learning

Heartbeat

JANUARY 13, 2024

Quick Summary In this article, I’ll show you how to create amazing images from audio using the magic of Machine Learning and the Transformers models. I’ll explain each step clearly, uncover the secrets behind Whisper, and highlight the incredible abilities of Hugging Face models. Imagine it as a jack of all NLP trades!

Machine Learning

Machine Learning Machine Learning Python Deep Learning

Inside XGen-Image-1: How Salesforce Research Built, Trained, and Evaluated a Massive Text-to-Image Model

Towards AI

AUGUST 14, 2023

One of the most efficient training processes for text-to-image models ever implemented. Image Credit: Salesforce Research I recently started an AI-focused educational newsletter, that already has over 160,000 subscribers. The goal is to keep you up to date with machine learning projects, research papers, and concepts.

Machine Learning

Machine Learning Machine Learning Artificial Intelligence Artificial Intelligence

An Exhaustive List of Open-source Generative AI Models in 2023

Heartbeat

AUGUST 10, 2023

Photo by Milad Fakurian on Unsplash Introduction With advanced models like Generative Pre-trained Transformer 3 (GPT-3), which provides human-like responses to user queries, AI is progressing toward generative tools to create realistic content, including text, videos, images, and audio.

AI AI ML ML

Google at ICLR 2023

Google Research AI blog

APRIL 30, 2023

We are proud to be a Diamond Sponsor of ICLR 2023, a premier conference on deep learning, where Google researchers contribute at all levels. Continue below to find the many ways in which Google researchers are engaged at ICLR 2023, including workshops, papers, posters and talks (Google affiliations in bold ).

Supervised Learning

Supervised Learning Machine Learning Machine Learning Deep Learning

5000x Generative AI: Intro, Overview, Models, Prompts, Technology, Tools, Comparisons & the Best…

Mlearning.ai

JANUARY 17, 2024

95x: Generative AI history 600+: Key Technological Concepts 2,350+: Models & Mediums — Text, Image, Video, Sound, Code, etc. Classic AI models are usually focused on a single task. Generative AI models usually have millions of neurons and billions of synapses (aka „ parameters “). Image credit: Yang, Jingfeng et.

AI AI Deep Learning Deep Learning

Inside OpenAI Sora: Five Key Technical Details We Learned About the Amazing Video Generation Model

Towards AI

FEBRUARY 20, 2024

The goal is to keep you up to date with machine learning projects, research papers, and concepts. Inside Sora Breaking Away from Tradition In text-to-video models, researchers traditionally have explored various techniques, including recurrent networks, generative adversarial networks, autoregressive transformers, and diffusion models.

Machine Learning

Machine Learning Machine Learning AI AI

Upscale images with Stable Diffusion in Amazon SageMaker JumpStart

AWS Machine Learning Blog

JANUARY 25, 2023

In November 2022, we announced that AWS customers can generate images from text with Stable Diffusion models in Amazon SageMaker JumpStart. Today, we announce a new feature that lets you upscale images (resize images without losing quality) with Stable Diffusion models in JumpStart.

Deep Learning

Deep Learning Deep Learning Python Natural Language Processing

What AI Music Generators Can Do (And How They Do It)

AssemblyAI

SEPTEMBER 22, 2023

Last year’s emergence of user-friendly interfaces for models like DALL-E 2 or Stable Diffusion for images and ChatGPT for text generation was key to boost the world’s attention to generative AI. Last week – StabilityAI launched StableAudio , a subscription-based platform for creating music with AI models.

AI AI Machine Learning Machine Learning

Can’t-Miss Sessions Announced for the Free Generative AI Summit on July 20

ODSC - Open Data Science

JULY 17, 2023

Recent Advances in Diffusion Generative Models Stefano Ermon PhD | Asst Professor @ Stanford University This session will present an alternative base for generative models: the vector field of gradients of the data distribution. This hallucination problem can cause the models to produce inaccurate information.

AI AI AWS Data Science

ODSC’s AI Weekly Recap: Week of March 1st

ODSC - Open Data Science

MARCH 1, 2024

Source ) Texas A&M has joined the Artificial Intelligence Safety Institute Consortium (AISIC), focusing on AI safety and reliability ( Source ) The Swiss National Science Foundation (SNSF) has set a stance on its position concerning the deployment of artificial intelligence technologies by researchers seeking its funding.

Artificial Intelligence

Artificial Intelligence Artificial Intelligence AI AI

Google at CVPR 2023

Google Research AI blog

JUNE 18, 2023

As a leader in computer vision research and a Platinum Sponsor , Google Research will have a strong presence across CVPR 2023 with 90 papers being presented at the main conference and active involvement in over 40 conference workshops and tutorials.

Supervised Learning

Supervised Learning Deep Learning Deep Learning Artificial Intelligence

Fine-tune text-to-image Stable Diffusion models with Amazon SageMaker JumpStart

AWS Machine Learning Blog

FEBRUARY 20, 2023

In November 2022, we announced that AWS customers can generate images from text with Stable Diffusion models in Amazon SageMaker JumpStart. Stable Diffusion is a deep learning model that allows you to generate realistic, high-quality images and stunning art in just a few seconds.

Algorithm

Algorithm Python Machine Learning Machine Learning

Inpaint images with Stable Diffusion using Amazon SageMaker JumpStart

AWS Machine Learning Blog

APRIL 10, 2023

In November 2022, we announced that AWS customers can generate images from text with Stable Diffusion models using Amazon SageMaker JumpStart. Today, we are excited to introduce a new feature that enables users to inpaint images with Stable Diffusion models.

Algorithm

Algorithm Deep Learning Deep Learning Python

Google Research, 2022 & Beyond: Language, Vision and Generative Models

Google Research AI blog

JANUARY 18, 2023

Posted by Jeff Dean, Senior Fellow and SVP of Google Research, on behalf of the Google Research community Today we kick off a series of blog posts about exciting new developments from Google Research. Please keep your eye on this space and look for the title “Google Research, 2022 & Beyond” for more articles in the series.

ML ML AI AI

Recent developments in Generative AI for Audio

AssemblyAI

JUNE 27, 2023

With various foundational ideas from large language models and text-to-image generation being adapted and incorporated into the audio modality , the latest AI-powered audio-generative systems are reaching a new unprecedented level of quality. But how do these new audio-generative models work?

AI AI Deep Learning Deep Learning

Top Ten Game-Changing Generative AI Projects in 2023

ODSC - Open Data Science

FEBRUARY 2, 2023

It’s more than just a simple Q&A model like a chatbot, as it can admit when it’s wrong or doesn’t have enough data, write out responses that are clear and comprehensive, write code, and even write out responses at length, making research a breeze for people. From recipes to writing blogs, it’s amazing how well it performs.

AI AI Data Science Artificial Intelligence

Google at ICML 2023

Google Research AI blog

JULY 23, 2023

Posted by Cat Armato, Program Manager, Google Groups across Google actively pursue research in the field of machine learning (ML), ranging from theory and application. We build ML systems to solve deep scientific and engineering challenges in areas of language, music, visual processing, algorithm development, and more.

Machine Learning

Machine Learning Machine Learning ML ML

Generative AI Space and the Mental Imagery of Alien Minds

Hacker News

JULY 17, 2023

Let’s say we use a typical generative AI to go from a description in human language—like “a cat in a party hat”—to a generated image: It’s exactly the kind of image we’d expect—which isn’t surprising, because it comes from a generative AI that’s trained to “do as we would”. but aren’t of things we humans have come up with words for.

AI AI Artificial Intelligence Artificial Intelligence

Google at NeurIPS 2022

Google Research AI blog

NOVEMBER 28, 2022

NeurIPS 2022 will be held as a hybrid event, in person in New Orleans, LA with some virtual attendance options, and includes invited talks, demonstrations and presentations of some of the latest in machine learning research. You can learn more about our work being presented in the list below (Google affiliations highlighted in bold ).

Machine Learning

Machine Learning Machine Learning Clustering Algorithm

10 Can’t-Miss Sessions on Language Models Coming to ODSC West 2023

ODSC - Open Data Science

OCTOBER 4, 2023

Evaluation Techniques for Large Language Models Rajiv Shah, PhD | Machine Learning Engineer | Hugging Face Selecting the right LLM for your needs has become increasingly complex. Towards Explainable and Language-Agnostic LLMs Walid S.

Supervised Learning

Supervised Learning Machine Learning Machine Learning Data Science

How to use Stable Audio, music generator of Stability AI

Dataconomy

SEPTEMBER 19, 2023

Last year, the company introduced Dance Diffusion, an AI solution designed to create songs and sound effects based on user-provided prompts. Despite its ingenuity, Dance Diffusion was left in its prototype phase as the R&D team pivoted to focus on their newly minted music generator.

AI AI Artificial Intelligence Artificial Intelligence

Understanding Sora: An OpenAI model for video generation

AI Painting: Release of the Stable Diffusion 3 Model

Webinars

Trending Sources

Diffusion Models vs. GANs vs. VAEs: Comparison of Deep Generative Models

Webinars

Sora AI: Unraveling Sora’s Architecture and Working Intuitively!

How DALL-E 2 Actually Works

Generating Images from Audio with Machine Learning

Inside XGen-Image-1: How Salesforce Research Built, Trained, and Evaluated a Massive Text-to-Image Model

An Exhaustive List of Open-source Generative AI Models in 2023

Google at ICLR 2023

5000x Generative AI: Intro, Overview, Models, Prompts, Technology, Tools, Comparisons & the Best…

Inside OpenAI Sora: Five Key Technical Details We Learned About the Amazing Video Generation Model

Upscale images with Stable Diffusion in Amazon SageMaker JumpStart

What AI Music Generators Can Do (And How They Do It)

Can’t-Miss Sessions Announced for the Free Generative AI Summit on July 20

ODSC’s AI Weekly Recap: Week of March 1st

Google at CVPR 2023

Fine-tune text-to-image Stable Diffusion models with Amazon SageMaker JumpStart

Inpaint images with Stable Diffusion using Amazon SageMaker JumpStart

Google Research, 2022 & Beyond: Language, Vision and Generative Models

Recent developments in Generative AI for Audio

Top Ten Game-Changing Generative AI Projects in 2023

Google at ICML 2023

Generative AI Space and the Mental Imagery of Alien Minds

Google at NeurIPS 2022

10 Can’t-Miss Sessions on Language Models Coming to ODSC West 2023

How to use Stable Audio, music generator of Stability AI

Stay Connected