Global Resiliency is a new Amazon Lex capability that enables near real-time replication of your Amazon Lex V2 bots in a second AWS Region. We walk through instructions for replicating a bot later in this post, and we also discuss how to handle integrations with AWS Lambda and Amazon CloudWatch after enabling Global Resiliency.
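As a rough illustration, Global Resiliency replication is managed through the Lex V2 model-building API. The following boto3 sketch assumes the CreateBotReplica and DescribeBotReplica operations; the bot ID and Regions are placeholders:

```python
import boto3

BOT_ID = "ABCDEFGHIJ"         # placeholder bot ID
REPLICA_REGION = "us-west-2"  # the secondary Region

# Global Resiliency is managed through the Lex V2 model-building API.
lex = boto3.client("lexv2-models", region_name="us-east-1")

# Request near real-time replication of the bot into the secondary Region.
lex.create_bot_replica(botId=BOT_ID, replicaRegion=REPLICA_REGION)

# Poll until the replica reaches the "Enabled" state.
desc = lex.describe_bot_replica(botId=BOT_ID, replicaRegion=REPLICA_REGION)
print(desc["botReplicaStatus"])
```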
Build a Search Engine: Setting Up AWS OpenSearch. This introduction covers what AWS OpenSearch is, what it is commonly used for, its key features, how it works, and why to use it for semantic search.
It employs advanced deep learning technologies to understand user input, enabling developers to create chatbots, virtual assistants, and other applications that can interact with users in natural language. Version control – With AWS CloudFormation, you can use version control systems like Git to manage your CloudFormation templates.
When we launched LLM-as-a-judge (LLMaJ) and Retrieval Augmented Generation (RAG) evaluation capabilities in public preview at AWS re:Invent 2024, customers used them to assess their foundation models (FMs) and generative AI applications, but asked for more flexibility beyond Amazon Bedrock models and knowledge bases.
About the Authors Shreyas Subramanian is a Principal Data Scientist who helps customers solve their business challenges with generative AI and deep learning on AWS services. Shreyas has a background in large-scale optimization and ML, and in the use of ML and reinforcement learning for accelerating optimization tasks.
Yes, the AWS re:Invent season is upon us and as always, the place to be is Las Vegas! You marked your calendars, you booked your hotel, and you even purchased the airfare. Now all you need is some guidance on generative AI and machine learning (ML) sessions to attend at this twelfth edition of re:Invent.
For AWS and Outerbounds customers, the goal is to build a differentiated machine learning and artificial intelligence (ML/AI) system and reliably improve it over time. First, the AWS Trainium accelerator provides a high-performance, cost-effective, and readily available solution for training and fine-tuning large models.
run_opensearch.sh (Running OpenSearch Locally): a script to start OpenSearch using Docker for local testing before deploying to AWS. Register the Sentence Transformer model in AWS OpenSearch: AWS users must ensure that OpenSearch can access the model before indexing. These can be used for evaluation and comparison.
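For context, here is a hedged sketch of that registration step. It assumes the local instance runs the ML Commons plugin and registers a pretrained sentence transformer over the REST API; the model name, credentials, and endpoint are illustrative defaults for the Docker image:

```python
import requests

BASE = "https://localhost:9200"
AUTH = ("admin", "admin")  # default Docker credentials; change in production

# Register a pretrained sentence transformer through the ML Commons plugin.
register = requests.post(
    f"{BASE}/_plugins/_ml/models/_register",
    json={
        "name": "huggingface/sentence-transformers/all-MiniLM-L6-v2",
        "version": "1.0.1",
        "model_format": "TORCH_SCRIPT",
    },
    auth=AUTH,
    verify=False,  # the Docker image ships a self-signed certificate
)
task_id = register.json()["task_id"]

# Registration is asynchronous; poll the task to obtain the model_id.
status = requests.get(f"{BASE}/_plugins/_ml/tasks/{task_id}", auth=AUTH, verify=False)
print(status.json())
```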
To assist in this effort, AWS provides a range of generative AI security strategies that you can use to create appropriate threat models. For all data stored in Amazon Bedrock, the AWS shared responsibility model applies.
Build a Search Engine: Deploy Models and Index Data in AWS OpenSearch. We will also provide AWS OpenSearch instructions so you can apply the same setup in the cloud. This is useful for running OpenSearch locally for testing before deploying it on AWS.
You can recreate the example manually or with the AWS Cloud Development Kit (AWS CDK) by following our GitHub repository. However, booking, updating, or canceling a PTO request requires changes to a database and is an action that should be confirmed before execution. With over 10 years of experience in AI/ML.
Today at AWS re:Invent 2024, we are excited to announce a new feature for Amazon SageMaker inference endpoints: the ability to scale SageMaker inference endpoints to zero instances. This long-awaited capability is a game changer for our customers using the power of AI and machine learning (ML) inference in the cloud.
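As a hedged sketch of how this might be wired up: scale-to-zero is exposed through Application Auto Scaling on SageMaker inference components, so registering a component with a minimum capacity of zero should allow it to scale all the way down. The component name and capacity limits below are placeholders:

```python
import boto3

autoscaling = boto3.client("application-autoscaling")

# Placeholder inference component name; scale-to-zero applies per component.
resource_id = "inference-component/my-llm-component"

# Register the component as a scalable target whose copy count may reach zero.
autoscaling.register_scalable_target(
    ServiceNamespace="sagemaker",
    ResourceId=resource_id,
    ScalableDimension="sagemaker:inference-component:DesiredCopyCount",
    MinCapacity=0,  # allows SageMaker to scale the component down to zero
    MaxCapacity=4,
)
```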
NIM is available as a paid offering as part of the NVIDIA AI Enterprise software subscription available on AWS Marketplace. He works with Amazon.com to design, build, and deploy technology solutions on AWS, and has a particular interest in AI and machine learning. Qing Lan is a Software Development Engineer in AWS.
The number of companies launching generative AI applications on AWS is substantial and building quickly, including adidas, Booking.com, Bridgewater Associates, Clariant, Cox Automotive, GoDaddy, and LexisNexis Legal & Professional, to name just a few. Innovative startups like Perplexity AI are going all in on AWS for generative AI.
In this post, we show how you can run Stable Diffusion models and achieve high performance at the lowest cost in Amazon Elastic Compute Cloud (Amazon EC2) using Amazon EC2 Inf2 instances powered by AWS Inferentia2. You can run Stable Diffusion 2.1 versions on AWS Inferentia2 cost-effectively.
When deploying deep learning models at scale, it is crucial to effectively utilize the underlying hardware to maximize performance and cost benefits. In this post, we walk you through the process of deploying FastAPI model servers on AWS Inferentia devices (found on Amazon EC2 Inf1 and Amazon EC2 Inf2 instances).
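To make the idea concrete, here is a minimal sketch of such a server. It assumes a model already traced with torch_neuron and saved as model_neuron.pt; both the file name and the request shape are illustrative:

```python
import torch
import torch_neuron  # noqa: F401  # registers the Neuron backend with TorchScript
from fastapi import FastAPI

app = FastAPI()

# Load the compiled artifact; execution is placed onto a NeuronCore at load time.
model = torch.jit.load("model_neuron.pt")

@app.post("/predict")
def predict(payload: dict):
    # Illustrative preprocessing; a real server would validate and batch inputs.
    inputs = torch.tensor(payload["inputs"])
    with torch.no_grad():
        outputs = model(inputs)
    return {"outputs": outputs.tolist()}
```

Run it with a standard ASGI server, for example `uvicorn app:app --port 8080`.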
From the earliest days, Amazon has used ML for various use cases such as book recommendations, search, and fraud detection. Similar to the rest of the industry, the advancements of accelerated hardware have allowed Amazon teams to pursue model architectures using neural networks and deep learning (DL).
SnapLogic uses Amazon Bedrock to build its platform, capitalizing on the proximity to data already stored in Amazon Web Services (AWS). To address customers’ requirements about data privacy and sovereignty, SnapLogic deploys the data plane within the customer’s VPC on AWS.
In October 2022, we launched Amazon EC2 Trn1 Instances, powered by AWS Trainium, which is the second-generation machine learning accelerator designed by AWS. Trn1 instances are purpose-built for high-performance deep learning model training while offering up to 50% cost-to-train savings over comparable GPU-based instances.
Solution overview The entire infrastructure of the solution is provisioned using the AWS Cloud Development Kit (AWS CDK), which is an infrastructure as code (IaC) framework to programmatically define and deploy AWS resources. The solution uses AWS CDK version 2.0.
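For readers new to the AWS CDK, a minimal CDK v2 Python sketch looks like the following; the stack and bucket here are placeholders, not the solution's actual resources:

```python
from aws_cdk import App, Stack
from aws_cdk import aws_s3 as s3
from constructs import Construct

class SolutionStack(Stack):
    def __init__(self, scope: Construct, construct_id: str, **kwargs) -> None:
        super().__init__(scope, construct_id, **kwargs)
        # Resources are declared as constructs and synthesized to CloudFormation.
        s3.Bucket(self, "ArtifactsBucket", versioned=True)

app = App()
SolutionStack(app, "SolutionStack")
app.synth()  # emits the CloudFormation template under cdk.out/
```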
Some examples include extracting players and positions in an NFL game summary, products mentioned in an AWS keynote transcript, or key names from an article on a favorite tech company. We extract the default generic entities through the AWS SDK for Python (Boto3) as follows:

```python
import boto3
import pandas as pd

comprehend_client = boto3.client("comprehend")
```
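As a minimal, hypothetical continuation of the snippet above, entity detection can then be invoked with detect_entities; the sample text and DataFrame layout are illustrative:

```python
# Illustrative input; replace with your transcript or article text.
text = "Andy Jassy announced new Amazon Bedrock features at AWS re:Invent."

response = comprehend_client.detect_entities(Text=text, LanguageCode="en")

# Collect the default generic entities (PERSON, ORGANIZATION, LOCATION, ...).
df = pd.DataFrame(
    [(e["Text"], e["Type"], round(e["Score"], 3)) for e in response["Entities"]],
    columns=["Text", "Type", "Score"],
)
print(df)
```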
Too many students think that engineering is about getting the answer in the back of the book, not about making the trade-offs that are necessary in the real world. For example, in Topic 1, the skills “AWS” and “cloud” map to the job titles cloud engineer, AWS solutions architect, and technology consultant.
NMT is a deep learning approach to translation that uses neural networks to learn the patterns of human language and generate translations. Offers seamless integration with other AWS services. Imagine LLMs as really smart translators who have been trained on a mountain of books and articles.
Dive into Deep Learning (D2L.ai) is an open-source textbook that makes deep learning accessible to everyone. It is a challenging endeavor to have an online book that is continuously kept up to date, written by multiple authors, and available in multiple languages. In this post, we present a solution that D2L.ai
It uses advanced deep learning technologies to accurately transcribe audio into text. The solution presented in this post is orchestrated using an AWS Step Functions state machine that is triggered when you upload a recording to the designated Amazon Simple Storage Service (Amazon S3) bucket.
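While the post's orchestration runs inside a Step Functions state machine, the core transcription call reduces to something like the following boto3 sketch; the bucket names and job name are hypothetical:

```python
import boto3

transcribe = boto3.client("transcribe")

# In the described architecture, this call would be made by a state machine
# task after an S3 upload event; names below are placeholders.
transcribe.start_transcription_job(
    TranscriptionJobName="meeting-recording-001",
    Media={"MediaFileUri": "s3://my-recordings-bucket/uploads/meeting.mp3"},
    MediaFormat="mp3",
    LanguageCode="en-US",
    OutputBucketName="my-transcripts-bucket",
)
```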
These models are based on deep learning architectures such as Transformers, which can capture the contextual information and relationships between words in a sentence more effectively. You can use it via either the Amazon Bedrock REST API or the AWS SDK. Why do we need an embeddings model? Nitin Eusebius is a Sr.
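As a small illustration of the SDK route, the following sketch invokes a Bedrock embeddings model with boto3; Titan Text Embeddings is used here as an example model, and the Region is a placeholder:

```python
import json
import boto3

bedrock_runtime = boto3.client("bedrock-runtime", region_name="us-east-1")

# Request an embedding for a single piece of text.
body = json.dumps({"inputText": "Why do we need an embeddings model?"})
response = bedrock_runtime.invoke_model(
    modelId="amazon.titan-embed-text-v1",
    contentType="application/json",
    accept="application/json",
    body=body,
)

embedding = json.loads(response["body"].read())["embedding"]
print(len(embedding))  # the vector dimension (1,536 for this model)
```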
Examples of other PBAs now available include AWS Inferentia and AWS Trainium, Google TPU, and Graphcore IPU. Together, these elements led to the start of a period of dramatic progress in ML, with NNs being redubbed deep learning. Thirdly, the presence of GPUs enabled the labeled data to be processed.
LLMs are large deep learning models that are pre-trained on vast amounts of data. The solution uses various AWS services to create an end-to-end system that enables field technicians to capture label images, extract data using AI models, verify the accuracy, and seamlessly update the inventory database.
Model training was accelerated by 50% through the use of the SMDDP library, which includes optimized communication algorithms designed specifically for AWS infrastructure. For SageMaker distributed training, the instances need to be in the same AWS Region and Availability Zone. days in AWS vs. 9 days on their legacy platform).
DNABERT 6 Dataset: For this post, we use the gRNA data released by researchers in a paper about gRNA prediction using deep learning. CRISPRon is a CNN-based deep learning model. We also provided code that can help you jumpstart your biology applications on AWS. Yudi Zhang is an Applied Scientist at AWS marketing.
Originating from advancements in artificial intelligence (AI) and deep learning, these models are designed to understand and translate descriptive text into coherent, aesthetically pleasing music. Obtain the AWS Deep Learning Containers for Large Model Inference from pre-built HuggingFace Inference Containers.
The SMP configuration is as follows: { "hybrid_shard_degree": 16 } To learn more about the advantages of hybrid sharded data parallelism, refer to Near-linear scaling of gigantic-model training on AWS. He leads frameworks, compilers, and optimization techniques for deep learning training.
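For orientation, one plausible way to pass this configuration is through the distribution argument of a SageMaker PyTorch estimator. The estimator settings below (entry point, role, instance types, framework versions) are illustrative assumptions, not the post's exact setup:

```python
from sagemaker.pytorch import PyTorch

estimator = PyTorch(
    entry_point="train.py",                               # placeholder script
    role="arn:aws:iam::123456789012:role/SageMakerRole",  # placeholder role
    instance_type="ml.p4d.24xlarge",
    instance_count=2,
    framework_version="2.2",
    py_version="py310",
    distribution={
        "torch_distributed": {"enabled": True},
        "smdistributed": {
            "modelparallel": {
                "enabled": True,
                # The SMP configuration from above.
                "parameters": {"hybrid_shard_degree": 16},
            }
        },
    },
)
```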
When it comes to deploying models on SageMaker endpoints, you can containerize the models using specialized AWS Deep Learning Container (DLC) images available for popular open source libraries. We then identified two approaches for deploying and inferencing Llama 2 Chat models using AWS DLCs—LMI and Hugging Face TGI.
Knowledge Bases for Amazon Bedrock allows you to build performant and customized Retrieval Augmented Generation (RAG) applications on top of AWS and third-party vector stores using both AWS and third-party models. You can also use the StartIngestionJob API to trigger the sync via the AWS SDK.
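As a brief sketch of that sync call with boto3 (the knowledge base and data source IDs are placeholders):

```python
import boto3

# Knowledge Base management operations live on the bedrock-agent client.
bedrock_agent = boto3.client("bedrock-agent")

response = bedrock_agent.start_ingestion_job(
    knowledgeBaseId="KB123EXAMPLE",  # placeholder ID
    dataSourceId="DS456EXAMPLE",     # placeholder ID
)
print(response["ingestionJob"]["status"])  # e.g., "STARTING"
```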
Solution overview Scalable Capital’s ML infrastructure consists of two AWS accounts: one as an environment for the development stage and the other one for the production stage. To learn more about Hugging Face and SageMaker, refer to the following resources: Use Hugging Face with Amazon SageMaker What are AWS Deep Learning Containers?
Working with the AWS Generative AI Innovation Center, DoorDash built a solution to provide Dashers with a low-latency self-service voice experience to answer frequently asked questions, reducing the need for live agent assistance, in just 2 months. You can deploy the solution in your own AWS account and try the example solution.
Good morning, fellow learners. If you’ve enjoyed the list of courses at Gen AI 360, wait for this… Today, I am super excited to finally announce that we at towards_AI have released our first book: Building LLMs for Production. This 470-page book is all about LLMs and how to work with them. Get your copy now!
To scale the proposed solution for production and streamline the deployment of AI models in the AWS environment, we demonstrate it using SageMaker endpoints. Prerequisites We have developed an AWS CloudFormation template that will create the SageMaker notebooks used to deploy the endpoints and run inference.
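If you prefer to launch the template programmatically rather than through the console, a generic boto3 sketch follows; the stack name and template URL are placeholders for wherever the post's template is hosted:

```python
import boto3

cloudformation = boto3.client("cloudformation")

cloudformation.create_stack(
    StackName="sagemaker-endpoints-demo",
    TemplateURL="https://example-bucket.s3.amazonaws.com/template.yaml",
    Capabilities=["CAPABILITY_NAMED_IAM"],  # the template creates IAM roles
)

# Block until stack creation completes.
waiter = cloudformation.get_waiter("stack_create_complete")
waiter.wait(StackName="sagemaker-endpoints-demo")
```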
How I cleared the AWS Machine Learning Specialty exam with three weeks of preparation (and will bust some myths about the online exam): how I prepared for the test, my emotional journey during preparation, and my actual exam experience. I recently took and passed the AWS ML Specialty certification on December 29, 2022.
About the Authors Abhi Shivaditya is a Senior Solutions Architect at AWS, working with strategic global enterprise organizations to facilitate the adoption of AWS services in areas such as Artificial Intelligence, distributed computing, networking, and storage. Dhawal Patel is a Principal Machine Learning Architect at AWS.
However, as the size and complexity of the deep learning models that power generative AI continue to grow, deployment can be a challenging task. Then, we highlight how Amazon SageMaker large model inference deep learning containers (LMI DLCs) can help with optimization and deployment.
Recent scientific breakthroughs in deep learning (DL), large language models (LLMs), and generative AI are allowing customers to use advanced state-of-the-art solutions with almost human-like performance. In this post, we show how to run multiple deep learning ensemble models on a GPU instance with a SageMaker MME.
To remain competitive, capital markets firms are adopting Amazon Web Services (AWS) Cloud services across the trade lifecycle to rearchitect their infrastructure, remove capacity constraints, accelerate innovation, and optimize costs. trillion in assets across thousands of accounts worldwide.
SageMaker JumpStart serves as a model hub encapsulating a broad array of deep learning models for text, vision, audio, and embedding use cases. With over 500 models, its model hub comprises both public and proprietary models from AWS’s partners such as AI21, Stability AI, Cohere, and LightOn.