AWS, Clustering and Natural Language Processing

Deploy Meta Llama 3.1 models cost-effectively in Amazon SageMaker JumpStart with AWS Inferentia and AWS Trainium

AWS Machine Learning Blog

NOVEMBER 25, 2024

8B and 70B inference support on AWS Trainium and AWS Inferentia instances in Amazon SageMaker JumpStart. multilingual large language models (LLMs) are a collection of pre-trained and instruction tuned generative models. An AWS Identity and Access Management (IAM) role to access SageMaker. Meta Llama 3.1 by up to 50%.

AWS

AWS Python ML ML

Build a Search Engine: Setting Up AWS OpenSearch

Flipboard

MAY 5, 2025

Home Table of Contents Build a Search Engine: Setting Up AWS OpenSearch Introduction What Is AWS OpenSearch? What AWS OpenSearch Is Commonly Used For Key Features of AWS OpenSearch How Does AWS OpenSearch Work? Why Use AWS OpenSearch for Semantic Search? Looking for the source code to this post?

AWS

AWS Clustering Deep Learning Deep Learning

Syngenta develops a generative AI assistant to support sales representatives using Amazon Bedrock Agents

Flipboard

DECEMBER 3, 2024

Syngenta and AWS collaborated to develop Cropwise AI , an innovative solution powered by Amazon Bedrock Agents , to accelerate their sales reps’ ability to place Syngenta seed products with growers across North America. The collaboration between Syngenta and AWS showcases the transformative power of LLMs and AI agents.

AWS

AWS AI AI Machine Learning

Webinars

What’s New in Apache Airflow® 3.0—And How Will It Reshape Your Data Workflows?

MORE WEBINARS

Transforming financial analysis with CreditAI on Amazon Bedrock: Octus’s journey with AWS

AWS Machine Learning Blog

MARCH 10, 2025

We walk through the journey Octus took from managing multiple cloud providers and costly GPU instances to implementing a streamlined, cost-effective solution using AWS services including Amazon Bedrock, AWS Fargate , and Amazon OpenSearch Service. Along the way, it also simplified operations as Octus is an AWS shop more generally.

AWS

AWS Database AI AI

How Aetion is using generative AI and Amazon Bedrock to unlock hidden insights about patient populations

AWS Machine Learning Blog

JANUARY 30, 2025

Smart Subgroups For a user-specified patient population, the Smart Subgroups feature identifies clusters of patients with similar characteristics (for example, similar prevalence profiles of diagnoses, procedures, and therapies). The features are stored in Amazon S3 and encrypted with AWS Key Management Service (AWS KMS) for downstream use.

Clustering

Clustering Natural Language Processing AI AI

How Fastweb fine-tuned the Mistral model using Amazon SageMaker HyperPod as a first step to build an Italian large language model

AWS Machine Learning Blog

DECEMBER 18, 2024

Training an LLM is a compute-intensive and complex process, which is why Fastweb, as a first step in their AI journey, used AWS generative AI and machine learning (ML) services such as Amazon SageMaker HyperPod. The team opted for fine-tuning on AWS.

Clustering

Clustering AWS AI AI

Accelerating Mixtral MoE fine-tuning on Amazon SageMaker with QLoRA

AWS Machine Learning Blog

NOVEMBER 22, 2024

Although QLoRA helps optimize memory during fine-tuning, we will use Amazon SageMaker Training to spin up a resilient training cluster, manage orchestration, and monitor the cluster for failures. To take complete advantage of this multi-GPU cluster, we use the recent support of QLoRA and PyTorch FSDP. 24xlarge compute instance.

Clustering

Clustering AWS ML ML

Train, optimize, and deploy models on edge devices using Amazon SageMaker and Qualcomm AI Hub

AWS Machine Learning Blog

OCTOBER 18, 2024

We demonstrate this solution by walking you through a comprehensive step-by-step guide on how to fine-tune YOLOv8 , a real-time object detection model, on Amazon Web Services (AWS) using a custom dataset. The process uses a single ml.g5.2xlarge instance (providing one NVIDIA A10G Tensor Core GPU) with SageMaker for fine-tuning.

AWS

AWS AI AI Machine Learning

Sprinklr improves performance by 20% and reduces cost by 25% for machine learning inference on AWS Graviton3

AWS Machine Learning Blog

JUNE 11, 2024

Sprinklr’s specialized AI models streamline data processing, gather valuable insights, and enable workflows and analytics at scale to drive better decision-making and productivity. During this journey, we collaborated with our AWS technical account manager and the Graviton software engineering teams.

Machine Learning

Machine Learning Machine Learning AWS Natural Language Processing

Detect hallucinations for RAG-based systems

Flipboard

MAY 16, 2025

Prerequisites To use the methods presented in this post, you need an AWS account with access to Amazon SageMaker , Amazon Bedrock , and Amazon Simple Storage Service (Amazon S3). Statement: 'AWS is Amazon subsidiary that provides cloud computing services.' Finally, we compare approaches in terms of their performance and latency.

AWS

AWS Cloud Computing Natural Language Processing AI

How Lumi streamlines loan approvals with Amazon SageMaker AI

AWS Machine Learning Blog

APRIL 4, 2025

To achieve this, Lumi developed a classification model based on BERT (Bidirectional Encoder Representations from Transformers) , a state-of-the-art natural language processing (NLP) technique. The pipeline leverages several AWS services familiar to Lumis team. Follow him on LinkedIn.

AI

AI AI Machine Learning Machine Learning

The future of productivity agents with NinjaTech AI and AWS Trainium

AWS Machine Learning Blog

JUNE 27, 2024

In this post, we describe how we built our cutting-edge productivity agent NinjaLLM, the backbone of MyNinja.ai, using AWS Trainium chips. For training, we chose to use a cluster of trn1.32xlarge instances to take advantage of Trainium chips. We used a cluster of 32 instances in order to efficiently parallelize the training.

AWS

AWS AI AI Clustering

Fast and cost-effective LLaMA 2 fine-tuning with AWS Trainium

AWS Machine Learning Blog

OCTOBER 5, 2023

In this post, we walk through how to fine-tune Llama 2 on AWS Trainium , a purpose-built accelerator for LLM training, to reduce training times and costs. We review the fine-tuning scripts provided by the AWS Neuron SDK (using NeMo Megatron-LM), the various configurations we used, and the throughput results we saw.

AWS

AWS Machine Learning Machine Learning Deep Learning

Reduce energy consumption of your machine learning workloads by up to 90% with AWS purpose-built accelerators

Flipboard

JUNE 20, 2023

For reference, GPT-3, an earlier generation LLM has 175 billion parameters and requires months of non-stop training on a cluster of thousands of accelerated processors. The Carbontracker study estimates that training GPT-3 from scratch may emit up to 85 metric tons of CO2 equivalent, using clusters of specialized hardware accelerators.

AWS

AWS Machine Learning Machine Learning ML

Build enterprise-ready generative AI solutions with Cohere foundation models in Amazon Bedrock and Weaviate vector database on AWS Marketplace

AWS Machine Learning Blog

JANUARY 24, 2024

We demonstrate how to build an end-to-end RAG application using Cohere’s language models through Amazon Bedrock and a Weaviate vector database on AWS Marketplace. Cohere Embed comes in two forms, an English language model and a multilingual model, both of which are now available on Amazon Bedrock.

AWS

AWS Database AI AI

Develop and train large models cost-efficiently with Metaflow and AWS Trainium

AWS Machine Learning Blog

APRIL 29, 2024

Historically, natural language processing (NLP) would be a primary research and development expense. In 2024, however, organizations are using large language models (LLMs), which require relatively little focus on NLP, shifting research and development from modeling to the infrastructure needed to support LLM workflows.

AWS

AWS ML ML Python

Simple guide to training Llama 2 with AWS Trainium on Amazon SageMaker

AWS Machine Learning Blog

MAY 1, 2024

Llama2 by Meta is an example of an LLM offered by AWS. Llama 2 is an auto-regressive language model that uses an optimized transformer architecture and is intended for commercial and research use in English. Virginia) and US West (Oregon) AWS Regions, and most recently announced general availability in the US East (Ohio) Region.

AWS

AWS ML ML Clustering

How Apoidea Group enhances visual information extraction from banking documents with multimodal models using LLaMA-Factory on Amazon SageMaker HyperPod

AWS Machine Learning Blog

MAY 15, 2025

Amazon SageMaker HyperPod offers an effective solution for provisioning resilient clusters to run ML workloads and develop state-of-the-art models. Whether youre processing financial statements, KYC documents, or complex reports, we encourage you to evaluate its potential for optimizing your document workflows.

AWS

AWS ML ML Machine Learning

Retrieval-Augmented Generation with LangChain, Amazon SageMaker JumpStart, and MongoDB Atlas semantic search

Flipboard

NOVEMBER 17, 2023

Set up a MongoDB cluster To create a free tier MongoDB Atlas cluster, follow the instructions in Create a Cluster. Specify the AWS Lambda function that will interact with MongoDB Atlas and the LLM to provide responses. Delete the MongoDB Atlas cluster. As always, AWS welcomes feedback.

K-nearest Neighbors

K-nearest Neighbors AWS Clustering Database

Deploy pre-trained models on AWS Wavelength with 5G edge using Amazon SageMaker JumpStart

AWS Machine Learning Blog

APRIL 7, 2023

Retailers can deliver more frictionless experiences on the go with natural language processing (NLP), real-time recommendation systems, and fraud detection. In this post, we demonstrate how to deploy a SageMaker model to AWS Wavelength to reduce model inference latency for 5G network-based applications. Choose Manage.

AWS

AWS Clustering ML ML

Fine-tune and deploy Llama 2 models cost-effectively in Amazon SageMaker JumpStart with AWS Inferentia and AWS Trainium

AWS Machine Learning Blog

JANUARY 17, 2024

Today, we’re excited to announce the availability of Llama 2 inference and fine-tuning support on AWS Trainium and AWS Inferentia instances in Amazon SageMaker JumpStart. In this post, we demonstrate how to deploy and fine-tune Llama 2 on Trainium and AWS Inferentia instances in SageMaker JumpStart.

AWS

AWS Python Machine Learning Machine Learning

Multi-tenancy in RAG applications in a single Amazon Bedrock knowledge base with metadata filtering

AWS Machine Learning Blog

APRIL 7, 2025

Amazon Bedrock is a fully managed service that offers a choice of high-performing foundation models (FMs) from leading AI companies and AWS. Solution overview The following diagram provides a high-level overview of AWS services and features through a sample use case.

Database

Database AWS Natural Language Processing AI

Use LangChain with PySpark to process documents at massive scale with Amazon SageMaker Studio and Amazon EMR Serverless

AWS Machine Learning Blog

SEPTEMBER 3, 2024

Cost optimization – The serverless nature of the integration means you only pay for the compute resources you use, rather than having to provision and maintain a persistent cluster. This same interface is also used for provisioning EMR clusters. This same interface is also used for provisioning EMR clusters.

AWS

AWS Clustering Big Data Big Data

How Untold Studios empowers artists with an AI assistant built on Amazon Bedrock

AWS Machine Learning Blog

FEBRUARY 7, 2025

Key AWS services used include: Amazon Bedrock Including Anthropics Claude 3.5 Sonnet model for natural language processing. This means artists can focus on their creative process rather than worrying about precise phrasing or navigating complex menu structures.

AWS

AWS AI AI Python

Pre-training genomic language models using AWS HealthOmics and Amazon SageMaker

AWS Machine Learning Blog

MAY 31, 2024

Genomic language models are a new and exciting field in the application of large language models to challenges in genomics. In this blog post and open source project , we show you how you can pre-train a genomics language model, HyenaDNA , using your genomic data in the AWS Cloud.

AWS

AWS ML ML Machine Learning

Deploy DeepSeek-R1 distilled models on Amazon SageMaker using a Large Model Inference container

AWS Machine Learning Blog

MARCH 11, 2025

The MoE architecture allows activation of 37 billion parameters, enabling efficient inference by routing queries to the most relevant expert clusters. By integrating this model with Amazon SageMaker AI , you can benefit from the AWS scalable infrastructure while maintaining high-quality language model capabilities.

AWS

AWS ML ML Natural Language Processing

Scale LLMs with PyTorch 2.0 FSDP on Amazon EKS – Part 2

AWS Machine Learning Blog

APRIL 1, 2024

Machine learning (ML) research has proven that large language models (LLMs) trained with significantly large datasets result in better model quality. Distributed model training requires a cluster of worker nodes that can scale. The following figure shows how FSDP works for two data parallel processes.

Clustering

Clustering AWS ML ML

Monitor embedding drift for LLMs deployed from Amazon SageMaker JumpStart

AWS Machine Learning Blog

FEBRUARY 2, 2024

Embeddings capture the information content in bodies of text, allowing natural language processing (NLP) models to work with language in a numeric form. Then we use K-Means to identify a set of cluster centers. A visual representation of the silhouette score can be seen in the following figure.

AWS

AWS Clustering ETL Database

Connecting Amazon Redshift and RStudio on Amazon SageMaker

AWS Machine Learning Blog

DECEMBER 29, 2022

AWS offers tools such as RStudio on SageMaker and Amazon Redshift to help tackle these challenges. Note: If you already have an RStudio domain and Amazon Redshift cluster you can skip this step. Amazon Redshift Serverless cluster. I acknowledge that AWS CloudFormation might create IAM resources with custom names.

AWS

AWS Machine Learning Machine Learning Clustering

Build agentic AI solutions with DeepSeek-R1, CrewAI, and Amazon SageMaker AI

Flipboard

FEBRUARY 10, 2025

These services support single GPU to HyperPods (cluster of GPUs) for training and include built-in FMOps tools for tracking, debugging, and deployment. Solution overview CrewAI provides a robust framework for developing multi-agent systems that integrate with AWS services, particularly SageMaker AI.

AI

AI AI AWS ML

Getting started with Amazon Titan Text Embeddings

AWS Machine Learning Blog

JANUARY 31, 2024

Embeddings play a key role in natural language processing (NLP) and machine learning (ML). Text embedding refers to the process of transforming text into numerical representations that reside in a high-dimensional vector space. You can use it via either the Amazon Bedrock REST API or the AWS SDK.

Natural Language Processing

Natural Language Processing AWS Machine Learning Machine Learning

How Cisco accelerated the use of generative AI with Amazon SageMaker Inference

AWS Machine Learning Blog

AUGUST 8, 2024

Webex’s focus on delivering inclusive collaboration experiences fuels their innovation, which uses artificial intelligence (AI) and machine learning (ML), to remove the barriers of geography, language, personality, and familiarity with technology. Webex works with the world’s leading business and productivity apps—including AWS.

AWS

AWS AI AI Clustering

Connect Amazon EMR and RStudio on Amazon SageMaker

AWS Machine Learning Blog

APRIL 17, 2023

Using RStudio on SageMaker and Amazon EMR together, you can continue to use the RStudio IDE for analysis and development, while using Amazon EMR managed clusters for larger data processing. In this post, we demonstrate how you can connect your RStudio on SageMaker domain with an EMR cluster. Choose Create stack.

Clustering

Clustering AWS Machine Learning Machine Learning

eSentire delivers private and secure generative AI interactions to customers with Amazon SageMaker

AWS Machine Learning Blog

JUNE 21, 2024

To accomplish this, eSentire built AI Investigator, a natural language query tool for their customers to access security platform data by using AWS generative artificial intelligence (AI) capabilities. The additional benefit of SageMaker notebook instances is its streamlined integration with eSentire’s AWS environment.

AWS

AWS AI AI Natural Language Processing

Optimizing AI implementation costs with Automat-it

AWS Machine Learning Blog

FEBRUARY 28, 2025

This post was written by Claudiu Bota, Oleg Yurchenko, and Vladyslav Melnyk of AWS Partner Automat-it. As organizations adopt AI and machine learning (ML), theyre using these technologies to improve processes and enhance products. This approach was inspired by several AWS blog posts that can be found in the references section.

AWS

AWS AI AI Deep Learning

Training large language models on Amazon SageMaker: Best practices

AWS Machine Learning Blog

MARCH 6, 2023

These factors require training an LLM over large clusters of accelerated machine learning (ML) instances. In the past few years, numerous customers have been using the AWS Cloud for LLM training. We recommend working with your AWS account team or contacting AWS Sales to determine the appropriate Region for your LLM workload.

AWS

AWS Clustering ML ML

Scalable training platform with Amazon SageMaker HyperPod for innovation: a video generation case study

AWS Machine Learning Blog

SEPTEMBER 26, 2024

However, building large distributed training clusters is a complex and time-intensive process that requires in-depth expertise. Clusters are provisioned with the instance type and count of your choice and can be retained across workloads. As a result of this flexibility, you can adapt to various scenarios.

Clustering

Clustering Algorithm ML ML

Alida gains deeper understanding of customer feedback with Amazon Bedrock

AWS Machine Learning Blog

MARCH 4, 2024

However, when employing the use of traditional natural language processing (NLP) models, they found that these solutions struggled to fully understand the nuanced feedback found in open-ended survey responses. About the authors Kinman Lam is an ISV/DNB Solution Architect for AWS.

AWS

AWS ML ML Machine Learning

Architect personalized generative AI SaaS applications on Amazon SageMaker

Flipboard

MARCH 9, 2023

In this post, we review the technical requirements and application design considerations for fine-tuning and serving hyper-personalized AI models at scale on AWS. For example, the G5 instance type, which is equipped with NVIDIA A10g GPUs (unique to AWS), offers a strong price-performance ratio, both for model training and hosting.

AWS

AWS AI AI ML

Accelerate hyperparameter grid search for sentiment analysis with BERT models using Weights & Biases, Amazon EKS, and TorchElastic

AWS Machine Learning Blog

MARCH 2, 2023

In our solution, we implement a hyperparameter grid search on an EKS cluster for tuning a bert-base-cased model for classifying positive or negative sentiment for stock market data headlines. A desired cluster can simply be configured using the eks.conf file and launched by running the eks-create.sh to launch the cluster.

Clustering

Clustering AWS Deep Learning Deep Learning

CRISPR-Cas9 guide RNA efficiency prediction with efficiently tuned models in Amazon SageMaker

AWS Machine Learning Blog

SEPTEMBER 16, 2024

The clustered regularly interspaced short palindromic repeat (CRISPR) technology holds the promise to revolutionize gene editing technologies, which is transformative to the way we understand and treat diseases. We also provided code that can help you jumpstart your biology applications in AWS.

Natural Language Processing

Natural Language Processing AWS Deep Learning Deep Learning

Deploy a Hugging Face (PyAnnote) speaker diarization model on Amazon SageMaker as an asynchronous endpoint

AWS Machine Learning Blog

APRIL 25, 2024

We provide a comprehensive guide on how to deploy speaker segmentation and clustering solutions using SageMaker on the AWS Cloud. Solution overview Amazon Transcribe is the go-to service for speaker diarization in AWS. Make sure the AWS account has a service quota for hosting a SageMaker endpoint for an ml.g5.2xlarge instance.

AWS

AWS ML ML Python

Techniques for automatic summarization of documents using language models

Flipboard

DECEMBER 6, 2023

Tools like LangChain , combined with a large language model (LLM) powered by Amazon Bedrock or Amazon SageMaker JumpStart , simplify the implementation process. Click here to open the AWS console and follow along. The model then uses a clustering algorithm to group the sentences into clusters.

AWS

AWS Clustering Artificial Intelligence Artificial Intelligence

Fine-tune multimodal models for vision and text use cases on Amazon SageMaker JumpStart

AWS Machine Learning Blog

NOVEMBER 15, 2024

Prerequisites To try out this solution using SageMaker JumpStart, you need the following prerequisites: An AWS account that will contain all of your AWS resources. An AWS Identity and Access Management (IAM) role to access SageMaker. Appendix Language models such as Meta Llama are more than 10 GB or even 100 GB in size.

ML

ML ML Python AWS

Deploy Meta Llama 3.1 models cost-effectively in Amazon SageMaker JumpStart with AWS Inferentia and AWS Trainium

Build a Search Engine: Setting Up AWS OpenSearch

Webinars

Trending Sources

Syngenta develops a generative AI assistant to support sales representatives using Amazon Bedrock Agents

Webinars

Transforming financial analysis with CreditAI on Amazon Bedrock: Octus’s journey with AWS

How Aetion is using generative AI and Amazon Bedrock to unlock hidden insights about patient populations

How Fastweb fine-tuned the Mistral model using Amazon SageMaker HyperPod as a first step to build an Italian large language model

Accelerating Mixtral MoE fine-tuning on Amazon SageMaker with QLoRA

Train, optimize, and deploy models on edge devices using Amazon SageMaker and Qualcomm AI Hub

Sprinklr improves performance by 20% and reduces cost by 25% for machine learning inference on AWS Graviton3

Detect hallucinations for RAG-based systems

How Lumi streamlines loan approvals with Amazon SageMaker AI

The future of productivity agents with NinjaTech AI and AWS Trainium

Fast and cost-effective LLaMA 2 fine-tuning with AWS Trainium

Reduce energy consumption of your machine learning workloads by up to 90% with AWS purpose-built accelerators

Build enterprise-ready generative AI solutions with Cohere foundation models in Amazon Bedrock and Weaviate vector database on AWS Marketplace

Develop and train large models cost-efficiently with Metaflow and AWS Trainium

Simple guide to training Llama 2 with AWS Trainium on Amazon SageMaker

How Apoidea Group enhances visual information extraction from banking documents with multimodal models using LLaMA-Factory on Amazon SageMaker HyperPod

Retrieval-Augmented Generation with LangChain, Amazon SageMaker JumpStart, and MongoDB Atlas semantic search

Deploy pre-trained models on AWS Wavelength with 5G edge using Amazon SageMaker JumpStart

Fine-tune and deploy Llama 2 models cost-effectively in Amazon SageMaker JumpStart with AWS Inferentia and AWS Trainium

Multi-tenancy in RAG applications in a single Amazon Bedrock knowledge base with metadata filtering

Use LangChain with PySpark to process documents at massive scale with Amazon SageMaker Studio and Amazon EMR Serverless

How Untold Studios empowers artists with an AI assistant built on Amazon Bedrock

Pre-training genomic language models using AWS HealthOmics and Amazon SageMaker

Deploy DeepSeek-R1 distilled models on Amazon SageMaker using a Large Model Inference container

Scale LLMs with PyTorch 2.0 FSDP on Amazon EKS – Part 2

Monitor embedding drift for LLMs deployed from Amazon SageMaker JumpStart

Connecting Amazon Redshift and RStudio on Amazon SageMaker

Build agentic AI solutions with DeepSeek-R1, CrewAI, and Amazon SageMaker AI

Getting started with Amazon Titan Text Embeddings

How Cisco accelerated the use of generative AI with Amazon SageMaker Inference

Connect Amazon EMR and RStudio on Amazon SageMaker

eSentire delivers private and secure generative AI interactions to customers with Amazon SageMaker

Optimizing AI implementation costs with Automat-it

Training large language models on Amazon SageMaker: Best practices

Scalable training platform with Amazon SageMaker HyperPod for innovation: a video generation case study

Alida gains deeper understanding of customer feedback with Amazon Bedrock

Architect personalized generative AI SaaS applications on Amazon SageMaker

Accelerate hyperparameter grid search for sentiment analysis with BERT models using Weights & Biases, Amazon EKS, and TorchElastic

CRISPR-Cas9 guide RNA efficiency prediction with efficiently tuned models in Amazon SageMaker

Deploy a Hugging Face (PyAnnote) speaker diarization model on Amazon SageMaker as an asynchronous endpoint

Techniques for automatic summarization of documents using language models

Fine-tune multimodal models for vision and text use cases on Amazon SageMaker JumpStart

Stay Connected