article thumbnail

Fast and cost-effective LLaMA 2 fine-tuning with AWS Trainium

AWS Machine Learning Blog

In this post, we walk through how to fine-tune Llama 2 on AWS Trainium , a purpose-built accelerator for LLM training, to reduce training times and costs. We review the fine-tuning scripts provided by the AWS Neuron SDK (using NeMo Megatron-LM), the various configurations we used, and the throughput results we saw.

AWS 98
article thumbnail

Build a medical imaging AI inference pipeline with MONAI Deploy on AWS

AWS Machine Learning Blog

AWS and NVIDIA have come together to make this vision a reality. AWS, NVIDIA, and other partners build applications and solutions to make healthcare more accessible, affordable, and efficient by accelerating cloud connectivity of enterprise imaging. AHI provides API access to ImageSet metadata and ImageFrames.

AWS 96
professionals

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

article thumbnail

Generative AI and multi-modal agents in AWS: The key to unlocking new value in financial markets

AWS Machine Learning Blog

Implementing a multi-modal agent with AWS consolidates key insights from diverse structured and unstructured data on a large scale. All this is achieved using AWS services, thereby increasing the financial analyst’s efficiency to analyze multi-modal financial data (text, speech, and tabular data) holistically.

AWS 102
article thumbnail

Bundesliga Match Fact Keeper Efficiency: Comparing keepers’ performances objectively using machine learning on AWS

AWS Machine Learning Blog

Not only was he widely considered the top-rated goalkeeper in the league during the 2021/22 season, but he also held that title back in 2018/19 when Eintracht Frankfurt reached the Europa League semifinals. The BMF logic itself (except for the ML model) runs on an AWS Fargate container.

article thumbnail

Deploy large language models for a healthtech use case on Amazon SageMaker

AWS Machine Learning Blog

We implemented the solution using the AWS Cloud Development Kit (AWS CDK). One of the more popular and useful of the transformer architectures, Bidirectional Encoder Representations from Transformers (BERT), is a language representation model that was introduced in 2018. The first GPT model was introduced in 2018 by OpenAI.

AWS 99
article thumbnail

Industry Pulse April 2018 Highlights

DataRobot Blog

This past month we had news from SAS Global Forum, Microstrategy, Oracle, AWS, Google, Qlik Qonnections, Tableau and several other smaller vendors. by Jen Underwood. Fallout from the March Facebook scandal continued while GDPR. Read More.

Tableau 40
article thumbnail

Incorporate offline and online human – machine workflows into your generative AI applications on AWS

AWS Machine Learning Blog

We present the solution and provide an example by simulating a case where the tier one AWS experts are notified to help customers using a chat-bot. We provide LangChain and AWS SDK code-snippets, architecture and discussions to guide you on this important topic. Here, we use the on-demand option.

AWS 81