AWS provides a powerful set of tools and services that simplify the process of building and deploying generative AI applications, even for those with limited experience in frontend and backend development. The Python application uses the Streamlit library to provide a user-friendly interface for interacting with a generative AI model.
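A minimal sketch of what such a Streamlit front end can look like, assuming Amazon Bedrock as the backing service; the model ID and region are illustrative, not necessarily the post's exact choices:

```python
import boto3
import streamlit as st

# Assumes Bedrock access in us-east-1; the model ID is an example.
bedrock = boto3.client("bedrock-runtime", region_name="us-east-1")

st.title("Generative AI Demo")
prompt = st.text_area("Enter your prompt")

if st.button("Generate") and prompt:
    # The Converse API gives a model-agnostic request/response shape.
    response = bedrock.converse(
        modelId="anthropic.claude-3-haiku-20240307-v1:0",
        messages=[{"role": "user", "content": [{"text": prompt}]}],
    )
    st.write(response["output"]["message"]["content"][0]["text"])
```

Run it with `streamlit run app.py`; Streamlit re-executes the script on each interaction, which keeps the UI logic in plain Python.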
To simplify infrastructure setup and accelerate distributed training, AWS introduced Amazon SageMaker HyperPod in late 2023. In this blog post, we showcase how you can perform efficient supervised fine-tuning of a Meta Llama 3 model using PEFT on AWS Trainium with SageMaker HyperPod.
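The post's training stack is Neuron-specific, but the PEFT idea itself can be sketched with the Hugging Face peft library on any backend; the model name and LoRA settings here are illustrative assumptions:

```python
from peft import LoraConfig, get_peft_model
from transformers import AutoModelForCausalLM

model = AutoModelForCausalLM.from_pretrained("meta-llama/Meta-Llama-3-8B")

lora = LoraConfig(
    r=16,                                 # adapter rank
    lora_alpha=32,                        # scaling factor
    target_modules=["q_proj", "v_proj"],  # attention projections to adapt
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora)
model.print_trainable_parameters()  # only a small fraction of weights train
```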
It simplifies the often complex and time-consuming tasks involved in setting up and managing an MLflow environment, allowing ML administrators to quickly establish secure and scalable MLflow environments on AWS. For example, you can give users access permission to download popular packages and customize the development environment.
In this post, we explore how to use the power of AWS Resilience Hub and Amazon Bedrock to bridge this gap and streamline the process of sharing architectural findings across your organization. Prerequisites for this walkthrough: an AWS account, AWS Management Console access, and Python 3.12.
AWS has always provided customers with choice. In terms of hardware, in addition to NVIDIA GPUs and AWS custom AI chips, CPU-based instances (thanks to the latest innovations in CPU hardware) are an additional choice for customers who want to run generative AI inference, such as hosting small language models and asynchronous agents.
We're excited to announce the open source release of AWS MCP Servers for code assistants, a suite of specialized Model Context Protocol (MCP) servers that bring Amazon Web Services (AWS) best practices directly to your development workflow. This post is the first in a series covering AWS MCP Servers.
Streamlit is an open source framework that lets data scientists efficiently create interactive web-based data applications in pure Python. Prerequisites: create and activate an AWS account, make sure your AWS credentials are configured correctly, and install Python 3.7.
This post is co-authored by Manuel Lopez Roldan, SiMa.ai, and Jason Westra, AWS Senior Solutions Architect. Prerequisites: before you get started, make sure you have an AWS account (if you don't have an AWS account, you can create one), plus the AWS Command Line Interface (AWS CLI), Docker, and Git installed locally.
To address this need, the AWS generative AI best practices framework was launched within AWS Audit Manager, enabling auditing and monitoring of generative AI applications. Figure 1 depicts the system's functionalities and AWS services. Select AWS Generative AI Best Practices Framework for assessment, then choose Create assessment.
Using vLLM on AWS Trainium and Inferentia makes it possible to host LLMs for high-performance inference and scalability. In the following sections, you will be guided through using vLLM on an AWS Inferentia EC2 instance to deploy Meta's newest Llama 3.2 models; you will use an inf2.xlarge instance.
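A sketch of serving a model through vLLM's Neuron backend; flag names and supported values vary by vLLM version, so treat this as illustrative:

```python
from vllm import LLM, SamplingParams

# device="neuron" selects the AWS Neuron backend (Trainium/Inferentia);
# inf2.xlarge exposes 2 NeuronCores, hence tensor_parallel_size=2.
llm = LLM(
    model="meta-llama/Llama-3.2-1B",  # example model
    device="neuron",
    tensor_parallel_size=2,
    max_num_seqs=8,
    max_model_len=128,
)

outputs = llm.generate(["What is Inferentia?"], SamplingParams(max_tokens=64))
print(outputs[0].outputs[0].text)
```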
In this post, we show you how to integrate the popular Slack messaging service with AWS generative AI services to build a natural language assistant where business users can ask questions of an unstructured dataset. In this solution, we have enabled the AWS-provided profanity filter.
Today, AWS AI released GraphStorm v0.4. Prerequisites: to run this example, you will need an AWS account, an Amazon SageMaker Studio domain, and the necessary permissions to run BYOC SageMaker jobs. Using SageMaker Pipelines to train models provides several benefits, like reduced costs, auditability, and lineage tracking.
Hybrid architecture with AWS Local Zones: to minimize the impact of network latency on TTFT for users regardless of their location, a hybrid architecture can be implemented by extending AWS services from commercial Regions to edge locations closer to end users. Next, create a subnet inside each Local Zone.
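Creating the Local Zone subnet can be done with boto3 by passing the Local Zone name as the AvailabilityZone; the VPC ID, CIDR block, and zone below are placeholders (the zone must already be opted in):

```python
import boto3

ec2 = boto3.client("ec2", region_name="us-east-1")

# A Local Zone behaves like an extra AZ of its parent Region, so a subnet
# is extended into it simply by naming the zone here.
resp = ec2.create_subnet(
    VpcId="vpc-0123456789abcdef0",        # hypothetical VPC
    CidrBlock="10.0.8.0/24",
    AvailabilityZone="us-east-1-bos-1a",  # example Local Zone (Boston)
)
print(resp["Subnet"]["SubnetId"])
```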
Deploying the 70B model through SageMaker JumpStart offers two convenient approaches: using the intuitive SageMaker JumpStart UI or deploying programmatically through the SageMaker Python SDK. Prior to joining AWS, Dr. Li held data science roles in the financial and retail industries. Adriana Simmons is a Senior Product Marketing Manager at AWS.
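The programmatic path can look roughly like this with the SageMaker Python SDK; the JumpStart model ID is a placeholder for the 70B variant the post deploys:

```python
from sagemaker.jumpstart.model import JumpStartModel

# accept_eula=True acknowledges the model's end-user license agreement.
model = JumpStartModel(model_id="meta-textgeneration-llama-3-70b")  # hypothetical ID
predictor = model.deploy(accept_eula=True)

print(predictor.predict({"inputs": "Summarize what SageMaker JumpStart does."}))
```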
Today at AWS re:Invent 2024, we are excited to announce the new Container Caching capability in Amazon SageMaker, which significantly reduces the time required to scale generative AI models for inference. Container Caching addresses this scaling challenge by pre-caching the container image, eliminating the need to download it when scaling up.
Today at AWS re:Invent 2024, we are excited to announce a new capability in Amazon SageMaker Inference that significantly reduces the time required to deploy and scale LLMs for inference using LMI: Fast Model Loader. To reduce the time it takes to download and load the container image, SageMaker now supports container caching.
We demonstrate this solution by walking you through a comprehensive step-by-step guide on how to fine-tune YOLOv8, a real-time object detection model, on Amazon Web Services (AWS) using a custom dataset. Call the SageMaker control plane API using the SageMaker Python SDK for model training. Prerequisites include an AWS account.
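Calling the control plane through the SDK typically means configuring an Estimator and pointing it at the dataset in Amazon S3; the image URI, role ARN, S3 path, and hyperparameters here are placeholders:

```python
from sagemaker.estimator import Estimator

estimator = Estimator(
    image_uri="<account>.dkr.ecr.us-east-1.amazonaws.com/yolov8-train:latest",  # placeholder
    role="arn:aws:iam::<account>:role/SageMakerExecutionRole",                  # placeholder
    instance_count=1,
    instance_type="ml.g5.2xlarge",
    hyperparameters={"epochs": 50, "imgsz": 640},  # typical YOLOv8 knobs
)

# Starts a training job; "train" becomes an input channel inside the container.
estimator.fit({"train": "s3://my-bucket/yolov8/dataset/"})
```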
In this post, we explore how to deploy distilled versions of DeepSeek-R1 with Amazon Bedrock Custom Model Import, making them accessible to organizations looking to use state-of-the-art AI capabilities within the secure and scalable AWS infrastructure at an effective cost. You can monitor costs with AWS Cost Explorer.
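Once the import job finishes, the imported model is invoked through the standard Bedrock runtime API; the model ARN is a placeholder, and the request body schema depends on the imported model's architecture:

```python
import json
import boto3

runtime = boto3.client("bedrock-runtime", region_name="us-east-1")

response = runtime.invoke_model(
    modelId="arn:aws:bedrock:us-east-1:<account>:imported-model/abc123",  # placeholder ARN
    body=json.dumps({
        "prompt": "Explain chain-of-thought distillation in two sentences.",
        "max_gen_len": 256,  # field names vary with the model's expected schema
    }),
)
print(json.loads(response["body"].read()))
```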
AWS optimized the PyTorch torch.compile feature for AWS Graviton3 processors. The optimizations are available in torch Python wheels and the AWS Graviton PyTorch deep learning container (DLC). The goal for the AWS Graviton team was to optimize the torch.compile backend for Graviton3 processors.
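Using the feature requires no Graviton-specific code; torch.compile with the default inductor backend picks up the optimized kernels:

```python
import torch
import torchvision.models as models

model = models.resnet50(weights=None).eval()
compiled = torch.compile(model)  # default backend="inductor"

x = torch.randn(1, 3, 224, 224)
with torch.no_grad():
    out = compiled(x)  # first call triggers compilation; later calls are fast
print(out.shape)
```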
Today, we’re excited to announce the availability of Meta Llama 3 inference on AWS Trainium and AWS Inferentia based instances in Amazon SageMaker JumpStart. In this post, we demonstrate how easy it is to deploy Llama 3 on AWS Trainium and AWS Inferentia based instances in SageMaker JumpStart.
For AWS and Outerbounds customers, the goal is to build a differentiated machine learning and artificial intelligence (ML/AI) system and reliably improve it over time. First, the AWS Trainium accelerator provides a high-performance, cost-effective, and readily available solution for training and fine-tuning large models.
AWS Graviton3 processors are optimized for ML workloads, including support for bfloat16, Scalable Vector Extension (SVE), and Matrix Multiplication (MMLA) instructions. In this post, we show how to run ONNX Runtime inference on AWS Graviton3-based EC2 instances and how to configure them to use optimized GEMM kernels.
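A baseline inference session looks like the following; enabling the bfloat16 fast-math GEMM kernels is done through session or provider options that differ across onnxruntime versions, so only the generic path is shown, and the model path is a placeholder:

```python
import numpy as np
import onnxruntime as ort

# model.onnx stands in for any image-classification model export.
sess = ort.InferenceSession("model.onnx", providers=["CPUExecutionProvider"])

input_name = sess.get_inputs()[0].name
x = np.random.rand(1, 3, 224, 224).astype(np.float32)

outputs = sess.run(None, {input_name: x})
print(outputs[0].shape)
```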
In the previous post, we walked through the process of indexing and storing movie data in OpenSearch. If you haven't already set up the project from the previous post, you can download the source code from the tutorial's "Downloads" section.
Write a Python model definition using the SageMaker inference.py file format. For this post, we use the us-east-1 AWS Region, and you should have access to a POSIX-based (Mac/Linux) system or SageMaker notebooks. Define your model artifacts and inference file within a specific file structure, and archive your model files as a tar.gz.
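The inference.py file follows the SageMaker inference toolkit's handler conventions; the function bodies below are placeholders showing the expected shape, not the post's exact code:

```python
import json

def model_fn(model_dir):
    """Load the model from the extracted tar.gz contents in model_dir."""
    ...  # framework-specific loading goes here

def input_fn(request_body, content_type="application/json"):
    """Deserialize the incoming request."""
    return json.loads(request_body)

def predict_fn(data, model):
    """Run inference with the loaded model."""
    return model(data)  # placeholder prediction call

def output_fn(prediction, accept="application/json"):
    """Serialize the prediction for the response."""
    return json.dumps({"prediction": str(prediction)})
```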
This article shows how to monitor a model deployed on Amazon SageMaker for quality, bias, and explainability, using IBM Watson OpenScale on the IBM Cloud Pak for Data platform. It uses the endpoint generated from the earlier tutorial to demonstrate how to monitor the AWS deployment with Watson OpenScale.
In this post, we walk through how to fine-tune Llama 2 on AWS Trainium , a purpose-built accelerator for LLM training, to reduce training times and costs. We review the fine-tuning scripts provided by the AWS Neuron SDK (using NeMo Megatron-LM), the various configurations we used, and the throughput results we saw.
AWS, Arm, Meta, and others helped optimize the performance of PyTorch 2.0. As a result, we are delighted to announce that AWS Graviton-based instance inference performance for PyTorch 2.0 is up to 3.5 times the speed for BERT, making Graviton-based instances the fastest compute optimized instances on AWS for these models.
You can then export the model and deploy it on Amazon SageMaker on Amazon Web Services (AWS). If you are set up with the required systems, you can download the sample project and complete the steps for hands-on learning. This sample uses the following platforms: Watson Studio and Watson Machine Learning with Python 3.9.
Customers often need to train a model with data from different regions, organizations, or AWS accounts. Existing partner open-source FL solutions on AWS include FedML and NVIDIA FLARE. These open-source packages are deployed in the cloud by running in virtual machines, without using the cloud-native services available on AWS.
You can execute each step in the training pipeline by initiating the process through the SageMaker control plane using APIs, AWS Command Line Interface (AWS CLI), or the SageMaker ModelTrainer SDK. With HyperPod, users can begin the process by connecting to the login/head node of the Slurm cluster.
Today, we’re excited to announce the availability of Llama 2 inference and fine-tuning support on AWS Trainium and AWS Inferentia instances in Amazon SageMaker JumpStart. In this post, we demonstrate how to deploy and fine-tune Llama 2 on Trainium and AWS Inferentia instances in SageMaker JumpStart.
The models are available in the US East (N. Virginia) AWS Region. Prerequisites: to try the Llama 4 models in SageMaker JumpStart, you need an AWS account that will contain all your AWS resources, an AWS Identity and Access Management (IAM) role to access SageMaker AI, and access to accelerated instances (GPUs) for hosting the LLMs.
Additionally, you can use AWS Lambda directly to expose your models and deploy your ML applications using your preferred open-source framework, which can prove to be more flexible and cost-effective. FastAPI is a modern, high-performance web framework for building APIs with Python. This will be used to deploy our solution.
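A widely used pattern for this is wrapping the FastAPI app with an ASGI adapter such as Mangum so Lambda can serve it; the route and handler name are illustrative:

```python
from fastapi import FastAPI
from mangum import Mangum

app = FastAPI()

@app.get("/predict")
def predict(q: str):
    # Placeholder for real model inference
    return {"input": q, "label": "positive"}

# Lambda entry point, configured as "module.handler" in the function settings.
handler = Mangum(app)
```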
In this post, we look at how we can use AWS Glue and the AWS Lake Formation ML transform FindMatches to harmonize (deduplicate) customer data coming from different sources, building a complete customer profile that enables a better customer experience. Run the AWS Glue ML transform job.
AWS has been innovating with purpose-built chips to address the growing need for powerful, efficient, and cost-effective compute hardware. You can use ml.trn1 and ml.inf2 compatible AWS Deep Learning Containers (DLCs) for PyTorch, TensorFlow, Hugging Face, and large model inference (LMI) to easily get started.
Llama 2 by Meta is an example of an LLM offered by AWS. To learn more about Llama 2 on AWS, refer to Llama 2 foundation models from Meta are now available in Amazon SageMaker JumpStart. Llama 2 launched in the US East (N. Virginia) and US West (Oregon) AWS Regions, and general availability was most recently announced in the US East (Ohio) Region.
This post shows a way to do this using Snowflake as the data source and by downloading the data directly from Snowflake into a SageMaker Training job instance. We create a custom training container that downloads data directly from the Snowflake table into the training instance rather than first downloading the data into an S3 bucket.
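Inside the training container, the direct pull can be as simple as the following; connection parameters and the table name are placeholders, and fetch_pandas_all requires the connector's pandas extra:

```python
import snowflake.connector

conn = snowflake.connector.connect(
    user="TRAIN_USER",               # placeholders throughout
    password="...",
    account="myaccount-xy12345",
    warehouse="ML_WH",
    database="ML_DB",
    schema="PUBLIC",
)
cur = conn.cursor()
cur.execute("SELECT * FROM TRAINING_TABLE")

# Land the data where the SageMaker training code expects an input channel.
df = cur.fetch_pandas_all()
df.to_csv("/opt/ml/input/data/train/data.csv", index=False)
```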
With this launch, you can now deploy NVIDIA's optimized reranking and embedding models to build, experiment, and responsibly scale your generative AI ideas on AWS. As part of NVIDIA AI Enterprise, available in AWS Marketplace, NIM is a set of user-friendly microservices designed to streamline and accelerate the deployment of generative AI.
Introduction to GitHub Actions for Python Projects: what is CI/CD? For Python projects, CI/CD pipelines ensure that your code is consistently integrated and delivered with high quality and reliability. Git is the most commonly used VCS for Python projects, enabling collaboration and version tracking.
VIIRS' Day/Night Band (DNB) sensor captures nightlight imagery that is useful for mapping populations. ECOSTRESS has been used in the midwestern United States to detect droughts early enough to recover and protect crops.
These recipes include a training stack validated by Amazon Web Services (AWS) , which removes the tedious work of experimenting with different model configurations, minimizing the time it takes for iterative evaluation and testing. Alternatively, you can also use AWS Systems Manager and run a command like the following to start the session.
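The Systems Manager session command takes the HyperPod cluster node as its target; the ID placeholders below follow the documented target format, but verify them against your own cluster:

```
aws ssm start-session \
  --target sagemaker-cluster:<cluster-id>_<instance-group-name>-<instance-id>
```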
PyTorch is a machine learning (ML) framework that is widely used by AWS customers for a variety of applications, such as computer vision, natural language processing, content creation, and more. With the PyTorch 2.0 release, AWS customers can now do the same things as they could with PyTorch 1.x.
This post demonstrates a strategy for fine-tuning publicly available LLMs for the task of radiology report summarization using AWS services. In the following sections, we demonstrate fine-tuning an LLM available on SageMaker JumpStart for summarization of a domain-specific task via the SageMaker Python SDK. We use an ml.t3.medium instance.
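Fine-tuning through the SDK generally follows the JumpStartEstimator pattern; the model ID, hyperparameters, and S3 path below are illustrative, not the post's exact configuration:

```python
from sagemaker.jumpstart.estimator import JumpStartEstimator

estimator = JumpStartEstimator(
    model_id="huggingface-llm-falcon-7b-bf16",  # hypothetical model ID
    instance_type="ml.g5.12xlarge",
)
estimator.set_hyperparameters(epoch="3", learning_rate="6e-06")

# "training" is the input channel; the S3 URI is a placeholder.
estimator.fit({"training": "s3://my-bucket/radiology-reports/"})
```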
In addition to Amazon Bedrock, you can use other AWS services like Amazon SageMaker JumpStart and Amazon Lex to create fully automated and easily adaptable generative AI order processing agents. In this post, we show you how to build a speech-capable order processing agent using Amazon Lex, Amazon Bedrock, and AWS Lambda.
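A sketch of the Lambda fulfillment side, assuming a Lex V2 event and the Bedrock Converse API; the model ID is illustrative and error handling is omitted:

```python
import boto3

bedrock = boto3.client("bedrock-runtime")

def lambda_handler(event, context):
    utterance = event["inputTranscript"]  # the caller's transcribed speech

    reply = bedrock.converse(
        modelId="anthropic.claude-3-haiku-20240307-v1:0",  # example model
        messages=[{"role": "user", "content": [{"text": utterance}]}],
    )["output"]["message"]["content"][0]["text"]

    # Minimal Lex V2 response: close the dialog and speak the reply.
    return {
        "sessionState": {
            "dialogAction": {"type": "Close"},
            "intent": {
                "name": event["sessionState"]["intent"]["name"],
                "state": "Fulfilled",
            },
        },
        "messages": [{"contentType": "PlainText", "content": reply}],
    }
```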