
Build and deploy a UI for your generative AI applications with AWS and Python

AWS Machine Learning Blog

Traditionally, building frontend and backend applications has required knowledge of web development frameworks and infrastructure management, which can be daunting for those whose expertise lies primarily in data science and machine learning. To get started, choose the us-east-1 AWS Region from the top right corner, then choose Manage model access.
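
A minimal sketch of the pattern this post describes, assuming Amazon Bedrock as the model backend and Streamlit as the UI layer; the model ID and prompt handling here are illustrative assumptions, not the post's exact code.

```python
# Sketch: a Streamlit UI over Amazon Bedrock (assumes us-east-1 and that
# model access has already been granted; the model ID is an illustrative choice).
import boto3
import streamlit as st

# Bedrock runtime client in the Region chosen during setup
bedrock = boto3.client("bedrock-runtime", region_name="us-east-1")

st.title("Generative AI demo")
prompt = st.text_input("Ask a question")

if prompt:
    # The Converse API gives a model-agnostic request/response shape
    response = bedrock.converse(
        modelId="anthropic.claude-3-haiku-20240307-v1:0",  # illustrative model ID
        messages=[{"role": "user", "content": [{"text": prompt}]}],
    )
    st.write(response["output"]["message"]["content"][0]["text"])
```

Saved as app.py, this runs locally with: streamlit run app.py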


Run small language models cost-efficiently with AWS Graviton and Amazon SageMaker AI

Flipboard

AWS has always offered customers choice. In hardware, alongside NVIDIA GPUs and AWS custom AI chips, recent innovations in CPU design make CPU-based instances a further option for customers who want to run generative AI inference, such as hosting small language models and asynchronous agents.
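
As a rough illustration of the CPU-hosting option, here is a hedged sketch using the SageMaker Python SDK to deploy a model to a Graviton-based instance; the image URI, model artifact, role, and instance type are placeholders, and the container image must be an ARM64 build.

```python
# Sketch: hosting a small language model on a Graviton (ARM64) instance with
# the SageMaker Python SDK. Image URI, model artifact, and role are placeholders.
import sagemaker
from sagemaker.model import Model

session = sagemaker.Session()
role = "arn:aws:iam::123456789012:role/SageMakerExecutionRole"  # placeholder

model = Model(
    image_uri="<account>.dkr.ecr.us-east-1.amazonaws.com/slm-arm64:latest",  # must be ARM64
    model_data="s3://my-bucket/slm/model.tar.gz",  # placeholder artifact
    role=role,
    sagemaker_session=session,
)

# ml.c7g.* are Graviton3-based instance types; availability varies by Region
predictor = model.deploy(
    initial_instance_count=1,
    instance_type="ml.c7g.4xlarge",
)
```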


Revolutionizing knowledge management: VW’s AI prototype journey with AWS

AWS Machine Learning Blog

Using the PACE-Way (an Amazon-based development approach), the team developed a time-boxed prototype over a maximum of 6 weeks, which included a full stack solution with frontend and UX, backed by specialist expertise, such as data science, tailored for VW’s needs.


Enhance your Amazon Redshift cloud data warehouse with easier, simpler, and faster machine learning using Amazon SageMaker Canvas

AWS Machine Learning Blog

Conventional ML development cycles take weeks to many months and require scarce data science expertise and ML development skills. Business analysts' ideas for applying ML models often sit in prolonged backlogs because of the data engineering and data science teams' limited bandwidth and the data preparation work involved.


Monitor AWS SageMaker model using IBM Watson OpenScale

IBM Data Science in Practice

Introduction: This article shows how to monitor a model deployed on Amazon SageMaker for quality, bias, and explainability using IBM Watson OpenScale on the IBM Cloud Pak for Data platform. It uses the endpoint generated in a companion tutorial to demonstrate monitoring the AWS deployment with Watson OpenScale.
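
The article's starting point can be hinted at with the ibm-watson-openscale Python SDK; this is a minimal, hedged sketch of connecting to OpenScale and listing configured data marts, with the API key as a placeholder (subscription and monitor setup follow in the tutorial itself).

```python
# Sketch: connecting to IBM Watson OpenScale with its Python SDK
# (pip install ibm-watson-openscale). The API key is a placeholder.
from ibm_cloud_sdk_core.authenticators import IAMAuthenticator
from ibm_watson_openscale import APIClient

authenticator = IAMAuthenticator(apikey="YOUR_IBM_CLOUD_API_KEY")  # placeholder
client = APIClient(authenticator=authenticator)

# Show the data marts available to this account; quality, fairness (bias),
# and explainability monitors are then attached to a subscription that
# points at the SageMaker endpoint.
client.data_marts.show()
```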


Supercharge your auto scaling for generative AI inference – Introducing Container Caching in SageMaker Inference

AWS Machine Learning Blog

Today at AWS re:Invent 2024, we are excited to announce the new Container Caching capability in Amazon SageMaker, which significantly reduces the time required to scale generative AI models for inference. Container Caching addresses this scaling challenge by pre-caching the container image, eliminating the need to download it when scaling up.
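
Container Caching itself needs no API call, but the scaling it accelerates is typically configured through Application Auto Scaling; here is a hedged sketch of target-tracking scaling on a SageMaker endpoint variant, with the endpoint and variant names as placeholders.

```python
# Sketch: target-tracking auto scaling for a SageMaker endpoint variant via
# Application Auto Scaling (boto3). Endpoint/variant names are placeholders.
import boto3

autoscaling = boto3.client("application-autoscaling")
resource_id = "endpoint/my-llm-endpoint/variant/AllTraffic"  # placeholder

autoscaling.register_scalable_target(
    ServiceNamespace="sagemaker",
    ResourceId=resource_id,
    ScalableDimension="sagemaker:variant:DesiredInstanceCount",
    MinCapacity=1,
    MaxCapacity=4,
)

autoscaling.put_scaling_policy(
    PolicyName="llm-invocations-target-tracking",
    ServiceNamespace="sagemaker",
    ResourceId=resource_id,
    ScalableDimension="sagemaker:variant:DesiredInstanceCount",
    PolicyType="TargetTrackingScaling",
    TargetTrackingScalingPolicyConfiguration={
        # Scale out when average invocations per instance exceed the target;
        # faster container start (via caching) shortens the scale-out window.
        "PredefinedMetricSpecification": {
            "PredefinedMetricType": "SageMakerVariantInvocationsPerInstance"
        },
        "TargetValue": 100.0,
    },
)
```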


Introducing Fast Model Loader in SageMaker Inference: Accelerate autoscaling for your Large Language Models (LLMs) – part 1

AWS Machine Learning Blog

Today at AWS re:Invent 2024, we are excited to announce a new capability in Amazon SageMaker Inference that significantly reduces the time required to deploy and scale LLMs for inference using LMI (Large Model Inference): Fast Model Loader. To reduce the time it takes to download and load the container image, SageMaker now supports container caching.
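
For context on the deployment path this capability accelerates, here is a hedged sketch of hosting an LLM with the LMI (DJL) container via the SageMaker Python SDK; the image URI tag, model ID, role, and instance type are placeholders, and enabling Fast Model Loader itself follows the steps in the post rather than this generic configuration.

```python
# Sketch: a generic LMI (DJL) container deployment of the kind Fast Model
# Loader speeds up. Image tag, model ID, role, and instance type are placeholders.
import sagemaker
from sagemaker.model import Model

role = "arn:aws:iam::123456789012:role/SageMakerExecutionRole"  # placeholder

model = Model(
    # LMI container image for the Region; the exact tag depends on the release
    image_uri="763104351884.dkr.ecr.us-east-1.amazonaws.com/djl-inference:<tag>",
    role=role,
    env={
        "OPTION_MODEL_ID": "meta-llama/Llama-3.1-8B-Instruct",  # illustrative
        "OPTION_TENSOR_PARALLEL_DEGREE": "1",
    },
)

predictor = model.deploy(
    initial_instance_count=1,
    instance_type="ml.g5.2xlarge",  # placeholder GPU instance
)
```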
