
Enhanced observability for AWS Trainium and AWS Inferentia with Datadog

AWS Machine Learning Blog

Neuron is the SDK used to run deep learning workloads on Trainium- and Inferentia-based instances. AWS AI chips, Trainium and Inferentia, enable you to build and deploy generative AI models with higher performance and at lower cost. To get started, see AWS Inferentia and AWS Trainium Monitoring.


Your guide to generative AI and ML at AWS re:Invent 2024

AWS Machine Learning Blog

The excitement is building for the fourteenth edition of AWS re:Invent, and as always, Las Vegas is set to host this spectacular event. As you continue to innovate and partner with us to advance the field of generative AI, we’ve curated a diverse range of sessions to support you at every stage of your journey.


PEFT fine tuning of Llama 3 on SageMaker HyperPod with AWS Trainium

AWS Machine Learning Blog

To reduce costs while continuing to use the power of AI, many companies have shifted to fine-tuning LLMs on their domain-specific data using Parameter-Efficient Fine-Tuning (PEFT). Manually managing such complexity can often be counter-productive and take away valuable resources from your business's AI development.


AWS Announces Generative AI Innovation Center with $100 million Investment

insideBIGDATA

Amazon Web Services, Inc. (AWS), an Amazon.com, Inc. company (NASDAQ: AMZN), today announced the AWS Generative AI Innovation Center, a new program to help customers successfully build and deploy generative artificial intelligence (AI) solutions.


Serving LLMs using vLLM and Amazon EC2 instances with AWS AI chips

AWS Machine Learning Blog

The use of large language models (LLMs) and generative AI has exploded over the last year. Using vLLM on AWS Trainium and Inferentia makes it possible to host LLMs for high-performance inference and scalability. Inf2.xlarge instances are only available in select AWS Regions. You will use inf2.xlarge as your instance type.


Unlocking insights and enhancing customer service: Intact’s transformative AI journey with AWS

AWS Machine Learning Blog

To address this, Intact turned to AI and speech-to-text technology to unlock insights from calls and improve customer service. The company developed an automated solution called Call Quality (CQ) using AI services from Amazon Web Services (AWS). It uses deep learning to convert audio to text quickly and accurately.


Run small language models cost-efficiently with AWS Graviton and Amazon SageMaker AI

Flipboard

As organizations look to incorporate AI capabilities into their applications, large language models (LLMs) have emerged as powerful tools for natural language processing tasks. AWS has always provided customers with choice. Prerequisites To implement this solution, you need an AWS account with the necessary permissions.
