Classic compute inherits tags on the cluster definition, while serverless adheres to Serverless Budget Policies (AWS | Azure | GCP). Refer to this article (AWS | Azure | GCP) for details about tagging different compute resources, and this article (AWS | Azure | GCP) for details about tagging Unity Catalog securables.
The process of setting up and configuring a distributed training environment can be complex, requiring expertise in server management, cluster configuration, networking and distributed computing. To simplify infrastructure setup and accelerate distributed training, AWS introduced Amazon SageMaker HyperPod in late 2023.
AWS Trainium and AWS Inferentia based instances, combined with Amazon Elastic Kubernetes Service (Amazon EKS), provide a performant, low-cost framework to run LLMs efficiently in a containerized environment. Solution overview – The steps to implement the solution are as follows: create the EKS cluster.
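To make that first step concrete, here is a minimal sketch using boto3; the cluster name, role ARN, and subnet IDs are placeholders, and a real setup would also add node groups backed by Trainium or Inferentia instance types (for example, trn1 or inf2):

    import boto3

    eks = boto3.client("eks", region_name="us-west-2")

    # Hypothetical names and ARNs - substitute your own resources.
    eks.create_cluster(
        name="llm-inference-cluster",
        roleArn="arn:aws:iam::123456789012:role/eksClusterRole",
        resourcesVpcConfig={"subnetIds": ["subnet-0abc", "subnet-0def"]},
    )

    # Block until the control plane is active before creating node groups.
    eks.get_waiter("cluster_active").wait(name="llm-inference-cluster")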
In this post, we demonstrate how you can address this requirement by using Amazon SageMaker HyperPod training plans, which can bring down your training cluster procurement wait time. We further guide you through using the training plan to submit SageMaker training jobs or create SageMaker HyperPod clusters.
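A rough sketch of what that can look like in code, assuming the SearchTrainingPlanOfferings and CreateTrainingPlan operations available in recent boto3 releases; the instance type, count, and names are illustrative:

    import boto3

    sm = boto3.client("sagemaker")

    # Look for reservable capacity matching the desired shape (illustrative values).
    offerings = sm.search_training_plan_offerings(
        InstanceType="ml.p5.48xlarge",
        InstanceCount=8,
        TargetResources=["training-job"],
    )
    offering_id = offerings["TrainingPlanOfferings"][0]["TrainingPlanOfferingId"]

    # Reserve the plan; its ARN can then be referenced from a training job's
    # ResourceConfig or from a SageMaker HyperPod cluster definition.
    plan = sm.create_training_plan(
        TrainingPlanName="llm-pretrain-plan",
        TrainingPlanOfferingId=offering_id,
    )
    print(plan["TrainingPlanArn"])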
The excitement is building for the fourteenth edition of AWS re:Invent, and as always, Las Vegas is set to host this spectacular event. Third, we’ll explore the robust infrastructure services from AWS powering AI innovation, featuring Amazon SageMaker , AWS Trainium , and AWS Inferentia under AI/ML, as well as Compute topics.
In 2018, I sat in the audience at AWS re:Invent as Andy Jassy announced AWS DeepRacer —a fully autonomous 1/18th scale race car driven by reinforcement learning. But AWS DeepRacer instantly captured my interest with its promise that even inexperienced developers could get involved in AI and ML.
As Kubernetes clusters grow in complexity, managing them efficiently becomes increasingly challenging. Integrating advanced generative AI tools like K8sGPT and Amazon Bedrock can revolutionize Kubernetes cluster operations and maintenance.
Prerequisites – To implement the proposed solution, make sure that you have the following: an AWS account and a working knowledge of FMs, Amazon Bedrock, Amazon SageMaker, Amazon OpenSearch Service, Amazon S3, and AWS Identity and Access Management (IAM). You also need Amazon Titan Multimodal Embeddings model access in Amazon Bedrock.
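Once that model access is granted, the embeddings model can be called through the Bedrock Runtime; a minimal sketch, assuming the amazon.titan-embed-image-v1 model ID and a hypothetical local image file:

    import base64
    import json
    import boto3

    bedrock = boto3.client("bedrock-runtime")

    with open("product.jpg", "rb") as f:  # hypothetical local image
        image_b64 = base64.b64encode(f.read()).decode("utf-8")

    # Embed text and image together into a shared vector space.
    response = bedrock.invoke_model(
        modelId="amazon.titan-embed-image-v1",  # assumed Titan Multimodal Embeddings ID
        body=json.dumps({"inputText": "red running shoe", "inputImage": image_b64}),
    )
    embedding = json.loads(response["body"].read())["embedding"]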
To implement this solution, complete the following steps: set up Zero-ETL integration from the AWS Management Console for Amazon Relational Database Service (Amazon RDS). You also need an AWS Identity and Access Management (IAM) user with sufficient permissions to interact with the AWS Management Console and related AWS services.
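The console steps have a programmatic equivalent; a sketch using the RDS CreateIntegration API via boto3, with all ARNs and names as placeholders:

    import boto3

    rds = boto3.client("rds")

    # Create a zero-ETL integration from an RDS source to a Redshift namespace.
    rds.create_integration(
        IntegrationName="rds-to-redshift-zeroetl",  # illustrative name
        SourceArn="arn:aws:rds:us-east-1:123456789012:db:my-rds-instance",
        TargetArn="arn:aws:redshift:us-east-1:123456789012:namespace/abc-123",
    )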
This service offers numerous advantages for building and deploying generative AI applications, including access to a wide range of pre-trained models with ready-to-use artifacts, a user-friendly interface, and seamless scalability within the AWS ecosystem. Choose Store to save your secret.
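The Store step in the console maps to a single Secrets Manager call; a minimal sketch with an illustrative secret name and value:

    import json
    import boto3

    secrets = boto3.client("secretsmanager")

    # Store credentials as a JSON secret (hypothetical name and values).
    secrets.create_secret(
        Name="my-app/api-credentials",
        SecretString=json.dumps({"username": "admin", "password": "example-only"}),
    )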
Powered by generative AI services on AWS and the multimodal capabilities of large language models (LLMs), HCLTech's AutoWise Companion provides a seamless and impactful experience. Technical architecture – The overall solution is implemented using AWS services and LangChain. AWS Glue is used for data cataloging.
Although setting up a processing cluster is an alternative, it introduces its own set of complexities, from data distribution to infrastructure management. We use the purpose-built geospatial container with SageMaker Processing jobs for a simplified, managed experience to create and run a cluster.
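A rough sketch of launching such a job with the SageMaker Python SDK; the container image URI, role, script name, and S3 paths are placeholders rather than values from the post:

    from sagemaker.processing import ProcessingInput, ProcessingOutput, ScriptProcessor

    processor = ScriptProcessor(
        image_uri="<geospatial-container-image-uri>",  # placeholder
        command=["python3"],
        role="<execution-role-arn>",
        instance_count=4,
        instance_type="ml.m5.4xlarge",
    )

    # The managed job fans the input tiles out across the processing cluster.
    processor.run(
        code="process_tiles.py",  # hypothetical processing script
        inputs=[ProcessingInput(source="s3://my-bucket/tiles/",
                                destination="/opt/ml/processing/input")],
        outputs=[ProcessingOutput(source="/opt/ml/processing/output",
                                  destination="s3://my-bucket/results/")],
    )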
Prerequisites – Before you begin, make sure you have the following in place: an AWS account and a role with the AWS Identity and Access Management (IAM) privileges to deploy the required resources, including IAM roles; a provisioned Amazon Redshift cluster (used for this post); and a SageMaker domain. For Database name, enter dev.
The complexity of Kubernetes manifests and cluster management can pose significant challenges, potentially slowing down development cycles and resource utilization. Solution overview Implementing this solution is straightforward, whether you’re working with existing SageMaker HyperPod clusters or setting up a new deployment.
Meta Llama 3.1 8B and 70B inference is supported on AWS Trainium and AWS Inferentia instances in Amazon SageMaker JumpStart. Trainium and Inferentia, enabled by the AWS Neuron software development kit (SDK), offer high performance and lower the cost of deploying Meta Llama 3.1. You also need an AWS Identity and Access Management (IAM) role to access SageMaker.
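Deploying one of these models onto an Inferentia2 instance can be a few lines with the SageMaker Python SDK; the model ID below is illustrative, so look up the exact Neuron-enabled Llama 3.1 ID in JumpStart:

    from sagemaker.jumpstart.model import JumpStartModel

    # Illustrative model ID - check JumpStart for the Neuron-enabled variant.
    model = JumpStartModel(model_id="meta-textgeneration-llama-3-1-8b")

    predictor = model.deploy(
        accept_eula=True,
        instance_type="ml.inf2.24xlarge",  # AWS Inferentia2 instance
    )
    print(predictor.predict({"inputs": "Explain AWS Neuron in one sentence."}))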
At ByteDance, we collaborated with Amazon Web Services (AWS) to deploy multimodal large language models (LLMs) for video understanding using AWS Inferentia2 across multiple AWS Regions around the world. Solution overview – We've collaborated with AWS since the first generation of Inferentia chips.
Syngenta and AWS collaborated to develop Cropwise AI, an innovative solution powered by Amazon Bedrock Agents, to accelerate their sales reps' ability to place Syngenta seed products with growers across North America. The collaboration between Syngenta and AWS showcases the transformative power of LLMs and AI agents.
Creating professional AWS architecture diagrams is a fundamental task for solutions architects, developers, and technical teams. By using generative AI through natural language prompts, architects can now generate professional diagrams in minutes rather than hours, while adhering to AWS best practices.
You can chat with your structured data by setting up structured data ingestion from AWS Glue Data Catalog tables and Amazon Redshift clusters in a few steps, using the power of Amazon Bedrock Knowledge Bases structured data retrieval. Prerequisites To implement the solution provided in this post, you must have an AWS account.
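A minimal sketch of querying such a knowledge base with boto3; the knowledge base ID and question are placeholders:

    import boto3

    agent_rt = boto3.client("bedrock-agent-runtime")

    # Natural language query against a structured-data knowledge base.
    response = agent_rt.retrieve(
        knowledgeBaseId="KBID123456",  # placeholder
        retrievalQuery={"text": "What were the top five products by revenue last quarter?"},
    )
    for result in response["retrievalResults"]:
        print(result["content"])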
We walk through the journey Octus took from managing multiple cloud providers and costly GPU instances to implementing a streamlined, cost-effective solution using AWS services including Amazon Bedrock, AWS Fargate, and Amazon OpenSearch Service. Along the way, the migration also simplified operations, since Octus is broadly an AWS shop.
Although QLoRA helps optimize memory during fine-tuning, we use Amazon SageMaker Training to spin up a resilient training cluster, manage orchestration, and monitor the cluster for failures. To take complete advantage of this multi-GPU 24xlarge compute instance, we use the recent support for QLoRA with PyTorch FSDP.
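A skeletal version of such a training job with the SageMaker Python SDK; the entry point script (where the QLoRA and FSDP logic would live), role, and data location are placeholders:

    from sagemaker.pytorch import PyTorch

    estimator = PyTorch(
        entry_point="train_qlora_fsdp.py",  # hypothetical training script
        role="<execution-role-arn>",
        instance_type="ml.p4d.24xlarge",    # 8 x A100 GPUs
        instance_count=1,
        framework_version="2.2.0",
        py_version="py310",
        # Launch one worker per GPU so FSDP can shard across all devices.
        distribution={"torch_distributed": {"enabled": True}},
    )
    estimator.fit({"train": "s3://my-bucket/train/"})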
DGX Cloud on Amazon Web Services (AWS) represents a significant leap forward in democratizing access to high-performance AI infrastructure. By combining NVIDIA GPU expertise with AWS scalable cloud services, organizations can accelerate their time-to-train, reduce operational complexity, and unlock new business opportunities.
These recipes include a training stack validated by Amazon Web Services (AWS), which removes the tedious work of experimenting with different model configurations, minimizing the time it takes for iterative evaluation and testing. The launcher interfaces with your cluster through Slurm or Kubernetes native constructs.
Solution overview – Implementing the solution consists of the following high-level steps: set up your environment and the permissions to access Amazon SageMaker HyperPod clusters in SageMaker Studio. You can then use SageMaker Studio to discover the SageMaker HyperPod clusters and view cluster details and metrics.
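The same cluster details that Studio surfaces can also be read programmatically; a short sketch with boto3's SageMaker HyperPod cluster APIs:

    import boto3

    sm = boto3.client("sagemaker")

    # Enumerate HyperPod clusters in the account and print their status.
    for summary in sm.list_clusters()["ClusterSummaries"]:
        detail = sm.describe_cluster(ClusterName=summary["ClusterName"])
        print(summary["ClusterName"], detail["ClusterStatus"])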
Amazon Web Services (AWS) provides the essential compute infrastructure to support these endeavors, offering scalable and powerful resources through Amazon SageMaker HyperPod. To offer a more concrete look at these trends, the following is a deep dive into how climate tech startups are building FMs on AWS.
Launching a machine learning (ML) training cluster with Amazon SageMaker training jobs is a seamless process that begins with a straightforward API call, AWS Command Line Interface (AWS CLI) command, or AWS SDK interaction. Surya Kari is a Senior Generative AI Data Scientist at AWS.
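To make that "straightforward API call" concrete, here is a minimal boto3 sketch; the job name, image URI, role, and S3 path are placeholders:

    import boto3

    sm = boto3.client("sagemaker")

    sm.create_training_job(
        TrainingJobName="demo-training-job",
        AlgorithmSpecification={
            "TrainingImage": "<training-image-uri>",  # placeholder
            "TrainingInputMode": "File",
        },
        RoleArn="<execution-role-arn>",
        OutputDataConfig={"S3OutputPath": "s3://my-bucket/output/"},
        ResourceConfig={
            "InstanceType": "ml.g5.2xlarge",
            "InstanceCount": 1,
            "VolumeSizeInGB": 50,
        },
        StoppingCondition={"MaxRuntimeInSeconds": 3600},
    )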
For deployment, we containerized Open WebUI and orchestrated it on an Amazon Elastic Kubernetes Service (Amazon EKS) cluster, using automatic scaling to dynamically adjust resources based on demand while maintaining high availability. This data lakehouse, internally referred to as Luna, is built using Apache Spark and Apache Hudi.
Communication between the two systems was established through Kerberized Apache Livy (HTTPS) connections over AWS PrivateLink. Responsibility for maintenance and troubleshooting – Rocket's DevOps/Technology team was responsible for all upgrades, scaling, and troubleshooting of the Hadoop cluster, which was installed on bare EC2 instances.
Amazon Bedrock offers a serverless experience, so you can get started quickly, privately customize FMs with your own data, and integrate and deploy them into your applications using Amazon Web Services (AWS) services without having to manage infrastructure. AWS Lambda – The API is a Fastify application written in TypeScript.
At AWS re:Invent 2024, we announced Amazon Bedrock Knowledge Bases support for natural language querying to retrieve structured data from Amazon Redshift and Amazon SageMaker Lakehouse. Users can interact with Amazon Bedrock Knowledge Bases using the AWS Management Console or an AWS SDK client, which sends natural language queries.
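A sketch of sending such a natural language query with boto3; the knowledge base ID and model ARN are placeholders:

    import boto3

    agent_rt = boto3.client("bedrock-agent-runtime")

    # Ask a question in natural language; the knowledge base handles retrieval.
    response = agent_rt.retrieve_and_generate(
        input={"text": "Which region had the highest sales growth in 2024?"},
        retrieveAndGenerateConfiguration={
            "type": "KNOWLEDGE_BASE",
            "knowledgeBaseConfiguration": {
                "knowledgeBaseId": "KBID123456",  # placeholder
                "modelArn": "arn:aws:bedrock:us-east-1::foundation-model/anthropic.claude-3-5-sonnet-20240620-v1:0",
            },
        },
    )
    print(response["output"]["text"])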
In this post, we explore how you can use Amazon Q Business, the AWS generative AI-powered assistant, to build a centralized knowledge base for your organization, unifying structured and unstructured datasets from different sources to accelerate decision-making and drive productivity.
We recently announced the general availability of cross-account sharing of Amazon SageMaker Model Registry using AWS Resource Access Manager (AWS RAM), making it easier to securely share and discover machine learning (ML) models across your AWS accounts.
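At the API level, sharing a model package group comes down to one AWS RAM call; the ARNs and account ID below are placeholders:

    import boto3

    ram = boto3.client("ram")

    # Share a model package group with a consumer account (illustrative values).
    ram.create_resource_share(
        name="shared-model-registry",
        resourceArns=["arn:aws:sagemaker:us-east-1:111122223333:model-package-group/my-models"],
        principals=["444455556666"],  # consumer AWS account ID
    )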
Orchestrate with Tecton-managed EMR clusters – After features are deployed, Tecton automatically creates the scheduling, provisioning, and orchestration needed for pipelines that can run on Amazon EMR compute engines. You can view and create EMR clusters directly through the SageMaker notebook.
We demonstrate this solution by walking you through a comprehensive step-by-step guide on how to fine-tune YOLOv8 , a real-time object detection model, on Amazon Web Services (AWS) using a custom dataset. You can train foundation models (FMs) for weeks and months without disruption by automatically monitoring and repairing training clusters.
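The core fine-tuning loop is short with the ultralytics package; the dataset YAML below is a hypothetical stand-in for your custom dataset definition:

    from ultralytics import YOLO

    # Start from pretrained weights and fine-tune on a custom dataset.
    model = YOLO("yolov8n.pt")
    model.train(data="custom_dataset.yaml", epochs=50, imgsz=640)

    # Evaluate on the validation split defined in the dataset YAML.
    metrics = model.val()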
The potential for such large business value is galvanizing tens of thousands of enterprises to build their generative AI applications on AWS. This post addresses these cost considerations so you can optimize your generative AI costs on AWS. The annual cost figures shown are directional and based on stated assumptions.
Managing access control in enterprise machine learning (ML) environments presents significant challenges, particularly when multiple teams share Amazon SageMaker AI resources within a single Amazon Web Services (AWS) account. Refer to the Operating model whitepaper for best practices on account structure.
Training an LLM is a compute-intensive and complex process, which is why Fastweb, as a first step in their AI journey, used AWS generative AI and machine learning (ML) services such as Amazon SageMaker HyperPod. The team opted for fine-tuning on AWS.
Operational complexity – Teams are forced to divert valuable engineering resources toward managing and tuning dedicated vector database clusters. Seamless integration – Works with familiar Amazon S3 APIs and AWS services. Prerequisites – To implement the solution, you need an AWS account with billing enabled.
Integration with existing systems on AWS: Lumi seamlessly integrated SageMaker Asynchronous Inference endpoints with their existing loan processing pipeline. Using Databricks on AWS for model training, they built a pipeline to host the model in SageMaker AI, optimizing data flow and results retrieval.
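Invoking an asynchronous endpoint differs from real-time inference in that the request payload is read from S3 and the result is written back to S3; a sketch with placeholder names:

    import boto3

    runtime = boto3.client("sagemaker-runtime")

    response = runtime.invoke_endpoint_async(
        EndpointName="loan-scoring-endpoint",  # placeholder
        InputLocation="s3://my-bucket/requests/loan-123.json",
        ContentType="application/json",
    )
    # The result JSON lands here once inference completes.
    print(response["OutputLocation"])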
In the context of enterprise data asset search powered by a metadata catalog hosted on services such as Amazon DataZone, AWS Glue, and other third-party catalogs, knowledge graphs can help integrate this linked data and also enable a scalable search paradigm that integrates metadata that evolves over time.
Prerequisites – You should have the following in place: an AWS account, a SageMaker notebook instance, and an S3 bucket to store the input data. Process the data – To start, upload the log dataset to an S3 bucket in your AWS account. In the notebook, the account ID is resolved and the session authenticates with Amazon ECR:

    account_id = boto3.client("sts").get_caller_identity().get("Account")
    !$(aws ecr get-login --region $region --registry-ids $account_id --no-include-email)
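The upload step itself is a single boto3 call; the file, bucket, and key names are illustrative:

    import boto3

    s3 = boto3.client("s3")
    # Hypothetical file and bucket names - substitute your own.
    s3.upload_file("web_logs.csv", "my-log-bucket", "input/web_logs.csv")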
It supports AWS services such as Amazon Simple Storage Service (Amazon S3), Elastic Fabric Adapter (EFA), and Amazon Elastic Kubernetes Service (Amazon EKS), and can be deployed on NVIDIA GPU-accelerated Amazon Elastic Compute Cloud (Amazon EC2) instances, including P6 instances accelerated by NVIDIA Blackwell.
To address these challenges, AWS has expanded Amazon SageMaker with a comprehensive set of data, analytics, and generative AI capabilities. It’s available as a standalone service on the AWS Management Console, or through APIs. For setup instructions, see Set up an AWS account and create an administrator user.
SnapLogic uses Amazon Bedrock to build its platform, capitalizing on the proximity to data already stored in Amazon Web Services (AWS). To address customers’ requirements about data privacy and sovereignty, SnapLogic deploys the data plane within the customer’s VPC on AWS.