To simplify infrastructure setup and accelerate distributed training, AWS introduced Amazon SageMaker HyperPod in late 2023. In this blog post, we showcase how you can perform efficient supervised fine-tuning for a Meta Llama 3 model using PEFT on AWS Trainium with SageMaker HyperPod.
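As a rough illustration of the PEFT approach mentioned above, the following sketch configures LoRA adapters with the Hugging Face peft library; the model ID, target modules, and hyperparameters are illustrative assumptions, not the post's exact setup (which runs on AWS Trainium via SageMaker HyperPod):

```python
# A rough sketch of LoRA-based PEFT for a Llama 3 model with the Hugging Face
# peft library. Model ID and hyperparameters are illustrative; the post's
# actual training runs on AWS Trainium via SageMaker HyperPod.
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import LoraConfig, get_peft_model

model_name = "meta-llama/Meta-Llama-3-8B"  # assumed; requires HF access approval
model = AutoModelForCausalLM.from_pretrained(model_name)
tokenizer = AutoTokenizer.from_pretrained(model_name)

# LoRA trains small low-rank adapter matrices instead of all model weights.
lora_config = LoraConfig(
    r=16,                                  # rank of the adapter matrices
    lora_alpha=32,                         # scaling factor for adapter updates
    target_modules=["q_proj", "v_proj"],   # attention projections to adapt
    lora_dropout=0.05,
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora_config)
model.print_trainable_parameters()  # typically well under 1% of all parameters
```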
These tables house complex, domain-specific schemas, including nested tables and multi-dimensional data that require intricate database queries and domain-specific knowledge for data retrieval. As a result, NL2SQL solutions for enterprise data are often incomplete or inaccurate.
Amazon Bedrock is a fully managed service that offers a choice of high-performing foundation models (FMs) from leading AI companies and AWS. Solution overview: The following diagram provides a high-level overview of AWS services and features through a sample use case.
Agent function calling represents a critical capability for modern AI applications, allowing models to interact with external tools, databases, and APIs by accurately determining when and how to invoke specific functions. You can track these job status details in both the AWS Management Console and the AWS SDK.
This engine uses artificial intelligence (AI) and machine learning (ML) services and generative AI on AWS to extract transcripts, produce a summary, and provide a sentiment for the call. Organizations typically can’t predict their call patterns, so the solution relies on AWS serverless services to scale during busy times.
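To make the sentiment stage concrete, here is a minimal sketch using Amazon Comprehend's detect_sentiment API; the transcript text is invented, and the post's engine also produces transcripts and summaries using other AI/ML services:

```python
# A minimal sketch of the sentiment stage with Amazon Comprehend. The
# transcript is invented; the full engine described above also extracts
# transcripts and produces summaries with other AI/ML services.
import boto3

comprehend = boto3.client("comprehend")

transcript = "Thanks for resolving my billing issue so quickly."
result = comprehend.detect_sentiment(Text=transcript, LanguageCode="en")

print(result["Sentiment"])       # e.g. POSITIVE
print(result["SentimentScore"])  # per-class confidence scores
```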
Traditionally, RAG systems were text-centric, retrieving information from large text databases to provide relevant context for language models. First, it enables you to include both image and text features in a single database and therefore reduces complexity. You may be prompted to subscribe to this model through AWS Marketplace.
Graph database company Neo4j Inc. said today it’s embarking on a multiyear strategic collaboration with the cloud computing giant Amazon Web Services …
SageMaker Unified Studio combines various AWS services, including Amazon Bedrock, Amazon SageMaker, Amazon Redshift, AWS Glue, Amazon Athena, and Amazon Managed Workflows for Apache Airflow (MWAA), into a comprehensive data and AI development platform. Navigate to the AWS Secrets Manager console and find the secret -api-keys.
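The console step above can also be done programmatically. A minimal sketch with boto3 follows; the secret name is truncated in the excerpt, so a placeholder is used, and the secret value is assumed to be a JSON object of API keys:

```python
# A minimal sketch of reading the secret with boto3 instead of the console.
# The secret name is truncated in the excerpt, so a placeholder is used, and
# the secret value is assumed to be a JSON object of API keys.
import json
import boto3

secrets = boto3.client("secretsmanager")

resp = secrets.get_secret_value(SecretId="<prefix>-api-keys")  # placeholder
api_keys = json.loads(resp["SecretString"])
```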
Amazon Bedrock is a fully managed service provided by AWS that offers developers access to foundation models (FMs) and the tools to customize them for specific applications. The workflow steps are as follows: AWS Lambda running in your private VPC subnet receives the prompt request from the generative AI application.
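A minimal sketch of that Lambda step, assuming the handler receives a "prompt" field and forwards it to Amazon Bedrock via the Converse API; the event shape and model ID are illustrative assumptions:

```python
# A minimal sketch of the Lambda step: a handler in a private VPC subnet that
# forwards a prompt to Amazon Bedrock. Event shape and model ID are assumed.
import json
import boto3

bedrock = boto3.client("bedrock-runtime")

def handler(event, context):
    prompt = event["prompt"]  # assumed event shape
    response = bedrock.converse(
        modelId="anthropic.claude-3-haiku-20240307-v1:0",  # illustrative
        messages=[{"role": "user", "content": [{"text": prompt}]}],
    )
    completion = response["output"]["message"]["content"][0]["text"]
    return {"statusCode": 200, "body": json.dumps({"completion": completion})}
```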
Agent Creator is a versatile extension to the SnapLogic platform that is compatible with modern databases, APIs, and even legacy mainframe systems, fostering seamless integration across various data environments. Pre-built templates tailored to various use cases are included, significantly enhancing both employee and customer experiences.
AWS, the cloud computing giant, has been perceived as playing catch-up with its rivals Microsoft Azure and Google Cloud in the emerging and exciting field of generative AI. But this week, at its annual AWS re:Invent conference, Amazon plans to showcase its ambitious vision for generative AI, …
Technical challenges with multi-modal data also include the complexity of integrating and modeling different data types, the difficulty of combining data from multiple modalities (text, images, audio, video), and the need for advanced computer science skills and sophisticated analysis tools.
The content and opinions in this post are those of the third-party author and AWS is not responsible for the content or accuracy of this post. It uses managed AWS services like SageMaker and Amazon Bedrock to enable the entire ML lifecycle. This post is co-authored by Jay Kshirsagar and Ronald Quan from Qualtrics.
By automating document ingestion, chunking, and embedding, it eliminates the need to manually set up complex vector databases or custom retrieval systems, significantly reducing development complexity and time. The solution scales quickly to accommodate growing data volumes and user queries thanks to AWS serverless offerings.
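For illustration, a fixed-size chunking step like the one described might look like the following sketch; the chunk size and overlap are arbitrary defaults, not the solution's actual parameters:

```python
# An illustrative fixed-size chunking step with overlap; chunk size and
# overlap are arbitrary defaults, not the solution's actual parameters.
def chunk_text(text: str, chunk_size: int = 1000, overlap: int = 200) -> list[str]:
    """Split text into overlapping chunks so context survives the boundaries."""
    step = chunk_size - overlap
    return [text[i:i + chunk_size] for i in range(0, len(text), step)]

chunks = chunk_text(open("document.txt").read())  # each chunk is then embedded
```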
Often, LLMs need to interact with other software, databases, or APIs to accomplish complex tasks. In this post, we introduce LLM agents and demonstrate how to build and deploy an e-commerce LLM agent using Amazon SageMaker JumpStart and AWS Lambda. Next, we show how to implement a simple agent loop using AWS services.
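The post builds its agent on Amazon SageMaker JumpStart and AWS Lambda; as a stand-in, the following sketch shows the same loop shape using the Amazon Bedrock Converse API, with a hypothetical lookup_product tool and an illustrative model ID:

```python
# A sketch of a simple agent loop: the model either answers directly or
# requests a tool call, which we execute and feed back. The tool, model ID,
# and Bedrock Converse API are stand-ins for the post's SageMaker/Lambda setup.
import boto3

client = boto3.client("bedrock-runtime")

def lookup_product(product_id: str) -> str:
    # Hypothetical e-commerce tool; a real agent would query a catalog API.
    return f"Product {product_id}: in stock, $19.99"

TOOLS = {"lookup_product": lookup_product}

tool_config = {"tools": [{"toolSpec": {
    "name": "lookup_product",
    "description": "Look up a product's availability and price by ID.",
    "inputSchema": {"json": {
        "type": "object",
        "properties": {"product_id": {"type": "string"}},
        "required": ["product_id"],
    }},
}}]}

messages = [{"role": "user", "content": [{"text": "Is product 42 in stock?"}]}]

while True:
    resp = client.converse(
        modelId="anthropic.claude-3-haiku-20240307-v1:0",  # illustrative model
        messages=messages,
        toolConfig=tool_config,
    )
    message = resp["output"]["message"]
    messages.append(message)
    if resp["stopReason"] != "tool_use":
        break  # the model returned a final answer instead of a tool request
    # Run each requested tool and hand the results back to the model.
    results = []
    for block in message["content"]:
        if "toolUse" in block:
            call = block["toolUse"]
            output = TOOLS[call["name"]](**call["input"])
            results.append({"toolResult": {
                "toolUseId": call["toolUseId"],
                "content": [{"text": output}],
            }})
    messages.append({"role": "user", "content": results})

print(messages[-1]["content"][0]["text"])
```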
In this two-part series, we demonstrate how you can deploy a cloud-based FL framework on AWS. In the second post , we present the use cases and dataset to show its effectiveness in analyzing real-world healthcare datasets, such as the eICU data , which comprises a multi-center critical care database collected from over 200 hospitals.
In this post, we show you how SnapLogic , an AWS customer, used Amazon Bedrock to power their SnapGPT product through automated creation of these complex DSL artifacts from human language. SnapLogic background: SnapLogic is an AWS customer on a mission to bring enterprise automation to the world.
run_opensearch.sh: a script to start OpenSearch using Docker for local testing before deploying to AWS. Register the Sentence Transformer model in Amazon OpenSearch Service: users must ensure that OpenSearch can access the model before indexing. These can be used for evaluation and comparison.
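A minimal sketch of that registration step through the OpenSearch ML Commons plugin, assuming a local Docker instance on https://localhost:9200 with default admin credentials:

```python
# A minimal sketch of registering a pretrained sentence transformer through
# the OpenSearch ML Commons plugin, assuming a local Docker instance on
# https://localhost:9200 with default admin credentials.
import requests

base = "https://localhost:9200"
auth = ("admin", "admin")  # local-only credentials; never use in production

resp = requests.post(
    f"{base}/_plugins/_ml/models/_register",
    json={
        "name": "huggingface/sentence-transformers/all-MiniLM-L6-v2",
        "version": "1.0.1",
        "model_format": "TORCH_SCRIPT",
    },
    auth=auth,
    verify=False,  # local self-signed certificate
)
print(resp.json()["task_id"])  # registration runs as an asynchronous task
```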
To mitigate these challenges, we propose a federated learning (FL) framework, based on open-source FedML on AWS, which enables analyzing sensitive HCLS data. In this two-part series, we demonstrate how you can deploy a cloud-based FL framework on AWS. In the first post , we described FL concepts and the FedML framework.
Building a production-ready solution in AWS involves a series of trade-offs between resources, time, customer expectation, and business outcome. The AWS Well-Architected Framework helps you understand the benefits and risks of decisions you make while building workloads on AWS.
The customer review analysis workflow consists of the following steps: A user uploads a file to a dedicated data repository within your Amazon Simple Storage Service (Amazon S3) data lake, invoking the processing using AWS Step Functions. In the first step, an AWS Lambda function reads and validates the file, and extracts the raw data.
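A minimal sketch of that first Lambda task, assuming the state machine passes the bucket and key and the uploaded file is a CSV; the validation logic is illustrative:

```python
# A minimal sketch of the first Lambda task: read the uploaded file from
# Amazon S3 and validate it. Event shape and CSV format are assumptions.
import csv
import io
import boto3

s3 = boto3.client("s3")

def handler(event, context):
    bucket, key = event["bucket"], event["key"]  # assumed state-machine input
    body = s3.get_object(Bucket=bucket, Key=key)["Body"].read().decode("utf-8")
    rows = list(csv.DictReader(io.StringIO(body)))
    if not rows:
        raise ValueError(f"No records found in s3://{bucket}/{key}")
    return {"bucket": bucket, "key": key, "record_count": len(rows)}
```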
Instead of relying solely on their pre-trained knowledge, RAG allows models to pull data from documents, databases, and more. This means that as new data becomes available, it can be added to the retrieval database without needing to retrain the entire model. Prerequisites: You must have an AWS account.
This post shows you how to set up RAG using DeepSeek-R1 on Amazon SageMaker with an OpenSearch Service vector database as the knowledge base. You will execute scripts to create an AWS Identity and Access Management (IAM) role for invoking SageMaker, and a role for your user to create a connector to SageMaker.
With the rapid growth of generative artificial intelligence (AI), many AWS customers are looking to take advantage of publicly available foundation models (FMs) and technologies. This democratizes access to generative AI and improves efficiency in writing complex queries without needing to learn SQL or understand complex database schemas.
This post is a follow-up to Generative AI and multi-modal agents in AWS: The key to unlocking new value in financial markets. Action groups – Action groups are interfaces that an agent uses to interact with the different underlying components such as APIs and databases.
Despite the existence of AWS Application Discovery Service or the presence of some form of configuration management database (CMDB), customers still face many challenges. Customization and adaptability : Action groups allow users to customize migration workflows to suit specific AWS environments and requirements.
Our solution provides practical guidance on addressing this challenge by using a generative AI assistant on AWS. The approach uses Retrieval Augmented Generation (RAG) , which combines text generation capabilities with database querying to provide contextually relevant responses to customer inquiries.
This content is then transformed into a vector database optimized for efficient information retrieval. In the RAG pipeline, the retriever taps into this vector database to surface relevant information, and the LLM generates tailored responses to Twitch user queries submitted through a Slack assistant.
This post provides an overview of a custom solution developed by the AWS Generative AI Innovation Center (GenAIIC) for Deltek , a globally recognized standard for project-based businesses in both government contracting and professional services. It uses a vector database structure to efficiently store and query large volumes of data.
Amazon Redshift uses SQL to analyze structured and semi-structured data across data warehouses, operational databases, and data lakes, using AWS-designed hardware and ML to deliver the best price-performance at any scale. Prerequisites: To continue with the examples in this post, you need to create the required AWS resources.
In this post, we show you how Amazon Web Services (AWS) helps solve forecasting challenges by customizing machine learning (ML) models for forecasting. We access Amazon SageMaker Canvas through the AWS Management Console. About the authors: Aditya Pendyala is a Principal Solutions Architect at AWS, based out of NYC.
The application sends the user query to the vector database to find similar documents. The QnA application submits a request to the SageMaker JumpStart model endpoint with the user query and context returned from the vector database. Basic familiarity with SageMaker and AWS services that support LLMs.
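A minimal sketch of that request with boto3; the endpoint name and payload shape are assumptions, since JumpStart models differ in the input format they expect:

```python
# A minimal sketch of submitting the query plus retrieved context to a
# SageMaker endpoint. The endpoint name and payload shape are assumptions;
# JumpStart models differ in the input format they expect.
import json
import boto3

runtime = boto3.client("sagemaker-runtime")

def ask(query: str, context: str) -> dict:
    payload = {"inputs": f"Context: {context}\n\nQuestion: {query}\nAnswer:"}
    resp = runtime.invoke_endpoint(
        EndpointName="jumpstart-llm-endpoint",  # placeholder endpoint name
        ContentType="application/json",
        Body=json.dumps(payload),
    )
    return json.loads(resp["Body"].read())
```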
This post takes you through the most common challenges that customers face when searching internal documents, and gives you concrete guidance on how AWS services can be used to create a generative AI conversational bot that makes internal information more useful. The web application front-end is hosted on AWS Amplify.
The final retrieval augmentation workflow covers the following high-level steps: the user query is passed to a retriever component, which performs a vector search to retrieve the most relevant context from our database. A vector database provides efficient vector similarity search through specialized indexes such as k-NN indexes.
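A minimal sketch of that vector search as an OpenSearch k-NN query; the index name, field name, and embed() stub are hypothetical:

```python
# A minimal sketch of the vector search as an OpenSearch k-NN query. The
# index, field name, and embed() stub are hypothetical.
from opensearchpy import OpenSearch

client = OpenSearch(hosts=[{"host": "localhost", "port": 9200}])

def embed(text: str) -> list[float]:
    # Stand-in: a real retriever would call an embedding model here.
    return [0.0] * 384

response = client.search(
    index="documents",  # assumes a k-NN-enabled index with an "embedding" field
    body={
        "size": 5,
        "query": {"knn": {"embedding": {"vector": embed("user question"), "k": 5}}},
    },
)
contexts = [hit["_source"]["text"] for hit in response["hits"]["hits"]]
```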
Moreover, as of November 2022, Studio supports shared spaces to accelerate real-time collaboration and multiple Amazon SageMaker domains in a single AWS Region for each account. First, we demonstrate how to perform backup and recovery if you create a new Studio domain, user, and space profiles using AWS CloudFormation templates.
Browse to locate the loan dataset in the Snowflake database. Select the two loan datasets by dragging and dropping them from the left side of the screen to the right. For more information on how to accelerate your journey from data to business insights, see the SageMaker Canvas immersion day and the AWS user guide. Product Manager at AWS.
Big Data Technologies: Handling and processing large datasets using tools like Hadoop, Spark, and cloud platforms such as AWS and Google Cloud. Databases and SQL: Managing and querying relational databases using SQL, as well as working with NoSQL databases like MongoDB.
In this post, we discuss how CCC Intelligent Solutions (CCC) combined Amazon SageMaker with other AWS services to create a custom solution capable of hosting the types of complex artificial intelligence (AI) models envisioned. Step-by-step solution: in step 1, a client makes a request to the Amazon API Gateway endpoint.
Examples of other PBAs now available include AWS Inferentia and AWS Trainium, Google TPU, and Graphcore IPU. Around this time, industry observers reported that NVIDIA's strategy was pivoting from its traditional gaming and graphics focus toward scientific computing and data analytics.
Data Science extracts insights and builds predictive models from processed data. Big Data technologies include Hadoop, Spark, and NoSQL databases. Data Science uses Python, R, and machine learning frameworks. This might involve querying databases, scraping websites, accessing APIs, or using existing datasets.
In this post, we define what a Generative AI Gateway is, its benefits, and how to architect one on AWS. AWS services can help in building a model abstraction layer (MAL) as follows: The generative AI manager creates a registry table using Amazon DynamoDB. Finally, a retroactive audit is available through AWS CloudTrail.
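A minimal sketch of such a registry entry in Amazon DynamoDB; the table name and attributes are assumptions about what a model abstraction layer might track:

```python
# A minimal sketch of a registry entry in Amazon DynamoDB. The table name and
# attributes are assumptions about what a model abstraction layer might track.
import boto3

dynamodb = boto3.resource("dynamodb")
table = dynamodb.Table("model-registry")  # placeholder table name

table.put_item(Item={
    "model_id": "anthropic.claude-3-haiku-20240307-v1:0",  # illustrative
    "provider": "bedrock",
    "max_tokens": 4096,
    "enabled": True,
})
```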
Solution architecture: The mmRAG solution is based on a straightforward concept: extract each data type separately, generate text summaries of them using a VLM, embed the text summaries along with the raw data into a vector database, and store the raw unstructured data in a document store.
Context is derived automatically through Amazon CloudFront headers included in requests, such as a REST API call in Amazon API Gateway that invokes an AWS Lambda function to retrieve recommendations. We provide an AWS CloudFormation template to create the necessary resources.