It works by analyzing the visual content to find similar images in its database. Exclusive to Amazon Bedrock, the Amazon Titan family of models incorporates 25 years of experience innovating with AI and machine learning at Amazon. To do so, you can use a vector database. Retrieve the images stored in the S3 bucket: response = s3.list_objects_v2(Bucket=BUCKET_NAME)
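As a minimal sketch of that listing step, the helper below pulls the image keys out of a `list_objects_v2`-style response; the response shape follows the boto3 S3 API, while the bucket name, helper name, and file-extension filter are illustrative assumptions.

```python
# Hypothetical helper: filter image keys from an S3 ListObjectsV2 response.
IMAGE_SUFFIXES = (".jpg", ".jpeg", ".png")

def image_keys(response):
    """Return the keys in a list_objects_v2 response that look like images."""
    return [
        obj["Key"]
        for obj in response.get("Contents", [])
        if obj["Key"].lower().endswith(IMAGE_SUFFIXES)
    ]

# With boto3 (not run here; requires AWS credentials):
# import boto3
# s3 = boto3.client("s3")
# response = s3.list_objects_v2(Bucket=BUCKET_NAME)
# keys = image_keys(response)
```

Keeping the filtering logic pure makes it easy to test without AWS access.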
In the modern, cloud-centric business landscape, data is often scattered across numerous clouds and on-site systems. This fragmentation can complicate efforts by organizations to consolidate and analyze data for their machine learning (ML) initiatives. You should be able to run live queries against the BigQuery database.
OpenSearch Service is the AWS-recommended vector database for Amazon Bedrock. OpenSearch is a distributed open-source search and analytics engine composed of a search engine and vector database. To learn more, see Improve search results for AI using Amazon OpenSearch Service as a vector database with Amazon Bedrock.
Managing access control in enterprise machine learning (ML) environments presents significant challenges, particularly when multiple teams share Amazon SageMaker AI resources within a single Amazon Web Services (AWS) account.
As demonstrated in the following example, the system translates natural language queries about vehicles into SQL, returning structured information from the database. The database connection is configured through a SQLAlchemy engine. Daytime conditions, clear visibility. No suspicious behavior or safety concerns observed.
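A self-contained sketch of that text-to-SQL step is shown below, using the stdlib sqlite3 module in place of a SQLAlchemy engine so it runs without dependencies; the vehicles schema, the sample rows, and the "generated" query are illustrative assumptions, not the original post's code.

```python
import sqlite3

# Stand-in for the application's database; schema and rows are hypothetical.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE vehicles (make TEXT, model TEXT, year INTEGER)")
conn.executemany(
    "INSERT INTO vehicles VALUES (?, ?, ?)",
    [("Toyota", "Camry", 2020), ("Ford", "F-150", 2018), ("Toyota", "RAV4", 2022)],
)

# The kind of SQL an LLM might generate for
# "Which Toyota models are newer than 2019?"
generated_sql = "SELECT model, year FROM vehicles WHERE make = ? AND year > ?"
rows = conn.execute(generated_sql, ("Toyota", 2019)).fetchall()
print(rows)
```

With SQLAlchemy, the same query would go through `engine.connect()` and `text(...)`, but the flow is identical: generated SQL in, structured rows out.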
Adams’s recognitions include the 2012 Warren Alpert Foundation Prize for his role in the discovery and development of bortezomib, an anti-cancer drug; the 2012 C. The AI component of this is that the more images you show the computer, the more it learns, and the more accurately it describes the abnormality.
It was built using a combination of in-house and external cloud services on Microsoft Azure for large language models (LLMs), Pinecone for vector databases, and Amazon Elastic Compute Cloud (Amazon EC2) for embeddings. Opportunities for innovation: CreditAI by Octus version 1.x uses Retrieval Augmented Generation (RAG).
This involved creating a pipeline for data ingestion, preprocessing, metadata extraction, and indexing in a vector database. Similarity search and retrieval – The system retrieves the most relevant chunks from the vector database based on similarity scores to the query.
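The similarity-search step can be sketched in a few lines: score every chunk embedding against the query embedding and keep the top k. This is a pure-Python illustration of the cosine-similarity ranking a vector database performs; the chunk structure and function names are assumptions.

```python
import math

def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm

def top_k(query_vec, chunks, k=2):
    """Return the text of the k chunks most similar to the query embedding."""
    ranked = sorted(
        chunks, key=lambda c: cosine(query_vec, c["embedding"]), reverse=True
    )
    return [c["text"] for c in ranked[:k]]
```

A real vector database replaces the linear scan with an approximate nearest-neighbor index, but the scoring idea is the same.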
When you run the crawler, it creates metadata tables that are added to a database you specify or the default database. Approach 1: In-context learning In this approach, you use an LLM to generate the metadata descriptions. This approach is ideal for AWS Glue databases with a small number of tables. Build the prompt.
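The "build the prompt" step for the in-context learning approach might look like the following minimal sketch; the table name, column list, and exact prompt wording are illustrative assumptions, not the post's actual prompt.

```python
def build_metadata_prompt(table_name, columns):
    """Assemble an in-context prompt asking an LLM to describe a Glue table.

    columns is a list of (name, type) pairs, e.g. from the crawler's
    metadata table. The wording below is a hypothetical example.
    """
    column_lines = "\n".join(f"- {name}: {dtype}" for name, dtype in columns)
    return (
        "You are a data catalog assistant.\n"
        f"Write a one-sentence description for the table '{table_name}' "
        f"with these columns:\n{column_lines}\n"
        "Description:"
    )
```

The resulting string would then be sent to the LLM; for a database with only a handful of tables, one prompt per table is practical.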
As knowledge bases grow and require more granular embeddings, many vector databases that rely on high-performance storage such as SSDs or in-memory solutions become prohibitively expensive. In this post, we demonstrate how to integrate Amazon S3 Vectors with Amazon Bedrock Knowledge Bases for RAG applications.
In 2012, researchers proved that no deterministic algorithm can improve on O(log² n). See Bulánek, J., et al., 35th ACM SIGMOD-SIGACT-SIGART Symposium on Principles of Database Systems (PODS), pages 289-302, June 2016, and 44th Annual ACM Symposium on Theory of Computing (STOC), pages 1185-1198, 2012.
Under DATABASES, select glue_db_ or the custom Glue database name you provided during project creation. You will see a new database dev@ in the managed Amazon Redshift Serverless workgroup. Select Redshift (Lakehouse) from CONNECTIONS, dev@ from DATABASES, and public from SCHEMAS. Run the following SQL in order.
AWS (Amazon Web Services) is a comprehensive cloud computing platform offering a wide range of services like computing power, database storage, content delivery, and more. 2. Make sure that we have Powertools for AWS Lambda (Python) available in our runtime, for example, by attaching a Lambda layer to our function.
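As a sketch, attaching the Powertools layer in an AWS SAM template might look like the following config fragment; the function properties are hypothetical, and the region and layer version in the ARN are placeholders you must substitute (the account ID 017000801446 is the AWS-managed account that publishes the Powertools layers, but check the Powertools documentation for the current ARN).

```yaml
MyFunction:
  Type: AWS::Serverless::Function
  Properties:
    Runtime: python3.12
    Handler: app.lambda_handler
    Layers:
      # Placeholder ARN: substitute your region and the current layer version.
      - arn:aws:lambda:<region>:017000801446:layer:AWSLambdaPowertoolsPythonV2:<version>
```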
The Open Energy Profiler Toolset (OpenEPT) ecosystem will provide diverse hardware solutions, a user-friendly interface encapsulated in a GUI application, and a collaborative database infrastructure that brings together engineers and researchers to drive innovations in the field of battery-powered technologies.
phelddagrif: https://www.quarterbackranking.com It's a DIY confirmation bias machine for ranking NFL quarterbacks by a variety of stats. So I might be putting it into a database to hopefully aggregate multiples of the 10k results if they're not always the same 10k.
Feature Platforms — A New Paradigm in Machine Learning Operations (MLOps). Operationalizing Machine Learning is Still Hard. OpenAI introduced ChatGPT. The AI and Machine Learning (ML) industry has continued to grow at a rapid rate over recent years.
revolution has shown the value and importance of machine learning (ML) across verticals and environments, with more impact on manufacturing than possibly any other application. Now that signals are being generated, we can set up IoT Core to read the MQTT topics and direct the payloads to the Timestream database. Choose Add.
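Before IoT Core hands a payload to Timestream, the MQTT message has to be shaped into a Timestream record. The helper below sketches that mapping using the record shape of the Timestream `WriteRecords` API; the payload fields, measure name, and table layout are illustrative assumptions, not the original post's schema.

```python
import json
import time

def to_timestream_record(mqtt_payload, device_id):
    """Shape a JSON MQTT sensor message into a Timestream record dict.

    mqtt_payload is assumed (hypothetically) to carry a "temperature" field.
    """
    reading = json.loads(mqtt_payload)
    return {
        "Dimensions": [{"Name": "device_id", "Value": device_id}],
        "MeasureName": "temperature",
        "MeasureValue": str(reading["temperature"]),
        "MeasureValueType": "DOUBLE",
        "Time": str(int(time.time() * 1000)),
        "TimeUnit": "MILLISECONDS",
    }

# With boto3 (not run here; requires AWS credentials):
# client = boto3.client("timestream-write")
# client.write_records(DatabaseName=..., TableName=..., Records=[record])
```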
When building such generative AI applications using FMs or base models, customers want to generate a response without going over the public internet, or to ground responses in their proprietary data that may reside in their enterprise databases. You’re redirected to the IAM console. Currently, the VPC endpoint policy is set to Allow.
This allows SageMaker Studio users to perform petabyte-scale interactive data preparation, exploration, and machine learning (ML) directly within their familiar Studio notebooks, without the need to manage the underlying compute infrastructure. "arn:aws:s3:::*.elasticmapreduce", "arn:aws:s3:::*.elasticmapreduce/*"
That’s why our data visualization SDKs are database agnostic: so you’re free to choose the right stack for your application. There have been a lot of new entrants and innovations in the graph database category, with some vendors slowly dipping below the radar, or always staying on the periphery. can handle many graph-type problems.
Amazon SageMaker Studio provides a fully managed solution for data scientists to interactively build, train, and deploy machine learning (ML) models. For example, you can visually explore data sources like databases, tables, and schemas directly from your JupyterLab ecosystem. or later image versions.
Cloudera: For Cloudera, it’s all about machine learning optimization. Their CDP machine learning platform allows teams to collaborate across the full data life cycle with scalable computing resources, tools, and more.
With cloud computing, as compute power and data have become more available, machine learning (ML) is now making an impact across every industry and is a core part of every business and industry. The SourceIdentity attribute is used to tie the identity of the original SageMaker Studio user to the Amazon Redshift database user.
Facies classification using AI and machine learning (ML) has become an increasingly popular area of investigation for many oil majors. An existing database within Snowflake. Download the training_data.csv and validation_data_nofacies.csv files to your local machine. Do the same for the validation database.
With Amazon SageMaker, you can manage the whole end-to-end machine learning (ML) lifecycle. For this task, we build on top of the following GitHub repo: Manage your machine learning lifecycle with MLflow and Amazon SageMaker. mlflow/runs/search/", "arn:aws:execute-api: : : / /POST/api/2.0/mlflow/experiments/search",
# IAM role that is used by the bot at runtime
BotRuntimeRole:
  Type: AWS::IAM::Role
  Properties:
    AssumeRolePolicyDocument:
      Version: "2012-10-17"
      Statement:
        - Effect: Allow
          Principal:
            Service:
              - lexv2.amazonaws.com
For more information, refer to Enabling custom logic with AWS Lambda functions.
Many practitioners are extending these Redshift datasets at scale for machine learning (ML) using Amazon SageMaker, a fully managed ML service, with requirements to develop features offline in a code-based or low-code/no-code way, store feature data from Amazon Redshift, and make this happen at scale in a production environment.
Cortex ML is Snowflake’s newest feature, added to enhance the ease of use and low-code functionality of your business’s machine learning needs. What is Cortex ML, and Why Does it Matter? The newest ML functions are Forecasting, Anomaly Detection, and Contribution Explorer.
Amazon SageMaker Studio is a web-based integrated development environment (IDE) for machine learning (ML) that lets you build, train, debug, deploy, and monitor your ML models. She is passionate about making machine learning accessible to everyone.
Learning LLMs (Foundational Models)
Base Knowledge / Concepts: What is AI, ML and NLP
Introduction to ML and AI — MFML Part 1 — YouTube
What is NLP (Natural Language Processing)? — YouTube
Introduction to Natural Language Processing (NLP)
NLP 2012 Dan Jurafsky and Chris Manning (1.1)
They can help to ensure that machine learning models are developed and deployed efficiently, and that they remain reliable and accurate over time. AWS offers a three-layered machine learning stack to choose from, based on your skill set and your team’s requirements for implementing workloads that execute machine learning tasks.
In 2012, records show there were 447 data breaches in the United States. EVENT — ODSC East 2024 In-Person and Virtual Conference April 23rd to 25th, 2024 Join us for a deep dive into the latest data science and AI trends, tools, and techniques, from LLMs to data analytics and from machine learning to responsible AI.
After the doctor has successfully signed in, the application retrieves the list of patients associated with the doctor’s ID from the Amazon DynamoDB table. Before querying the knowledge base, the Lambda function retrieves data from the DynamoDB table, which stores doctor-patient associations.
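The lookup can be sketched as a DynamoDB Query keyed on the doctor's ID. The helper below builds the low-level Query parameters in the shape the DynamoDB API expects; the table name and attribute names are illustrative assumptions about the schema described here.

```python
def patient_query_params(table_name, doctor_id):
    """Build low-level DynamoDB Query parameters that fetch the patients
    associated with a doctor. Table and attribute names are hypothetical.
    """
    return {
        "TableName": table_name,
        "KeyConditionExpression": "doctor_id = :d",
        "ExpressionAttributeValues": {":d": {"S": doctor_id}},
    }

# With boto3 (not run here; requires AWS credentials):
# dynamodb = boto3.client("dynamodb")
# items = dynamodb.query(**patient_query_params("doctor_patients", "doc-123"))["Items"]
```

Building the parameters separately from the call keeps the association logic testable without a live table.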
With the application of natural language processing (NLP) and machine learning algorithms, AI systems can understand and translate spoken language into written notes. Utilizing its CognitiveML engine, Iodine Software implements advanced machine learning across various use-case scenarios within a database of patient admissions.
Without careful architectural planning, RAG implementations can lead to unnecessary expenses through duplicate data storage, excessive vector database operations, and inefficient data transfer patterns. Aamna Najmi is a Senior GenAI and Data Specialist in the Worldwide team at Amazon Web Services (AWS).
The orchestrating Lambda function calls the Amazon Bedrock LLM endpoint to generate a final order summary including the order total from the customer database system (for example, Amazon DynamoDB). A strategic leader with expertise in cloud architecture, generative AI, machine learning, and data analytics.
Since DataRobot was founded in 2012, we’ve been committed to democratizing access to the power of AI. DataRobot AI Cloud brings together any type of data from any source to give our customers a holistic view that drives their business: critical information in databases, data clouds, cloud storage systems, enterprise apps, and more.
In this post, we discuss a machine learning (ML) solution for complex image searches using Amazon Kendra and Amazon Rekognition. With the internet, searching for and obtaining an image has never been easier. She is also passionate about the field of machine learning.
Python for machine learning and data science: In the early 1990s, scientists used Fortran and C++ libraries to solve mathematical problems. However, over time these modules became outdated. Around 2012 to 2014, developers proposed updating these modules, but were told to use third-party libraries instead.
December 2012: Alation forms and goes to work creating the first enterprise data catalog. October 2020: Forrester Research names Alation a Leader in The Forrester Wave: Machine Learning Data Catalogs, Q4 2020. Here’s a timeline view of what the market has said about Alation since our founding: Timeline: 10 Years of Alation.
These activities cover disparate fields such as basic data processing, analytics, and machine learning (ML). Work by Hinton et al. in 2012 is now widely referred to as ML’s “Cambrian Explosion.” Machine learning: Generative AI is the most topical ML application at this point in time.
The LLMs Have Landed. The machine learning superfunctions Classify and Predict first appeared in Wolfram Language in 2014 (Version 10). And in a similar vein, we can expect LLMs to be useful in making connections to external databases, functions, etc. But in Version 14.0 and if it’s right, can be used henceforth.
Here are a few reasons why an agent needs tools: Access to external resources: Tools allow an agent to access and retrieve information from external sources, such as databases, APIs, or web scraping. This includes cleaning and transforming data, performing calculations, or applying machine learning algorithms.
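The tool-access idea can be sketched as a registry of named functions the agent dispatches to by name. Everything below (the decorator, the example tool, and its lookup data) is a hypothetical illustration of the pattern, not any particular framework's API.

```python
# Minimal sketch of agent tools: a registry of named functions.
TOOLS = {}

def tool(fn):
    """Register a function as a tool the agent may call by name."""
    TOOLS[fn.__name__] = fn
    return fn

@tool
def lookup_population(city):
    # Stand-in for a real database or API call.
    data = {"Paris": 2_100_000, "Lyon": 520_000}
    return data.get(city, "unknown")

def run_tool(name, *args):
    """Dispatch a tool request the agent emitted (name plus arguments)."""
    return TOOLS[name](*args)
```

In a real agent loop, the LLM emits the tool name and arguments, the runtime calls `run_tool`, and the result is fed back into the model's context.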
To do great NLP, you have to know a little about linguistics, a lot about machine learning, and almost everything about the latest research. Efficiency is a major concern for NLP applications. I spent a long time on spaCy’s initial design before this announcement. Hardware: Intel i7-3770 (2012). [Benchmark table fragment: ZPar 1ms / 8ms / 850ms; relative speedups 5x / 8x / 44.7x]