Data Science Current

RO-ViT: Region-aware pre-training for open-vocabulary object detection with vision transformers

Google Research AI blog

AUGUST 28, 2023

However, as VLMs are primarily designed for image-level tasks like classification and retrieval, they do not fully leverage the concept of objects or regions during the pre-training phase. Region-aware image-text pre-training: Existing VLMs are trained to match an image as a whole to a text description.

Clustering

Retain original PDF formatting to view translated documents with Amazon Textract, Amazon Translate, and PDFBox

AWS Machine Learning Blog

JULY 3, 2023

There are similar PDF processing libraries available in other programming languages, for example Node PDFBox. jar --source en --translated es Two translated PDF documents are created in the documents folder, with and without the original formatting (SampleOutput-es.pdf and SampleOutput-min-es.pdf). region(region).build();

AWS, ML, Clustering

Deploy pre-trained models on AWS Wavelength with 5G edge using Amazon SageMaker JumpStart

AWS Machine Learning Blog

APRIL 7, 2023

You can opt in to the Wavelength Zones within a given Region via the AWS Management Console or the AWS Command Line Interface (AWS CLI). Because SageMaker is not natively supported in Wavelength Zones, we demonstrate how to extract the model artifacts from the Region and re-deploy to the edge. Run the train_model.py sourcedir.tar.gz

AWS, Clustering, ML

Streamline diarization using AI as an assistive technology: ZOO Digital’s story

AWS Machine Learning Blog

FEBRUARY 20, 2024

ZOO Digital provides end-to-end localization and media services to adapt original TV and movie content to different languages, regions, and cultures. Remember to replace [REGION] with the AWS Region you are using. For other required Python packages, create a requirements.txt file with a list of packages and their versions.

AWS, AI, Machine Learning

Enhancing customer experience: Streamlining orders with custom email notifications in IBM Cloud

IBM Journey to AI blog

OCTOBER 11, 2023

Select a Region from the list of supported regions and select a pricing plan. This case study may inspire other businesses to explore similar solutions for improving customer engagement and satisfaction. Step 1: Create an IBM Cloud Event Notifications service instance. Log in to your IBM Cloud account. Provide a Service name.

Build RAG applications using Jina Embeddings v2 on Amazon SageMaker JumpStart

AWS Machine Learning Blog

JUNE 6, 2024

Choose Jina Embeddings v2 Base – en, which is Jina AI’s English language embeddings model. Our state-of-the-art text embedding models support English and Chinese and soon will support German, with other languages to follow. Search for “jina” and you will see the provider page link and models available from Jina AI. Choose Deploy.

AWS, Database, ML

Simplify continuous learning of Amazon Comprehend custom models using Comprehend flywheel

AWS Machine Learning Blog

MARCH 1, 2023

It develops insights by recognizing the entities, key phrases, language, sentiments, and other common elements in a document. As part of the next steps, you can explore the following: Create and manage Comprehend flywheel resources from other mediums such as SDK and console.

Data Lakes, AWS, ML

4 Best Practices for SAP Automation

Precisely

MARCH 5, 2024

Is it flexible enough to accommodate different process and data requirements for various regions, sales organizations, plants, or product lines? It should empower users to create or update records en masse using SAP-enabled Excel workbooks. It should allow you to supply, update, or approve data using role-based forms.

Data Quality

Falcon 2 11B is now available on Amazon SageMaker JumpStart

AWS Machine Learning Blog

MAY 31, 2024

It’s equipped with multilingual capabilities and can seamlessly tackle tasks in English, French, Spanish, German, Portuguese, and other languages for diverse scenarios. The Falcon 2 11B model is available today for inferencing from 22 AWS Regions where SageMaker JumpStart is available. En qué puedo ayudarte? Dónde puedo empezar?

AWS, Python, ML

Build financial search applications using the Amazon Bedrock Cohere multilingual embedding model

AWS Machine Learning Blog

JANUARY 12, 2024

The multilingual model groups text with similar meanings by assigning them positions that are close to each other in a semantic vector space. In this case, all returned results are in Danish, but the model can return a document in a language other than the query if its semantic meaning is closer.
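The idea of "positions that are close to each other in a semantic vector space" can be illustrated with a minimal sketch. The vectors, document names, and helper functions below are hypothetical toy values chosen for illustration, not output of the Cohere model or calls to the Amazon Bedrock API; the sketch only shows how cosine similarity ranks documents regardless of their language label.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def rank_by_similarity(query_vec, doc_vecs):
    """Return (doc_id, score) pairs sorted from most to least similar."""
    scored = [
        (doc_id, cosine_similarity(query_vec, vec))
        for doc_id, vec in doc_vecs.items()
    ]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


# Hypothetical 3-dimensional embeddings; real embeddings have far more dimensions.
docs = {
    "da_report": [0.9, 0.1, 0.0],  # Danish financial report (toy vector)
    "en_report": [0.8, 0.3, 0.1],  # English financial report (toy vector)
    "en_recipe": [0.0, 0.2, 0.9],  # unrelated English text (toy vector)
}
query = [1.0, 0.2, 0.0]  # toy embedding of a Danish financial query

ranking = rank_by_similarity(query, docs)
for doc_id, score in ranking:
    print(f"{doc_id}: {score:.3f}")
```

In this toy setup both financial reports score far above the unrelated text, so a document in another language can outrank same-language results whenever its vector lies closer to the query.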

Natural Language Processing, AWS, Data Science, Database

Explain medical decisions in clinical settings using Amazon SageMaker Clarify

AWS Machine Learning Blog

AUGUST 21, 2023

However, in a real-world use case, electronic health records (EHRs) or other hospital care applications would directly invoke the SageMaker endpoint to get the same response. The endpoint_name must be unique within a Region in your AWS account. In the sample code, we use a Jupyter notebook to showcase the functionality.

AWS, ML, Machine Learning

Zero-shot and few-shot prompting for the BloomZ 176B foundation model with the simplified Amazon SageMaker JumpStart SDK

AWS Machine Learning Blog

AUGUST 14, 2023

Translation: Sorry but I cannot. ### NLP Cloud permet de deployer le NLP en production facilement. You can take advantage of the model-specific default values we provide to specify the configuration, such as the Docker image, ML instance type, model artifact location, and hyperparameters, among other fields.

AWS, Natural Language Processing, Machine Learning

Google Research, 2022 & beyond: Research community engagement

Google Research AI blog

FEBRUARY 28, 2023

(You can find other posts in the series here.) We also support Responsible AI projects directly for other organizations, including our commitment of $3M to fund the new INSAIT research center based in Bulgaria. We partnered with ENS, a university in France, to help fund scholarships for students to train through research.

ML, Deep Learning

Advanced RAG patterns on Amazon SageMaker

AWS Machine Learning Blog

MARCH 28, 2024

Solution overview: In this post, we demonstrate the use of Mixtral-8x7B Instruct text generation combined with the BGE Large En embedding model to efficiently construct a RAG QnA system on an Amazon SageMaker notebook using the parent document retriever tool and contextual compression technique.

AWS, Machine Learning, AI

Use Amazon SageMaker Studio to build a RAG question answering solution with Llama 2, LangChain, and Pinecone for fast experimentation

Flipboard

NOVEMBER 20, 2023

We use two AWS Media & Entertainment Blog posts as the sample external data, which we convert into embeddings with the BAAI/bge-small-en-v1.5 Deploy the BAAI/bge-small-en-v1.5 In other words, implement RAG. em_model_name = "BAAI/bge-small-en" em_model_path = f"./em-model" Deploy the BAAI/bge-small-en-v1.5

AWS, Database, Machine Learning

Implementing a custom trainable component for relation extraction

Explosion

APRIL 27, 2023

Genes are regions of your DNA that code for specific proteins. We are interested in direct, binary relations between two entities, which means that this sentence contains two relevant instances: one where “GATA3” is the object and “FOXP3” is the subject and one where it’s the other way around.

Machine Learning, Deep Learning

Toki Pona: an attempted universal language with only ~120 words

Hacker News

AUGUST 13, 2023

The whole tok->en dictionary fits on seventeen printed pages [link]. There are no conjugations or tenses to memorize. Toki Pona, on the other hand, is a language simple enough that the entire world actually could learn it, had it reason to, and the buy-in cost in time is a lot lower. ³ Designing an actually learnable, universal(?)

Fine-tune Llama 2 for text generation on Amazon SageMaker JumpStart

AWS Machine Learning Blog

SEPTEMBER 6, 2023

You can also find four other model variants by choosing Explore all Text Generation Models or searching for llama in the search box. This significantly reduces the memory requirement because we need to store gradients, optimizer states, and other training-related information for only 1% of the parameters.

ML, Machine Learning

Cohere Command R and R+ are now available in Amazon SageMaker JumpStart

AWS Machine Learning Blog

APRIL 29, 2024

If an endpoint has already been created, you can simply connect to it: co = Client(region_name=region) co.connect_to_endpoint(endpoint_name="cohere-command-r-plus") Real-time inference: Once your endpoint has been connected, you can perform real-time inference using the co.chat endpoint.

AWS, Natural Language Processing, Database, ML

Build trust and safety for generative AI applications with Amazon Comprehend and LangChain

AWS Machine Learning Blog

NOVEMBER 10, 2023

At its core, the toxicity detection model analyzes text to determine the likelihood of it containing hateful content, threats, obscenities, or other forms of harmful text. Chains can be built by merging numerous chains or by mixing chains with other components.

AI, AWS, ML

Intelligent video and audio Q&A with multilingual support using LLMs on Amazon SageMaker

AWS Machine Learning Blog

AUGUST 15, 2023

On the other hand, langchain provides the recursive chunking text splitter function RecursiveCharacterTextSplitter, which can keep all the semantically relevant content in the same chunk. This means that when the vector of a query is close to the vector of one chunk, it is less likely to be close to the vectors of other chunks.
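The recursive chunking idea can be sketched in plain Python. This is a simplified illustration, not LangChain's actual implementation: the function name `recursive_split`, its parameters, and the sample text are assumptions made for this sketch. It tries the coarsest separator first (paragraph breaks) and falls back to finer ones only for pieces that are still too long, so related text tends to stay in one chunk.

```python
def recursive_split(text, chunk_size, separators=("\n\n", "\n", " ", "")):
    """Split text into chunks of at most chunk_size characters.

    Hypothetical helper: coarse separators are tried first, finer ones
    are used only for pieces that still exceed chunk_size.
    """
    if len(text) <= chunk_size:
        return [text] if text else []
    if not separators or separators[0] == "":
        # Last resort: cut at fixed width.
        return [text[i:i + chunk_size] for i in range(0, len(text), chunk_size)]

    sep, finer = separators[0], separators[1:]
    chunks, current = [], ""
    for piece in text.split(sep):
        candidate = piece if not current else current + sep + piece
        if len(candidate) <= chunk_size:
            current = candidate  # keep merging pieces while they fit
            continue
        if current:
            chunks.append(current)
        if len(piece) <= chunk_size:
            current = piece
        else:
            # The piece alone is too long: recurse with finer separators.
            chunks.extend(recursive_split(piece, chunk_size, finer))
            current = ""
    if current:
        chunks.append(current)
    return chunks


sample = (
    "Para one is short.\n\n"
    "Para two is a bit longer than the limit allows."
)
chunks = recursive_split(sample, chunk_size=30)
for chunk in chunks:
    print(repr(chunk))
```

The short first paragraph stays whole, while the longer second paragraph is split on word boundaries instead of being cut mid-word.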

AWS, ML, AI

A 50-Year Quest: My Personal Journey with the Second Law of Thermodynamics

Hacker News

FEBRUARY 2, 2023

The fifth one at first seemed quite mysterious—and somehow more abstract in its goals than the others: What story was the filmstrip on its cover telling? The other books I’d read had all basically said “physics works like this”. Then the third. The second. The fourth. For a couple of months I didn’t look seriously at the book.

Machine Learning, Algorithm, Analytics

Multi-Modal Methods: Visual Speech Recognition (Lip Reading)

ML Review

MAY 3, 2018

These characteristics mean that improvements and breakthroughs in one field may catalyse further progress in other fields. Deep Learning est en train de mourir. Figure 5: Saliency map of “Please”. Note from source: Saliency shows the places where LipNet has learned to attend, i.e. the phonologically important regions.

Deep Learning, Natural Language Processing, Machine Learning
