AWS Turbocharges Foundation Models With Smart AI Agents

Contributor

Contributor Group

Aug 4, 2023,09:00am EDT

AWS

AWS recently announced an important AI capability called agents that adds key functionality to its foundation models. Before discussing agents in detail, I will provide another overview of the Amazon Bedrock foundation models (FMs) that will use these new features. Patrick Moorhead, founder and Chief Analyst of Moor Insights and Strategy, provided a first look at AWS Bedrock in an earlier Fobes article.

Overview of Bedrock foundation models

Amazon Bedrock uses API calls to facilitate integrating foundation models into applications. Bedrock's API service also allows developers to build applications without the need to manage AI infrastructure at all.

Training foundation models is a time-consuming process. You can train small models with just a few GPUs and about 100 million parameters in a few days. On the other hand, most models used in the corporate environment are large language models (LLMs) running on thousands of GPUs that can take months to train.

Using Bedrock, LLMs like Codex and GPT-4 can generate text, answer questions and summarize content without the need to build and train them from scratch.

Bedrock foundation models are already optimized and pretrained for conversations and content creation. And because they run on an AWS-managed infrastructure, the models can be scaled on services like EC2 and Lambda and provide low-latency endpoints to enable real-time integration into workflows.

Bedrock models can also be customized by using parameters in the API with features including caching, access controls and usage monitoring.

Bedrock offers a choice of foundation models

MORE FROMFORBES ADVISOR

Best Travel Insurance Companies

Amy Danise

Editor

Best Covid-19 Travel Insurance Plans

Amy Danise

Editor

AWS

The two Amazon Titan foundation models, Titan Text and Titan Embeddings, are pretrained on large datasets. Both models can be used as-is or customized for a particular task using proprietary data, which avoids the expense and time that developers would otherwise need to annotate the model with large volumes of data.

Titan Text is a large language model used for natural language processing (NLP) tasks such as summarization, text generation, classification, QA and information extraction from text.
Titan Embeddings is more complicated than Titan Text. Instead of generating text, Embeddings encodes words, phrases or blocks of text into high-dimensional numeric vectors used for semantic search, recommendations, sentiment analysis and other tasks. The model finds relationships between conceptual meanings rather than between keywords.

Jurassic-2, created by AI21 Studio, is a multilingual LLM for generating text in Spanish, French, German, Portuguese, Italian and Dutch. The model is available in three sizes: Large, Grande and Jumbo, alongside instruction-tuned language models for Jumbo and Grande. Jurrassic-2 also offers zero-shot instruction capabilities, which means developers can steer the model with natural language prompts without using examples.

Claude-2 is an LLM designed for dialogue, content creation, complex reasoning, creativity and coding. Anthropic based it on Constitutional AI and safe training. Claude-2 input can accommodate 100,000 tokens, equivalent to about 75,000 words. That makes it possible to input hundreds of pages of information for analysis.

Stable Diffusion is a text-to-image open-source model created by Stability.ai and trained on a dataset with 5 billion image-text pairs. This model can create realistic, high-quality images of different styles and content by using a text prompt.

Command and Embed, two of the newest foundation models available on AWS Bedrock, were created by Cohere.

Cohere Command is a large natural language text generation model focused on task-oriented dialogue. It creates summarization, copywriting, dialogue extraction and questions and answers. Its use cases include database queries, forms and website navigation.
Cohere Embed generates vector embeddings that represent the semantic meaning of text. It encodes words, sentences and whole documents into high-dimensional numeric vectors. It is useful for semantic search, recommendations and personalization based on meaning.

Agents for Amazon Bedrock

AWS

Foundation models are powerful, but LLMs cannot execute any tasks without proper resources. For that reason, AWS created intelligent agents to manage and perform complex tasks associated with foundation models.

When using agents, developers need only to provide high-level goals or natural language instructions to the foundation model. The agent handles interpreting those instructions, orchestrating the steps involved, integrating the instructions with various systems and providing the right prompts to the model.

Agents allow developers to complete tasks and workflows simply. Natural language instructions direct agents to automate workflows, gather specified information, monitor systems for events or fill out web forms. The agent maps the developer's natural language instructions to the specific actions and workflows needed to complete the goal.

Even though agents can interpret user requests, carry on a conversation and break down complex tasks into simple steps, developers must still provide the right prompt engineering to ensure that the agent has the right prompts and instructions to work with.

Agents leverage AWS’s cloud infrastructure to scale and maintain real-time responsiveness to users. The cloud also connects agents to external data sources such as OpenSearch or other databases to retrieve the latest contextual information. By combining cloud hosting with prompt augmentation from live data, agents can provide low-latency responses to users even during high-volume usage.

Wrapping up

According to Amazon, thousands of customers are now using Amazon Bedrock for various generative AI applications such as self-service, customer care, text creation and post-call analysis. Having a choice of several models is significant because no one model can satisfy every use case within an enterprise. That’s why AWS offers two foundation models from Amazon plus foundation models from best-in-class AI startups.

How does this work in the real world? In one example, a major insurance firm is using Bedrock to test generative AI applications for analyzing market data and evaluating the impact of AI functionality on employee efficiency. In another instance, a bank is working with the AWS Generative AI Innovation Center to pioneer new use cases based on Bedrock foundation models.

Although machine learning has been used in customer care applications for a decade, and chatbots and AI-powered contact centers have been around for years, the recent surge in generative AI has dramatically improved the versatility of these applications.

Generative AI applications such as conversational search, text summarization and other “copilot” actions have already increased employee productivity. In a more technical setting, code generation is one of the most important productivity improvements for software developers.

Generative AI is still in the early phases of enhancing business operations, but as the technology continues to improve, it will enable even more significant optimizations across organizations.

At the same time that AWS announced agents for BedRock foundation models, it also made other important AI announcements that included AWS Entity Resolution, Amazon EC2 P5 Instances using NVIDIA H100 Tensor Core GPUs, and Generative BI capabilities in Amazon QuickSight. I plan to cover a few of these enhancements in a separate article.

Moor Insights & Strategy provides or has provided paid services to technology companies like all research and tech industry analyst firms. These services include research, analysis, advising, consulting, benchmarking, acquisition matchmaking, and video and speaking sponsorships. The company has had or currently has paid business relationships with 8×8, Accenture, A10 Networks, Advanced Micro Devices, Amazon, Amazon Web Services, Ambient Scientific, Ampere Computing, Anuta Networks, Applied Brain Research, Applied Micro, Apstra, Arm, Aruba Networks (now HPE), Atom Computing, AT&T, Aura, Automation Anywhere, AWS, A-10 Strategies, Bitfusion, Blaize, Box, Broadcom, C3.AI, Calix, Cadence Systems, Campfire, Cisco Systems, Clear Software, Cloudera, Clumio, Cohesity, Cognitive Systems, CompuCom, Cradlepoint, CyberArk, Dell, Dell EMC, Dell Technologies, Diablo Technologies, Dialogue Group, Digital Optics, Dreamium Labs, D-Wave, Echelon, Ericsson, Extreme Networks, Five9, Flex, Foundries.io, Foxconn, Frame (now VMware), Fujitsu, Gen Z Consortium, Glue Networks, GlobalFoundries, Revolve (now Google), Google Cloud, Graphcore, Groq, Hiregenics, Hotwire Global, HP Inc., Hewlett Packard Enterprise, Honeywell, Huawei Technologies, HYCU, IBM, Infinidat, Infoblox, Infosys, Inseego, IonQ, IonVR, Inseego, Infosys, Infiot, Intel, Interdigital, Jabil Circuit, Juniper Networks, Keysight, Konica Minolta, Lattice Semiconductor, Lenovo, Linux Foundation, Lightbits Labs, LogicMonitor, LoRa Alliance, Luminar, MapBox, Marvell Technology, Mavenir, Marseille Inc, Mayfair Equity, Meraki (Cisco), Merck KGaA, Mesophere, Micron Technology, Microsoft, MiTEL, Mojo Networks, MongoDB, Multefire Alliance, National Instruments, Neat, NetApp, Nightwatch, NOKIA, Nortek, Novumind, NVIDIA, Nutanix, Nuvia (now Qualcomm), NXP, onsemi, ONUG, OpenStack Foundation, Oracle, Palo Alto Networks, Panasas, Peraso, Pexip, Pixelworks, Plume Design, PlusAI, Poly (formerly Plantronics), Portworx, Pure Storage, Qualcomm, Quantinuum, Rackspace, Rambus, Rayvolt E-Bikes, Red Hat, Renesas, Residio, Samsung Electronics, Samsung Semi, SAP, SAS, Scale Computing, Schneider Electric, SiFive, Silver Peak (now Aruba-HPE), SkyWorks, SONY Optical Storage, Splunk, Springpath (now Cisco), Spirent, Splunk, Sprint (now T-Mobile), Stratus Technologies, Symantec, Synaptics, Syniverse, Synopsys, Tanium, Telesign,TE Connectivity, TensTorrent, Tobii Technology, Teradata,T-Mobile, Treasure Data, Twitter, Unity Technologies, UiPath, Verizon Communications, VAST Data, Ventana Micro Systems, Vidyo, VMware, Wave Computing, Wellsmith, Xilinx, Zayo, Zebra, Zededa, Zendesk, Zoho, Zoom, and Zscaler. Moor Insights & Strategy founder, CEO, and Chief Analyst Patrick Moorhead is an investor in dMY Technology Group Inc. VI, Fivestone Partners, Frore Systems, Groq, MemryX, Movandi, and Ventana Micro., MemryX, Movandi, and Ventana Micro.

Paul Smith-Goodson

Following

One Community. Many Voices. Create a free account to share your thoughts.

Read our community guidelines .

Our community is about connecting people through open and thoughtful conversations. We want our readers to share their views and exchange ideas and facts in a safe space.

In order to do so, please follow the posting rules in our site's Terms of Service. We've summarized some of those key rules below. Simply put, keep it civil.

Your post will be rejected if we notice that it seems to contain:

False or intentionally out-of-context or misleading information
Spam
Insults, profanity, incoherent, obscene or inflammatory language or threats of any kind
Attacks on the identity of other commenters or the article's author
Content that otherwise violates our site's terms.

User accounts will be blocked if we notice or believe that users are engaged in:

Continuous attempts to re-post comments that have been previously moderated/rejected
Racist, sexist, homophobic or other discriminatory comments
Attempts or tactics that put the site security at risk
Actions that otherwise violate our site's terms.

So, how can you be a power user?

Stay on topic and share your insights
Feel free to be clear and thoughtful to get your point across
‘Like’ or ‘Dislike’ to show your point of view.
Protect your community.
Use the report tool to alert us when someone breaks the rules.

Thanks for reading our community guidelines. Please read the full list of posting rules found in our site's Terms of Service.

AWS Turbocharges Foundation Models With Smart AI Agents

Best Travel Insurance Companies

Best Covid-19 Travel Insurance Plans

Join The Conversation

Forbes Community Guidelines

More From Forbes

AWS Turbocharges Foundation Models With Smart AI Agents

Best Travel Insurance Companies

Best Covid-19 Travel Insurance Plans

Join The Conversation

Forbes Community Guidelines