
Data is the fuel of AI applications, but the magnitude and scale of enterprise data often make it too expensive and time-consuming to use effectively.
According to IDC's Global DataSphere1, enterprises will generate 317 zettabytes of data annually by 2028 - including the creation of 29 zettabytes of unique data - of which 78% will be unstructured data and 44% of that will be audio and video. Because of the extremely high volume and various data types, most generative AI applications use a fraction of the total amount of data being stored and generated.
For enterprises to thrive in the AI era, they must find a way to make use of all of their data. This isn't possible using traditional computing and data processing techniques. Instead, enterprises need an AI query engine.
What Is an AI Query Engine? Simply, an AI query engine is a system that connects AI applications, or AI agents, to data. It's a critical component of agentic AI, as it serves as a bridge between an organization's knowledge base and AI-powered applications, enabling more accurate, context-aware responses.
AI agents form the basis of an AI query engine, where they can gather information and do work to assist human employees. An AI agent will gather information from many data sources, plan, reason and take action. AI agents can communicate with users, or they can work in the background, where human feedback and interaction will always be available.
In practice, an AI query engine is a sophisticated system that efficiently processes large amounts of data, extracts and stores knowledge, and performs semantic search on that knowledge, which can be quickly retrieved and used by AI.
An AI query engine processes, stores and retrieves data - connecting AI agents to insights. AI Query Engines Unlock Intelligence in Unstructured Data An enterprise's AI query engine will have access to knowledge stored in many different formats, but being able to extract intelligence from unstructured data is one of the most significant advancements it enables.
To generate insights, traditional query engines rely on structured queries and data sources, such as relational databases. Users must formulate precise queries using languages like SQL, and results are limited to predefined data formats.
In contrast, AI query engines can process structured, semi-structured and unstructured data. Common unstructured data formats are PDFs, log files, images and video, and are stored on object stores, file servers and parallel file systems. AI agents communicate with users and with each other using natural language. This enables them to interpret user intent, even when it's ambiguous, by accessing diverse data sources. These agents can deliver results in a conversational format, so that users can interpret results.
This capability makes it possible to derive more insights and intelligence from any type of data - not just data that fits neatly into rows and columns.
For example, companies like DataStax and NetApp are building AI data platforms that enable their customers to have an AI query engine for their next-generation applications.
Key Features of AI Query Engines AI query engines possess several crucial capabilities:
Diverse data handling: AI query engines can access and process various data types, including structured, semi-structured and unstructured data from multiple sources, including text, PDF, image, video and specialty data types.
Scalability: AI query engines can efficiently handle petabyte-scale data, making all enterprise knowledge available to AI applications quickly.
Accurate retrieval: AI query engines provide high-accuracy, high-performance embedding, vector search and reranking of knowledge from multiple sources.
Continuous learning: AI query engines can store and incorporate feedback from AI-powered applications, creating an AI data flywheel in which the feedback is used to refine models and increase the effectiveness of the applications over time.
Retrieval-augmented generation is a component of AI query engines. RAG uses the power of generative AI models to act as a natural language interface to data, allowing models to access and incorporate relevant information from large datasets during the response generation process.
Using RAG, any business or other organization can turn its technical information, policy manuals, videos and other data into useful knowledge bases. An AI query engine can then rely on these sources to support such areas as customer relations, employee training and developer productivity.
Additional information-retrieval techniques and ways to store knowledge are in research and development, so the capabilities of an AI query engine are expected to rapidly evolve.
The Impact of AI Query Engines Using AI query engines, enterprises can fully harness the power of AI agents to connect their workforces to vast amounts of enterprise knowledge, improve the accuracy and relevance of AI-generated responses, process and utilize previously untapped data sources, and create data-driven AI flywheels that continuously improve their AI applications.
Some examples include an AI virtual assistant that provides personalized, 24/7 customer service experiences, an AI agent for searching and summarizing video, an AI agent for analyzing software vulnerabilities or an AI research assistant.
Bridging the gap between raw data and AI-powered applications, AI query engines will grow to play a crucial role in helping organizations extract value from their data.
NVIDIA Blueprints can help enterprises get started connecting AI to their data. Learn more about NVIDIA Blueprints and try them in the NVIDIA API catalog.
IDC, Global DataSphere Forecast, 2024.
More from Nvidia
07/02/2025
Every year, venomous snakes kill over 100,000 people and leave 300,000 more with devastating injuries - amputations, paralysis and permanent disabilities. The v...
06/02/2025
AI built for speech is now decoding the language of earthquakes.
A team of researchers from the Earth and environmental sciences division at Los Alamos Nationa...
06/02/2025
GeForce NOW celebrates its fifth anniversary this February with a lineup of five major releases. The month kicks off with Kingdom Come: Deliverance II. Prepare ...
05/02/2025
When non-technical users can create and deploy reliable AI workflows, organizations can do more to serve their clientele
Platforms for developing no- and low-c...
05/02/2025
The financial services industry is reaching an important milestone with AI, as organizations move beyond testing and experimentation to successful AI implementa...
05/02/2025
NVIDIA's GeForce RTX 5090 and 5080 GPUs - which are based on the groundbreaking NVIDIA Blackwell architecture -offer up to 8x faster frame rates with NVIDIA...
04/02/2025
AI reasoning models and agents are set to transform industries, but delivering their full potential at scale requires massive compute and optimized software. Th...
31/01/2025
The recently released DeepSeek-R1 model family has brought a new wave of excitement to the AI community, allowing enthusiasts and developers to run state-of-the...
30/01/2025
DeepSeek-R1 is an open model with state-of-the-art reasoning capabilities. Instead of offering direct responses, AI models like DeepSeek-R1 perform reasoning th...
30/01/2025
GeForce NOW turns five this February. Five incredible years of high-performance gaming have been made possible thanks to the members who've joined the cloud...
30/01/2025
New GeForce RTX 5090 and RTX 5080 GPUs - built on the NVIDIA Blackwell architect...
29/01/2025
AI agents with advanced perception and cognition capabilities are making digital experiences more dynamic and personalized across retail, finance, entertainment...
27/01/2025
Named after Greek mythology's goddess of the sea, France-based startup Amphi...
23/01/2025
Businesses across every industry are rolling out AI services this year. For Microsoft, Oracle, Perplexity, Snap and hundreds of other leading companies, using t...
23/01/2025
GeForce NOW is expanding mod support for hit game Baldur's Gate 3 in collaboration with Larian Studios and mod.io for Ultimate and Performance members.
Thi...
22/01/2025
Companies and organizations are increasingly using AI to protect their customers and thwart the efforts of fraudsters around the world.
Voice security company ...
22/01/2025
Editor's note: This post is part of Into the Omniverse, a series focused on ...
22/01/2025
AI agents - which can understand, adapt to and support each user's unique journey - are making online shopping and digital marketing more efficient and pers...
21/01/2025
More than 90 million new vehicles are introduced to roads across the globe every...
16/01/2025
Time to suit up, members. The multiverse is about to get a whole lot cloudier as GeForce NOW opens a portal to the first season of hit game Marvel Rivals from N...
16/01/2025
AI agents are poised to transform productivity for the world's billion knowledge workers with knowledge robots that can accomplish a variety of tasks. To ...
15/01/2025
Troves of unwatched surgical video footage are finding new life, fueling AI tools that help make surgery safer and enhance surgical education. The Surgical Data...
14/01/2025
AI is making inroads across the entire healthcare industry - from genomic research to drug discovery, clinical trial workflows and patient care.
In a fireside ...
14/01/2025
Quantum computing is one of the most exciting areas in computer science, promising progress in accelerated computing beyond what's considered possible today...
13/01/2025
For decades, leadership in computing and software ecosystems has been a cornerst...
13/01/2025
For decades, leadership in computing and software ecosystems has been a cornerst...
13/01/2025
IQVIA, the world's leading provider of clinical research services, commercial insights and healthcare intelligence, is working with NVIDIA to build custom f...
10/01/2025
Artificial intelligence is rapidly becoming the cornerstone of innovation in the...
09/01/2025
Driving the future of smart mobility, Hyundai Motor Group (the Group) is partnering with NVIDIA to develop the next generation of safe, secure mobility with AI ...
09/01/2025
This GFN Thursday recaps the latest cloud announcements from the CES trade show, including GeForce RTX gaming expansion across popular devices such as Steam Dec...
08/01/2025
Over the past year, generative AI has transformed the way people live, work and play, enhancing everything from writing and content creation to gaming, learning...
07/01/2025
Data is the fuel of AI applications, but the magnitude and scale of enterprise data often make it too expensive and time-consuming to use effectively.
Accordin...
07/01/2025
In the fast-evolving landscape of AI, it's becoming increasingly important to develop models that can accurately simulate and predict outcomes in physical, ...
06/01/2025
The next big moment in AI is in sight - literally.
Today, more than 1.5 billion enterprise level cameras deployed worldwide are generating roughly 7 trillion h...
06/01/2025
Generative AI and foundation models let autonomous machines generalize beyond th...
06/01/2025
According to Gartner, the worldwide end-user spending on all IT products for 202...
02/01/2025
Artificial intelligence and accelerated computing are being used to help solve the world's greatest challenges.
NVIDIA has reinvented the computing stack -...
02/01/2025
GeForce NOW is kicking off 2025 by delivering 14 games to the cloud this month, with two available to stream this week so members can get started on their New Y...
30/12/2024
The pace of technology innovation has accelerated in the past year, most dramati...
27/12/2024
NVIDIA's AI Podcast gives listeners the inside scoop on the ways AI is transforming nearly every industry. Since the show's debut in 2016, it's gar...
26/12/2024
This GFN Thursday wraps up another incredible year for cloud gaming. Take a look back at the top games and new features that made 2024 a standout for GeForce NO...
24/12/2024
Editor's note: This post is part of the AI Decoded series, which demystifies AI by making the technology more accessible, and showcases new hardware, softwa...
19/12/2024
Shoppers pondering a new hairstyle can now try styles before committing to curls or a new color. An AI app by Ulta Beauty, the largest specialty beauty retailer...
19/12/2024
Stuck in a gaming rut? Get out of the loop this GFN Thursday with four new games...
18/12/2024
Editor's note: This post is part of the AI On blog series, which explores th...
18/12/2024
Imagine a future in which everyone is empowered to build and use their own AI agents. That future may not be far off, as new software is infused with intelligen...
18/12/2024
For more than two decades, the NVIDIA Graduate Fellowship Program has supported graduate students doing outstanding work relevant to NVIDIA technologies. Today,...
17/12/2024
In enterprise AI, understanding and working across multiple languages is no long...
17/12/2024
NVIDIA is taking the wraps off a new compact generative AI supercomputer, offering increased performance at a lower price with a software upgrade.
The new NVID...
16/12/2024
On Jan. 6 at 6:30 p.m. PT, NVIDIA founder and CEO Jensen Huang - with his trademark leather jacket and an unwavering vision - will step onto the CES 2025 stage....