
Nations around the world are pursuing sovereign AI to produce artificial intelligence using their own computing infrastructure, data, workforce and business networks to ensure AI systems align with local values, laws and interests.
In support of these efforts, NVIDIA today announced the availability of four new NVIDIA NIM microservices that enable developers to more easily build and deploy high-performing generative AI applications.
The microservices support popular community models tailored to meet regional needs. They enhance user interactions through accurate understanding and improved responses based on local languages and cultural heritage.
In the Asia-Pacific region alone, generative AI software revenue is expected to reach $48 billion by 2030 - up from $5 billion this year, according to ABI Research.
Llama-3-Swallow-70B, trained on Japanese data, and Llama-3-Taiwan-70B, trained on Mandarin data, are regional language models that provide a deeper understanding of local laws, regulations and other customs.
The RakutenAI 7B family of models, built on Mistral-7B, were trained on English and Japanese datasets, and are available as two different NIM microservices for Chat and Instruct. Rakuten's foundation and instruct models have achieved leading scores among open Japanese large language models, landing the top average score in the LM Evaluation Harness benchmark carried out from January to March 2024.
Training a large language model (LLM) on regional languages enhances the effectiveness of its outputs by ensuring more accurate and nuanced communication, as it better understands and reflects cultural and linguistic subtleties.
The models offer leading performance for Japanese and Mandarin language understanding, regional legal tasks, question-answering, and language translation and summarization compared with base LLMs like Llama 3.
Nations worldwide - from Singapore, the United Arab Emirates, South Korea and Sweden to France, Italy and India - are investing in sovereign AI infrastructure.
The new NIM microservices allow businesses, government agencies and universities to host native LLMs in their own environments, enabling developers to build advanced copilots, chatbots and AI assistants.
Developing Applications With Sovereign AI NIM Microservices Developers can easily deploy the sovereign AI models, packaged as NIM microservices, into production while achieving improved performance.
The microservices, available with NVIDIA AI Enterprise, are optimized for inference with the NVIDIA TensorRT-LLM open-source library.
NIM microservices for Llama 3 70B - which was used as the base model for the new Llama-3-Swallow-70B and Llama-3-Taiwan-70B NIM microservices - can provide up to 5x higher throughput. This lowers the total cost of running the models in production and provides better user experiences by decreasing latency.
The new NIM microservices are available today as hosted application programming interfaces (APIs).
Tapping NVIDIA NIM for Faster, More Accurate Generative AI Outcomes The NIM microservices accelerate deployments, enhance overall performance and provide the necessary security for organizations across global industries, including healthcare, finance, manufacturing, education and legal.
The Tokyo Institute of Technology fine-tuned Llama-3-Swallow 70B using Japanese-language data.
LLMs are not mechanical tools that provide the same benefit for everyone. They are rather intellectual tools that interact with human culture and creativity. The influence is mutual where not only are the models affected by the data we train on, but also our culture and the data we generate will be influenced by LLMs, said Rio Yokota, professor at the Global Scientific Information and Computing Center at the Tokyo Institute of Technology. Therefore, it is of paramount importance to develop sovereign AI models that adhere to our cultural norms. The availability of Llama-3-Swallow as an NVIDIA NIM microservice will allow developers to easily access and deploy the model for Japanese applications across various industries.
For instance, a Japanese AI company, Preferred Networks, uses the model to develop a healthcare specific model trained on a unique corpus of Japanese medical data, called Llama3-Preferred-MedSwallow-70B, that tops scores on the Japan National Examination for Physicians.
Chang Gung Memorial Hospital (CGMH), one of the leading hospitals in Taiwan, is building a custom-made AI Inference Service (AIIS) to centralize all LLM applications within the hospital system. Using Llama 3-Taiwan 70B, it is improving the efficiency of frontline medical staff with more nuanced medical language that patients can understand.
By providing instant, context-appropriate guidance, AI applications built with local-language LLMs streamline workflows and serve as a continuous learning tool to support staff development and improve the quality of patient care, said Dr. Changfu Kuo, director of the Center for Artificial Intelligence in Medicine at CGMH, Linko Branch. NVIDIA NIM is simplifying the development of these applications, allowing for easy access and deployment of models trained on regional languages with minimal engineering expertise.
Taiwan-based Pegatron, a maker of electronic devices, will adopt the Llama 3-Taiwan 70B NIM microservice for internal- and external-facing applications. It has integrated it with its PEGAAi Agentic AI System to automate processes, boosting efficiency in manufacturing and operations.
Llama-3-Taiwan 70B NIM is also being used by global petrochemical manufacturer Chang Chun Group, world-leading printed circuit board company Unimicron, technology-focused media company TechOrange, online contract service company LegalSign.ai and generative AI startup APMIC. These companies are also collaborating on the open model.
Creating Custom Enterprise Models
Most recent headlines
05/01/2027
Worlds first 802.15.4ab-UWB chip verified by Calterah and Rohde & Schwarz to be ...
01/06/2026
January 6 2026, 05:30 (PST) Dolby Sets the New Standard for Premium Entertainment at CES 2026
Throughout the week, Dolby brings to life the latest innovatio...
02/05/2026
Dalet, a leading technology and service provider for media-rich organizations, t...
01/05/2026
January 5 2026, 18:30 (PST) NBCUniversal's Peacock to Be First Streamer to ...
01/04/2026
January 4 2026, 18:00 (PST) DOLBY AND DOUYIN EMPOWER THE NEXT GENERATON OF CREATORS WITH DOLBY VISION
Douyin Users Can Now Create And Share Videos With Stun...
08/02/2026
The Alum Behind the Sound of the Super Bowl Joshua Sutherland BM '19 was recently interviewed by the Boston Globe about his role as music supervisor for t...
07/02/2026
X Games will host a live event taking place Thurs., March 12 at Cosm Los Angeles, marking the first-ever draft in X Games history and the official launch point ...
07/02/2026
Onsite experts and shop inside the stadium are providing quick solutions
Before any live event production, there is a high probability that unpredictable issue...
07/02/2026
Three Sony HDC-F5500s will capture this look on the broadcaster's concourse ...
07/02/2026
Inclusion in control-room revamp proves pivotal to supporting this tentpole even...
07/02/2026
The digital giant also produced episodes of The Edge with Micah Parsons' be...
07/02/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
07/02/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
07/02/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
06/02/2026
Appear, which specializes in live production technology, announces the appointme...
06/02/2026
Baller League US announces CBS Sports and its 24/7 soccer streaming channel CBS Sports Golazo Network will air the league's programming in the United States...
06/02/2026
Gravity Media, which concentrates in production, content, media services, and fa...
06/02/2026
The Alliance for IP Media Solutions (AIMS), together with the Video Services Forum (VSF), the Advanced Media Workflow Association (AMWA), and the European Broad...
06/02/2026
Bitmovin, a provider of video streaming solutions, announces that 1001, an OTT service in Iraq, has chosen the Bitmovin Player to improve its video streaming pe...
06/02/2026
Combate Global and content creator Shane Fazen announce a licensing agreement to distribute the Hispanic-focused franchise's first three live MMA events in ...
06/02/2026
Cisco is powering the invisible backbone of Super Bowl LX at Levi's Stadium as the technology giant delivers secure, high-capacity connectivity for over 70,...
06/02/2026
Over the past decade, the NFL and Amazon Web Services have changed how football analytics are analyzed and presented through Next Gen Stats. There's real-ti...
06/02/2026
In-venue and creative video staffers at the professional and collegiate level ha...
06/02/2026
In-venue and creative video staffers at the professional and collegiate level ha...
06/02/2026
Ratings Roundup is a rundown of recent rating news and is derived from press rel...
06/02/2026
How the podcast-turned-studio-show Boston Has Entered The Chat became an anchor ...
06/02/2026
ORF, the public service broadcaster for Austria, is in Italy for Milano Cortina 2026, ready to bring the country's most popular winter sports direct to view...
06/02/2026
Milano Cortina 2026 is now underway and Austrian public service broadcaster, ORF...
06/02/2026
Warner Bros. Discovery (WBD) has lifted the curtain on its studios in Italy that...
06/02/2026
Milano Cortina marks the first time since London 2012 that NRK has had the full ...
06/02/2026
Winter sports are wildly popular in Norway, with cross-country skiing and biathl...
06/02/2026
Norwegian broadcaster NRK has the free-to-air rights to the Olympics back for th...
06/02/2026
The production of the mega-esports event also leverages facilities at EA headqua...
06/02/2026
Here's a preview of NBC's massive game and pregame production operation as Super Bowl Sunday approaches....
06/02/2026
Music fans know the feeling: A song stops you in your tracks, and you immediately want to know more. What inspired it, and what's the meaning behind it? We ...
06/02/2026
The National Film and Video Foundation (NFVF), an agency of the Department of Sp...
06/02/2026
Calrec Wins Best of Show at ISE 2026 for Orchestrating Distributed IP Production
Calrec is delighted to announce that its IP Ecosystem Powered by True Control...
06/02/2026
Despite most never having strapped on skis or skates, Aussies are keen for some ...
06/02/2026
MNC Software, a global leader in network management and operational support systems tailored to the broadcast and media industry, today announced the launch of ...
06/02/2026
The annual Junior Eurovision Song Contest arrived at Tbilisi's Gymnastic Hall in Olympic City, presenting an international stage for young talent with rich,...
06/02/2026
NAB Show 2026 | April 19 22 | Booth # N2471
At this year s NAB Show, Sonnet will showcase new Thunderbolt 5 products, including desktop and rackmount PCIe card...
06/02/2026
The Alliance for IP Media Solutions (AIMS), together with the Video Services Forum (VSF), the Advanced Media Workflow Association (AMWA), and the European Broad...
06/02/2026
Dalet, a leading technology and service provider for media-rich organizations, today announced a major update to Dalet Flex. Building on the workflow packages a...
06/02/2026
Getting closer to the business through highly respected technology partner
Stand 4P880, ISE 2026, Fira de Barcelona, 3 6 February 2026
Bitfocus is acceleratin...
06/02/2026
Bitmovin, a leading provider of video streaming solutions, has announced that 1001, a premier OTT service in Iraq, has chosen the Bitmovin Player to improve its...
06/02/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
06/02/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
06/02/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
06/02/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
06/02/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...