{"id":110526,"date":"2025-11-12T16:09:44","date_gmt":"2025-11-12T16:09:44","guid":{"rendered":"https:\/\/www.artificialintelligence-news.com\/?p=110526"},"modified":"2025-11-12T16:09:47","modified_gmt":"2025-11-12T16:09:47","slug":"baidu-ernie-multimodal-ai-gpt-and-gemini-benchmarks","status":"publish","type":"post","link":"https:\/\/www.artificialintelligence-news.com\/news\/baidu-ernie-multimodal-ai-gpt-and-gemini-benchmarks\/","title":{"rendered":"Baidu ERNIE multimodal AI beats GPT and Gemini in benchmarks"},"content":{"rendered":"\n<p>Baidu&#8217;s latest ERNIE model, a super-efficient multimodal AI, is beating GPT and <a href=\"https:\/\/www.developer-tech.com\/news\/google-upgrades-gemini-ai-for-android-enterprise-apps\/\">Gemini<\/a> on key benchmarks and targets enterprise data often ignored by text-focused models.<\/p>\n\n\n\n<p>For many businesses, valuable insights are locked in engineering schematics, factory-floor video feeds, medical scans, and logistics dashboards. Baidu&#8217;s new model, ERNIE-4.5-VL-28B-A3B-Thinking, is designed to fill this gap.<\/p>\n\n\n\n<p>What\u2019s interesting to enterprise architects is not just its multimodal capability, but its architecture. It&#8217;s described as a &#8220;lightweight&#8221; model, activating only three billion parameters during operation. This approach targets the high inference costs that often stall AI-scaling projects. Baidu is betting on efficiency as a path to adoption, training the system as a foundation for &#8220;multimodal agents&#8221; that can reason and act, not just perceive.<\/p>\n\n\n\n<h3 class=\"wp-block-heading\" id=\"h-complex-visual-data-analysis-capabilities-supported-by-ai-benchmarks\">Complex visual data analysis capabilities supported by AI benchmarks<\/h3>\n\n\n\n<p>Baidu&#8217;s multimodal ERNIE AI model excels at handling dense, non-text data. 
For example, it can interpret a \u201cPeak Time Reminder\u201d chart to find optimal visiting hours, a task that reflects the resource-scheduling challenges in logistics or retail.<\/p>\n\n\n\n<p>ERNIE 4.5 also shows capability in technical domains, like solving a bridge circuit diagram by applying Ohm\u2019s and Kirchhoff\u2019s laws. For R&amp;D and engineering arms, a future assistant could validate designs or explain complex schematics to new hires.<\/p>\n\n\n\n<p>This capability is supported by Baidu&#8217;s benchmarks, which show ERNIE-4.5-VL-28B-A3B-Thinking outperforming competitors like GPT-5-High and Gemini 2.5 Pro on some key tests:<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>MathVista: ERNIE (82.5) vs Gemini (82.3) and GPT (81.3)<\/li>\n<\/ul>\n\n\n\n<ul class=\"wp-block-list\">\n<li>ChartQA: ERNIE (87.1) vs Gemini (76.3) and GPT (78.2)<\/li>\n<\/ul>\n\n\n\n<ul class=\"wp-block-list\">\n<li>VLMs Are Blind: ERNIE (77.3) vs Gemini (76.5) and GPT (69.6)<\/li>\n<\/ul>\n\n\n\n<p>It\u2019s worth noting, of course, that AI benchmarks provide a guide but <a href=\"https:\/\/www.artificialintelligence-news.com\/news\/flawed-ai-benchmarks-enterprise-budgets-at-risk\/\">can be flawed<\/a>. Always perform internal tests for your needs before deploying any AI model for mission-critical applications.<\/p>\n\n\n\n<h3 class=\"wp-block-heading\" id=\"h-baidu-shifts-from-perception-to-automation-with-its-latest-ernie-ai-model\">Baidu shifts from perception to automation with its latest ERNIE AI model<\/h3>\n\n\n\n<p>The primary hurdle for enterprise AI is moving from perception (&#8220;what is this?&#8221;) to automation (&#8220;what now?&#8221;). ERNIE 4.5 claims to address this by integrating visual grounding with tool use.<\/p>\n\n\n\n<p>Asking the multimodal AI to find all people wearing suits in an image and return their coordinates in JSON format works. 
The model generates the structured data, a function easily transferable to a production line for visual inspection or to a system auditing site images for safety compliance.<\/p>\n\n\n\n<p>The model also manages external tools and can autonomously zoom in on a photograph to read small text. If it faces an unknown object, it can trigger an image search to identify it. This represents a less passive form of AI that could power an agent to not only flag a data centre error, but also zoom in on the code, search the internal knowledge base, and suggest the fix.<\/p>\n\n\n\n<h3 class=\"wp-block-heading\" id=\"h-unlocking-business-intelligence-with-multimodal-ai\">Unlocking business intelligence with multimodal AI<\/h3>\n\n\n\n<p>Baidu&#8217;s latest ERNIE AI model also targets corporate video archives, from training sessions and meetings to security footage. It can extract all on-screen subtitles and map them to their precise timestamps.<\/p>\n\n\n\n<p>It also demonstrates temporal awareness, finding specific scenes (like those &#8220;filmed on a bridge&#8221;) by analysing visual cues. The clear end-goal is making vast video libraries searchable, allowing an employee to find the exact moment a specific topic was discussed in a two-hour webinar they may have dozed off a couple of times during.<\/p>\n\n\n\n<p>Baidu provides deployment guidance for several paths, including Transformers, vLLM, and FastDeploy. However, the hardware requirements are a major barrier. A single-card deployment needs 80GB of GPU memory. This is not a tool for casual experimentation, but for organisations with existing, high-performance AI infrastructure.<\/p>\n\n\n\n<p>For those with the hardware, Baidu\u2019s ERNIEKit toolkit allows fine-tuning on proprietary data, a necessity for most high-value use cases.
Baidu is providing its latest ERNIE AI model with an Apache 2.0 licence that permits commercial use, which is essential for adoption.<\/p>\n\n\n\n<p>The market is finally moving toward multimodal AI that can see, read, and act within a specific business context, and the benchmarks suggest it\u2019s doing so with impressive capability. The immediate task is to identify high-value visual reasoning jobs within your own operation and weigh them against the substantial hardware and governance costs.<\/p>\n\n\n\n<p><strong>See also: <\/strong><a href=\"https:\/\/www.artificialintelligence-news.com\/news\/wiz-security-lapses-emerge-amid-global-ai-race\/\"><strong>Wiz: Security lapses emerge amid the global AI race<\/strong><\/a><\/p>\n\n\n\n<figure class=\"wp-block-image aligncenter size-full is-resized\"><a href=\"https:\/\/www.ai-expo.net\/?utm_source=AI-News&amp;utm_medium=Footer-banner&amp;utm_campaign=world-series\"><img fetchpriority=\"high\" decoding=\"async\" width=\"728\" height=\"90\" src=\"https:\/\/www.artificialintelligence-news.com\/wp-content\/uploads\/2025\/10\/image-10.png\" alt=\"Banner for AI &amp; Big Data Expo by TechEx events.\" class=\"wp-image-110077\" style=\"width:800px;height:auto\" srcset=\"https:\/\/www.artificialintelligence-news.com\/wp-content\/uploads\/2025\/10\/image-10.png 728w, https:\/\/www.artificialintelligence-news.com\/wp-content\/uploads\/2025\/10\/image-10-300x37.png 300w\" sizes=\"(max-width: 728px) 100vw, 728px\" \/><\/a><\/figure>\n\n\n\n<p><strong>Want to learn more about AI and big data from industry leaders?<\/strong> Check out <a href=\"https:\/\/www.ai-expo.net\/?utm_source=AI-News&amp;utm_medium=Footer-banner&amp;utm_campaign=world-series\">AI &amp; Big Data Expo<\/a> taking place in Amsterdam, California, and London. 
The comprehensive event is part of <a href=\"https:\/\/techexevent.com\/?utm_source=AI-News&amp;utm_medium=Footer-banner&amp;utm_campaign=world-series\">TechEx<\/a> and is co-located with other leading technology events including the <a href=\"https:\/\/www.cybersecuritycloudexpo.com\/\">Cyber Security Expo<\/a>. Click <a href=\"https:\/\/techexevent.com\/?utm_source=AI-News&amp;utm_medium=Footer-banner&amp;utm_campaign=world-series\">here<\/a> for more information.<\/p>\n\n\n\n<p>AI News is powered by <a href=\"https:\/\/techforge.pub\/?utm_source=AI-News&amp;utm_medium=Footer-banner&amp;utm_campaign=world-series\">TechForge Media<\/a>. Explore other upcoming enterprise technology events and webinars <a href=\"https:\/\/techforge.pub\/events\/?utm_source=AI-News&amp;utm_medium=Footer-banner&amp;utm_campaign=world-series\">here<\/a>.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Baidu&#8217;s latest ERNIE model, a super-efficient multimodal AI, is beating GPT and Gemini on key benchmarks and targets enterprise data often ignored by text-focused models. For many businesses, valuable insights are locked in engineering schematics, factory-floor video feeds, medical scans, and logistics dashboards. Baidu&#8217;s new model, ERNIE-4.5-VL-28B-A3B-Thinking, is designed to fill this gap. 
What\u2019s interesting [&hellip;]<\/p>\n","protected":false},"author":1570,"featured_media":110528,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"_acf_changed":false,"footnotes":""},"categories":[3367,3334,3336,3346,3339,3338],"tags":[186,192,252,954,3324,2964,2745],"ppma_author":[2401],"class_list":["post-110526","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-ai-market-trends","category-how-it-works","category-inside-ai","category-multimodal-ai","category-open-source-democratised-ai","category-world-of-work","tag-ai","tag-artificial-intelligence","tag-baidu","tag-enterprise","tag-ernie","tag-models","tag-multimodal"],"acf":[],"yoast_head":"<!-- This site is optimized with the Yoast SEO Premium plugin v27.1 (Yoast SEO v27.1.1) - https:\/\/yoast.com\/product\/yoast-seo-premium-wordpress\/ -->\n<title>Baidu ERNIE multimodal AI beats GPT and Gemini in benchmarks<\/title>\n<meta name=\"description\" content=\"Baidu&#039;s latest ERNIE model, a super-efficient multimodal AI, is beating GPT and Gemini on some key benchmarks.\" \/>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/www.artificialintelligence-news.com\/news\/baidu-ernie-multimodal-ai-gpt-and-gemini-benchmarks\/\" \/>\n<meta property=\"og:locale\" content=\"en_GB\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Baidu ERNIE multimodal AI beats GPT and Gemini in benchmarks\" \/>\n<meta property=\"og:description\" content=\"Baidu&#039;s latest ERNIE model, a super-efficient multimodal AI, is beating GPT and Gemini on some key benchmarks.\" \/>\n<meta property=\"og:url\" content=\"https:\/\/www.artificialintelligence-news.com\/news\/baidu-ernie-multimodal-ai-gpt-and-gemini-benchmarks\/\" \/>\n<meta property=\"og:site_name\" content=\"AI News\" 
\/>\n<meta property=\"article:publisher\" content=\"https:\/\/www.facebook.com\/AITechNews\/\" \/>\n<meta property=\"article:published_time\" content=\"2025-11-12T16:09:44+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2025-11-12T16:09:47+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/www.artificialintelligence-news.com\/wp-content\/uploads\/2025\/11\/baidu-ernie-multimodal-ai-benchmarks-gemini-gpt-artificial-intelligence-development.jpg\" \/>\n\t<meta property=\"og:image:width\" content=\"2200\" \/>\n\t<meta property=\"og:image:height\" content=\"1650\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/jpeg\" \/>\n<meta name=\"author\" content=\"Ryan Daws\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:creator\" content=\"@Gadget_Ry\" \/>\n<meta name=\"twitter:site\" content=\"@ai_technews\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"Ryan Daws\" \/>\n\t<meta name=\"twitter:label2\" content=\"Estimated reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"4 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"Article\",\"@id\":\"https:\/\/www.artificialintelligence-news.com\/news\/baidu-ernie-multimodal-ai-gpt-and-gemini-benchmarks\/#article\",\"isPartOf\":{\"@id\":\"https:\/\/www.artificialintelligence-news.com\/news\/baidu-ernie-multimodal-ai-gpt-and-gemini-benchmarks\/\"},\"author\":{\"name\":\"Ryan Daws\",\"@id\":\"https:\/\/www.ianj52.sg-host.com\/#\/schema\/person\/e4d76ff18520c27fd0713eff05f814ed\"},\"headline\":\"Baidu ERNIE multimodal AI beats GPT and Gemini in 
benchmarks\",\"datePublished\":\"2025-11-12T16:09:44+00:00\",\"dateModified\":\"2025-11-12T16:09:47+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\/\/www.artificialintelligence-news.com\/news\/baidu-ernie-multimodal-ai-gpt-and-gemini-benchmarks\/\"},\"wordCount\":771,\"publisher\":{\"@id\":\"https:\/\/www.ianj52.sg-host.com\/#organization\"},\"image\":{\"@id\":\"https:\/\/www.artificialintelligence-news.com\/news\/baidu-ernie-multimodal-ai-gpt-and-gemini-benchmarks\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/www.artificialintelligence-news.com\/wp-content\/uploads\/2025\/11\/baidu-ernie-multimodal-ai-benchmarks-gemini-gpt-artificial-intelligence-development.jpg\",\"keywords\":[\"ai\",\"artificial intelligence\",\"baidu\",\"enterprise\",\"ernie\",\"models\",\"multimodal\"],\"articleSection\":[\"AI Market Trends\",\"How It Works\",\"Inside AI\",\"Multimodal AI\",\"Open-Source &amp; Democratised AI\",\"World of Work\"],\"inLanguage\":\"en-GB\"},{\"@type\":\"WebPage\",\"@id\":\"https:\/\/www.artificialintelligence-news.com\/news\/baidu-ernie-multimodal-ai-gpt-and-gemini-benchmarks\/\",\"url\":\"https:\/\/www.artificialintelligence-news.com\/news\/baidu-ernie-multimodal-ai-gpt-and-gemini-benchmarks\/\",\"name\":\"Baidu ERNIE multimodal AI beats GPT and Gemini in benchmarks\",\"isPartOf\":{\"@id\":\"https:\/\/www.ianj52.sg-host.com\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\/\/www.artificialintelligence-news.com\/news\/baidu-ernie-multimodal-ai-gpt-and-gemini-benchmarks\/#primaryimage\"},\"image\":{\"@id\":\"https:\/\/www.artificialintelligence-news.com\/news\/baidu-ernie-multimodal-ai-gpt-and-gemini-benchmarks\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/www.artificialintelligence-news.com\/wp-content\/uploads\/2025\/11\/baidu-ernie-multimodal-ai-benchmarks-gemini-gpt-artificial-intelligence-development.jpg\",\"datePublished\":\"2025-11-12T16:09:44+00:00\",\"dateModified\":\"2025-11-12T16:09:47+00:00\",\"description\":\"Baidu's latest ERNIE model, a 
super-efficient multimodal AI, is beating GPT and Gemini on some key benchmarks.\",\"breadcrumb\":{\"@id\":\"https:\/\/www.artificialintelligence-news.com\/news\/baidu-ernie-multimodal-ai-gpt-and-gemini-benchmarks\/#breadcrumb\"},\"inLanguage\":\"en-GB\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/www.artificialintelligence-news.com\/news\/baidu-ernie-multimodal-ai-gpt-and-gemini-benchmarks\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"en-GB\",\"@id\":\"https:\/\/www.artificialintelligence-news.com\/news\/baidu-ernie-multimodal-ai-gpt-and-gemini-benchmarks\/#primaryimage\",\"url\":\"https:\/\/www.artificialintelligence-news.com\/wp-content\/uploads\/2025\/11\/baidu-ernie-multimodal-ai-benchmarks-gemini-gpt-artificial-intelligence-development.jpg\",\"contentUrl\":\"https:\/\/www.artificialintelligence-news.com\/wp-content\/uploads\/2025\/11\/baidu-ernie-multimodal-ai-benchmarks-gemini-gpt-artificial-intelligence-development.jpg\",\"width\":2200,\"height\":1650,\"caption\":\"Boxing gloves as Baidu's latest ERNIE model, a super-efficient multimodal AI, is beating GPT and Gemini on key benchmarks and targets enterprise data often ignored by text-focused models.\"},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/www.artificialintelligence-news.com\/news\/baidu-ernie-multimodal-ai-gpt-and-gemini-benchmarks\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\/\/www.artificialintelligence-news.com\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Baidu ERNIE multimodal AI beats GPT and Gemini in benchmarks\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/www.ianj52.sg-host.com\/#website\",\"url\":\"https:\/\/www.ianj52.sg-host.com\/\",\"name\":\"AI News\",\"description\":\"Artificial Intelligence 
News\",\"publisher\":{\"@id\":\"https:\/\/www.ianj52.sg-host.com\/#organization\"},\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/www.ianj52.sg-host.com\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"en-GB\"},{\"@type\":\"Organization\",\"@id\":\"https:\/\/www.ianj52.sg-host.com\/#organization\",\"name\":\"AI News\",\"url\":\"https:\/\/www.ianj52.sg-host.com\/\",\"logo\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-GB\",\"@id\":\"https:\/\/www.ianj52.sg-host.com\/#\/schema\/logo\/image\/\",\"url\":\"https:\/\/www.artificialintelligence-news.com\/wp-content\/uploads\/2025\/01\/AI-News.png\",\"contentUrl\":\"https:\/\/www.artificialintelligence-news.com\/wp-content\/uploads\/2025\/01\/AI-News.png\",\"width\":1245,\"height\":238,\"caption\":\"AI News\"},\"image\":{\"@id\":\"https:\/\/www.ianj52.sg-host.com\/#\/schema\/logo\/image\/\"},\"sameAs\":[\"https:\/\/www.facebook.com\/AITechNews\/\",\"https:\/\/x.com\/ai_technews\",\"https:\/\/www.linkedin.com\/groups\/1906826\/\"]},{\"@type\":\"Person\",\"@id\":\"https:\/\/www.ianj52.sg-host.com\/#\/schema\/person\/e4d76ff18520c27fd0713eff05f814ed\",\"name\":\"Ryan Daws\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-GB\",\"@id\":\"https:\/\/www.ianj52.sg-host.com\/#\/schema\/person\/image\/6595e4680966ed61f0c4ca8bd817a57b\",\"url\":\"https:\/\/secure.gravatar.com\/avatar\/d82963cecdd93f33733ed383f2d8005d91790f3740b49a926d4c180557eec721?s=96&d=mm&r=g\",\"contentUrl\":\"https:\/\/secure.gravatar.com\/avatar\/d82963cecdd93f33733ed383f2d8005d91790f3740b49a926d4c180557eec721?s=96&d=mm&r=g\",\"caption\":\"Ryan Daws\"},\"description\":\"Ryan Daws is a senior editor at TechForge Media with over a decade of experience in weaving narratives and dissecting complex topics. 
His articles and interviews with industry leaders have earned him recognition as a key tech influencer from numerous organisations. Under his leadership, publications have been praised by analyst firms for their excellence and performance. Connect with him on X, Mastodon, Bluesky, Threads, and\/or LinkedIn.\",\"sameAs\":[\"https:\/\/twitter.com\/gadget_ry\",\"https:\/\/x.com\/Gadget_Ry\"],\"url\":\"https:\/\/www.artificialintelligence-news.com\/news\/author\/ryan\/\"}]}<\/script>\n<!-- \/ Yoast SEO Premium plugin. -->","yoast_head_json":{"title":"Baidu ERNIE multimodal AI beats GPT and Gemini in benchmarks","description":"Baidu's latest ERNIE model, a super-efficient multimodal AI, is beating GPT and Gemini on some key benchmarks.","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/www.artificialintelligence-news.com\/news\/baidu-ernie-multimodal-ai-gpt-and-gemini-benchmarks\/","og_locale":"en_GB","og_type":"article","og_title":"Baidu ERNIE multimodal AI beats GPT and Gemini in benchmarks","og_description":"Baidu's latest ERNIE model, a super-efficient multimodal AI, is beating GPT and Gemini on some key benchmarks.","og_url":"https:\/\/www.artificialintelligence-news.com\/news\/baidu-ernie-multimodal-ai-gpt-and-gemini-benchmarks\/","og_site_name":"AI News","article_publisher":"https:\/\/www.facebook.com\/AITechNews\/","article_published_time":"2025-11-12T16:09:44+00:00","article_modified_time":"2025-11-12T16:09:47+00:00","og_image":[{"width":2200,"height":1650,"url":"https:\/\/www.artificialintelligence-news.com\/wp-content\/uploads\/2025\/11\/baidu-ernie-multimodal-ai-benchmarks-gemini-gpt-artificial-intelligence-development.jpg","type":"image\/jpeg"}],"author":"Ryan Daws","twitter_card":"summary_large_image","twitter_creator":"@Gadget_Ry","twitter_site":"@ai_technews","twitter_misc":{"Written by":"Ryan Daws","Estimated reading 
time":"4 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Article","@id":"https:\/\/www.artificialintelligence-news.com\/news\/baidu-ernie-multimodal-ai-gpt-and-gemini-benchmarks\/#article","isPartOf":{"@id":"https:\/\/www.artificialintelligence-news.com\/news\/baidu-ernie-multimodal-ai-gpt-and-gemini-benchmarks\/"},"author":{"name":"Ryan Daws","@id":"https:\/\/www.ianj52.sg-host.com\/#\/schema\/person\/e4d76ff18520c27fd0713eff05f814ed"},"headline":"Baidu ERNIE multimodal AI beats GPT and Gemini in benchmarks","datePublished":"2025-11-12T16:09:44+00:00","dateModified":"2025-11-12T16:09:47+00:00","mainEntityOfPage":{"@id":"https:\/\/www.artificialintelligence-news.com\/news\/baidu-ernie-multimodal-ai-gpt-and-gemini-benchmarks\/"},"wordCount":771,"publisher":{"@id":"https:\/\/www.ianj52.sg-host.com\/#organization"},"image":{"@id":"https:\/\/www.artificialintelligence-news.com\/news\/baidu-ernie-multimodal-ai-gpt-and-gemini-benchmarks\/#primaryimage"},"thumbnailUrl":"https:\/\/www.artificialintelligence-news.com\/wp-content\/uploads\/2025\/11\/baidu-ernie-multimodal-ai-benchmarks-gemini-gpt-artificial-intelligence-development.jpg","keywords":["ai","artificial intelligence","baidu","enterprise","ernie","models","multimodal"],"articleSection":["AI Market Trends","How It Works","Inside AI","Multimodal AI","Open-Source &amp; Democratised AI","World of Work"],"inLanguage":"en-GB"},{"@type":"WebPage","@id":"https:\/\/www.artificialintelligence-news.com\/news\/baidu-ernie-multimodal-ai-gpt-and-gemini-benchmarks\/","url":"https:\/\/www.artificialintelligence-news.com\/news\/baidu-ernie-multimodal-ai-gpt-and-gemini-benchmarks\/","name":"Baidu ERNIE multimodal AI beats GPT and Gemini in 
benchmarks","isPartOf":{"@id":"https:\/\/www.ianj52.sg-host.com\/#website"},"primaryImageOfPage":{"@id":"https:\/\/www.artificialintelligence-news.com\/news\/baidu-ernie-multimodal-ai-gpt-and-gemini-benchmarks\/#primaryimage"},"image":{"@id":"https:\/\/www.artificialintelligence-news.com\/news\/baidu-ernie-multimodal-ai-gpt-and-gemini-benchmarks\/#primaryimage"},"thumbnailUrl":"https:\/\/www.artificialintelligence-news.com\/wp-content\/uploads\/2025\/11\/baidu-ernie-multimodal-ai-benchmarks-gemini-gpt-artificial-intelligence-development.jpg","datePublished":"2025-11-12T16:09:44+00:00","dateModified":"2025-11-12T16:09:47+00:00","description":"Baidu's latest ERNIE model, a super-efficient multimodal AI, is beating GPT and Gemini on some key benchmarks.","breadcrumb":{"@id":"https:\/\/www.artificialintelligence-news.com\/news\/baidu-ernie-multimodal-ai-gpt-and-gemini-benchmarks\/#breadcrumb"},"inLanguage":"en-GB","potentialAction":[{"@type":"ReadAction","target":["https:\/\/www.artificialintelligence-news.com\/news\/baidu-ernie-multimodal-ai-gpt-and-gemini-benchmarks\/"]}]},{"@type":"ImageObject","inLanguage":"en-GB","@id":"https:\/\/www.artificialintelligence-news.com\/news\/baidu-ernie-multimodal-ai-gpt-and-gemini-benchmarks\/#primaryimage","url":"https:\/\/www.artificialintelligence-news.com\/wp-content\/uploads\/2025\/11\/baidu-ernie-multimodal-ai-benchmarks-gemini-gpt-artificial-intelligence-development.jpg","contentUrl":"https:\/\/www.artificialintelligence-news.com\/wp-content\/uploads\/2025\/11\/baidu-ernie-multimodal-ai-benchmarks-gemini-gpt-artificial-intelligence-development.jpg","width":2200,"height":1650,"caption":"Boxing gloves as Baidu's latest ERNIE model, a super-efficient multimodal AI, is beating GPT and Gemini on key benchmarks and targets enterprise data often ignored by text-focused 
models."},{"@type":"BreadcrumbList","@id":"https:\/\/www.artificialintelligence-news.com\/news\/baidu-ernie-multimodal-ai-gpt-and-gemini-benchmarks\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/www.artificialintelligence-news.com\/"},{"@type":"ListItem","position":2,"name":"Baidu ERNIE multimodal AI beats GPT and Gemini in benchmarks"}]},{"@type":"WebSite","@id":"https:\/\/www.ianj52.sg-host.com\/#website","url":"https:\/\/www.ianj52.sg-host.com\/","name":"AI News","description":"Artificial Intelligence News","publisher":{"@id":"https:\/\/www.ianj52.sg-host.com\/#organization"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/www.ianj52.sg-host.com\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-GB"},{"@type":"Organization","@id":"https:\/\/www.ianj52.sg-host.com\/#organization","name":"AI News","url":"https:\/\/www.ianj52.sg-host.com\/","logo":{"@type":"ImageObject","inLanguage":"en-GB","@id":"https:\/\/www.ianj52.sg-host.com\/#\/schema\/logo\/image\/","url":"https:\/\/www.artificialintelligence-news.com\/wp-content\/uploads\/2025\/01\/AI-News.png","contentUrl":"https:\/\/www.artificialintelligence-news.com\/wp-content\/uploads\/2025\/01\/AI-News.png","width":1245,"height":238,"caption":"AI News"},"image":{"@id":"https:\/\/www.ianj52.sg-host.com\/#\/schema\/logo\/image\/"},"sameAs":["https:\/\/www.facebook.com\/AITechNews\/","https:\/\/x.com\/ai_technews","https:\/\/www.linkedin.com\/groups\/1906826\/"]},{"@type":"Person","@id":"https:\/\/www.ianj52.sg-host.com\/#\/schema\/person\/e4d76ff18520c27fd0713eff05f814ed","name":"Ryan 
Daws","image":{"@type":"ImageObject","inLanguage":"en-GB","@id":"https:\/\/www.ianj52.sg-host.com\/#\/schema\/person\/image\/6595e4680966ed61f0c4ca8bd817a57b","url":"https:\/\/secure.gravatar.com\/avatar\/d82963cecdd93f33733ed383f2d8005d91790f3740b49a926d4c180557eec721?s=96&d=mm&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/d82963cecdd93f33733ed383f2d8005d91790f3740b49a926d4c180557eec721?s=96&d=mm&r=g","caption":"Ryan Daws"},"description":"Ryan Daws is a senior editor at TechForge Media with over a decade of experience in weaving narratives and dissecting complex topics. His articles and interviews with industry leaders have earned him recognition as a key tech influencer from numerous organisations. Under his leadership, publications have been praised by analyst firms for their excellence and performance. Connect with him on X, Mastodon, Bluesky, Threads, and\/or LinkedIn.","sameAs":["https:\/\/twitter.com\/gadget_ry","https:\/\/x.com\/Gadget_Ry"],"url":"https:\/\/www.artificialintelligence-news.com\/news\/author\/ryan\/"}]}},"authors":[{"term_id":2401,"user_id":1570,"is_guest":0,"slug":"ryan","display_name":"Ryan 
Daws","avatar_url":{"url":"https:\/\/secure.gravatar.com\/avatar\/?s=96&d=mm&r=g","url2x":"https:\/\/secure.gravatar.com\/avatar\/?s=96&d=mm&r=g2x"},"0":null,"1":"","2":"","3":"","4":"","5":"","6":"","7":"","8":""}],"_links":{"self":[{"href":"https:\/\/www.artificialintelligence-news.com\/wp-json\/wp\/v2\/posts\/110526","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.artificialintelligence-news.com\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.artificialintelligence-news.com\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.artificialintelligence-news.com\/wp-json\/wp\/v2\/users\/1570"}],"replies":[{"embeddable":true,"href":"https:\/\/www.artificialintelligence-news.com\/wp-json\/wp\/v2\/comments?post=110526"}],"version-history":[{"count":0,"href":"https:\/\/www.artificialintelligence-news.com\/wp-json\/wp\/v2\/posts\/110526\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.artificialintelligence-news.com\/wp-json\/wp\/v2\/media\/110528"}],"wp:attachment":[{"href":"https:\/\/www.artificialintelligence-news.com\/wp-json\/wp\/v2\/media?parent=110526"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.artificialintelligence-news.com\/wp-json\/wp\/v2\/categories?post=110526"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.artificialintelligence-news.com\/wp-json\/wp\/v2\/tags?post=110526"},{"taxonomy":"author","embeddable":true,"href":"https:\/\/www.artificialintelligence-news.com\/wp-json\/wp\/v2\/ppma_author?post=110526"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}