The Chinese company Baidu has launched two new artificial intelligence models, the multimodal Ernie 4.5 and a new reasoning-focused model called X1. The company announced this, explaining that the Ernie 4.5 model has “excellent multimodal understanding capabilities. It has more advanced language skills, and its comprehension, generation, logic, and memory capabilities have been significantly improved.”
Furthermore, it has “high QE” and can easily understand internet memes and satirical cartoons, Baidu said. The Chinese tech giant, one of the first to launch a ChatGPT-style chatbot, has struggled to achieve widespread adoption for its large language model Ernie, despite claiming performance on par with OpenAI’s GPT-4, amid fierce competition. Multimodal AI systems are able to process and integrate various types of data, including text, video, images, and audio, and can convert content into these formats.
The X1 has “stronger comprehension, planning, reflection, and evolution abilities,” Baidu said, adding that it is the first deep thinking model that uses tools autonomously.