CAMEL Now Supports ERNIE 5.0 & ERNIE 4.5 Turbo VL! Key Features: - ERNIE 5.0: Unified Omni-Modal Modeling: Natively integrates text, images, audio, and video into a single model architecture, enabling seamless cross-modal understanding and generation for complex, multi-sensory agent tasks. - ERNIE 5.0: Efficient MoE Architecture: Features a massive 2.4T+ parameter Mixture-of-Experts design with less than 3% active parameters per inference, delivering frontier-level performance across 40+ benchmarks while significantly reducing computational costs. - ERNIE 4.5 Turbo VL: Advanced Visual Reasoning: Introduces "Thinking with Images" capabilities, allowing the model to zoom into details and perform multi-step reasoning on complex visual data, such as chart analysis and causal relationship interpretation. - ERNIE 4.5 Turbo VL: High-Speed Efficiency: Built on a lightweight 28B parameter MoE architecture (activating only ~3B parameters), offering a perfect balance of speed and intelligence for real-time visual understanding and tool-use scenarios. This integration expands CAMEL-AI's multimodal capabilities, offering developers access to Baidu's latest flagship models for building powerful, efficient, and versatile agentic applications. Special thanks to Tao Sun for leading the implementation! Reference: https://lnkd.in/eF74Fbj3