Close
  • Latest News
  • Artificial Intelligence
  • Video
  • Big Data and Analytics
  • Cloud
  • Networking
  • Cybersecurity
  • Applications
  • IT Management
  • Storage
  • Sponsored
  • Mobile
  • Small Business
  • Development
  • Database
  • Servers
  • Android
  • Apple
  • Innovation
  • Blogs
  • PC Hardware
  • Reviews
  • Search Engines
  • Virtualization
Read Down
Sign in
Close
Welcome!Log into your account
Forgot your password?
Read Down
Password recovery
Recover your password
Close
Search
Logo
Subscribe
Logo
  • Latest News
  • Artificial Intelligence
  • Video
  • Big Data and Analytics
  • Cloud
  • Networking
  • Cybersecurity
  • Applications
  • IT Management
  • Storage
  • Sponsored
  • Mobile
  • Small Business
  • Development
  • Database
  • Servers
  • Android
  • Apple
  • Innovation
  • Blogs
  • PC Hardware
  • Reviews
  • Search Engines
  • Virtualization
More
    Subscribe
    Home Latest News

      Revolutionary LLM Marco-o1 By Alibaba Achieves 6% Accuracy Boost In Mathematical Problem-Solving Tests

      Written by

      Aminu Abdullahi
      Published December 14, 2024
      Share
      Facebook
      Twitter
      Linkedin
        Alibaba homepage.
        Alibaba's homepage.

        eWEEK content and product recommendations are editorially independent. We may make money when you click on links to our partners. Learn More.

        Alibaba has announced Marco-o1, an advanced large language model (LLM) designed to tackle open-ended problem-solving and complex reasoning tasks. Developed by the MarcoPolo Team under Alibaba International Digital Commerce, Marco-o1 is poised to rival OpenAI’s o1 model, offering enhancements in reasoning, translation, and problem-solving across multiple domains.

        The Alibaba Difference

        Unlike traditional AI models that excel in structured tasks such as coding and mathematics, Marco-o1 focuses on open-ended problems, where definitive answers and clear evaluation metrics are often absent. 

        “Currently, OpenAI o1 sparks a surge of interest in the study of large reasoning models (LRM),” according to an excerpt from the report. “Building on this momentum, Marco-o1 not only focuses on disciplines with standard answers, such as mathematics, physics, and coding—which are well-suited for reinforcement learning (RL)—but also places greater emphasis on open-ended resolutions. We aim to address the question, ‘Can the o1 model effectively generalize to broader domains where clear standards are absent and rewards are challenging to quantify?’”

        This leap is achieved through a combination of advanced methodologies:

        • Chain-of-Thought (CoT) Fine-Tuning: Enables step-by-step reasoning by explicitly mapping the thought process.
        • Monte Carlo Tree Search (MCTS): Explores multiple reasoning paths, using confidence scores to guide the model toward optimal solutions.
        • Reasoning Action Strategies: Dynamically adjusts the granularity of decision-making, refining the model’s approach to nuanced problems.

        Marco-o1 is built on the Qwen2-7B-Instruct architecture and trained using a blend of open-source CoT data and proprietary synthetic datasets. These technologies empower Marco-o1 to address tasks ranging from abstract reasoning to multilingual translations.

        Performance Highlights

        Marco-o1 demonstrated significant advancements in reasoning and translation benchmarks, including +6.17 percent accuracy on the MGSM (English) dataset, a rigorous test of reasoning capabilities; +5.60 percent accuracy on the MGSM (Chinese) dataset, showcasing multilingual proficiency; and mastery in machine translation, evidenced by its ability to interpret cultural nuances in slang. 

        In a nod to open innovation, Alibaba has made Marco-o1 freely available on platforms like GitHub and Hugging Face, inviting researchers and developers to explore and enhance its capabilities.

        Alibaba’s announcement comes on the heels of similar innovations, such as the DeepSeek-R1-Lite-Preview model launched by China’s DeepSeek lab, and it directly challenges OpenAI’s o1 model, which has been celebrated for excelling in logical and mathematical reasoning, demonstrated by its outstanding performance on platforms like AIME and CodeForces. Marco-o1, however, aims to go a step further by generalizing its reasoning capabilities to ambiguous, real-world domains.

        Learn more about some of the other top companies reshaping the world of AI technology.

        Aminu Abdullahi
        Aminu Abdullahi
        Aminu Abdullahi is an experienced B2B technology and finance writer and award-winning public speaker. He is the co-author of the e-book, The Ultimate Creativity Playbook, and has written for various publications, including TechRepublic, eWEEK, Enterprise Networking Planet, eSecurity Planet, CIO Insight, Enterprise Storage Forum, IT Business Edge, Webopedia, Software Pundit, Geekflare and more.

        Get the Free Newsletter!

        Subscribe to Daily Tech Insider for top news, trends & analysis

        Get the Free Newsletter!

        Subscribe to Daily Tech Insider for top news, trends & analysis

        MOST POPULAR ARTICLES

        Artificial Intelligence

        9 Best AI 3D Generators You Need...

        Sam Rinko - June 25, 2024 0
        AI 3D Generators are powerful tools for many different industries. Discover the best AI 3D Generators, and learn which is best for your specific use case.
        Read more
        Cloud

        RingCentral Expands Its Collaboration Platform

        Zeus Kerravala - November 22, 2023 0
        RingCentral adds AI-enabled contact center and hybrid event products to its suite of collaboration services.
        Read more
        Artificial Intelligence

        8 Best AI Data Analytics Software &...

        Aminu Abdullahi - January 18, 2024 0
        Learn the top AI data analytics software to use. Compare AI data analytics solutions & features to make the best choice for your business.
        Read more
        Latest News

        Zeus Kerravala on Networking: Multicloud, 5G, and...

        James Maguire - December 16, 2022 0
        I spoke with Zeus Kerravala, industry analyst at ZK Research, about the rapid changes in enterprise networking, as tech advances and digital transformation prompt...
        Read more
        Video

        Datadog President Amit Agarwal on Trends in...

        James Maguire - November 11, 2022 0
        I spoke with Amit Agarwal, President of Datadog, about infrastructure observability, from current trends to key challenges to the future of this rapidly growing...
        Read more
        Logo

        eWeek has the latest technology news and analysis, buying guides, and product reviews for IT professionals and technology buyers. The site’s focus is on innovative solutions and covering in-depth technical content. eWeek stays on the cutting edge of technology news and IT trends through interviews and expert analysis. Gain insight from top innovators and thought leaders in the fields of IT, business, enterprise software, startups, and more.

        Facebook
        Linkedin
        RSS
        Twitter
        Youtube

        Advertisers

        Advertise with TechnologyAdvice on eWeek and our other IT-focused platforms.

        Advertise with Us

        Menu

        • About eWeek
        • Subscribe to our Newsletter
        • Latest News

        Our Brands

        • Privacy Policy
        • Terms
        • About
        • Contact
        • Advertise
        • Sitemap
        • California – Do Not Sell My Information

        Property of TechnologyAdvice.
        © 2024 TechnologyAdvice. All Rights Reserved

        Advertiser Disclosure: Some of the products that appear on this site are from companies from which TechnologyAdvice receives compensation. This compensation may impact how and where products appear on this site including, for example, the order in which they appear. TechnologyAdvice does not include all companies or all types of products available in the marketplace.