dotlah! dotlah!
  • Cities
  • Technology
  • Business
  • Politics
  • Society
  • Science
  • About
Social Links
  • zedreviews.com
  • citi.io
  • aster.cloud
  • liwaiwai.com
  • guzz.co.uk
  • atinatin.com
0 Likes
0 Followers
0 Subscribers
dotlah!
  • Cities
  • Technology
  • Business
  • Politics
  • Society
  • Science
  • About
  • Artificial Intelligence
  • Technology

IBM contributes key open-source projects to Linux Foundation to advance AI community participation

  • March 22, 2025
Total
0
Shares
0
0
0

IBM is contributing 3 open-source projects—Docling, Data Prep Kit and BeeAI—to the Linux Foundation. This move signals not only the potential growth of these projects but also IBM’s ongoing commitment to open-source AI.

“We’re continuing our long history of contributing open-source projects to ensure that they’re easy to consume and that it’s easy for others—not just us—to contribute,” says Brad Topol, IBM Distinguished Engineer and Director of Open Technologies, in an interview. Topol also chairs the Governing Board of the LF AI & Data Foundation, a group hosted under the Linux Foundation focused on advancing open-source innovation across artificial intelligence and data technologies.

Each project is focused on an essential part of the AI development stack. As the industry matures, innovation driven by the broader developer community in these areas is key to making AI enterprise ready.

Docling, which launched and open-sourced a year ago, addresses a limit that many foundation models have for enterprise use. While the models have been trained on every scrap of publicly available information, much of the data valuable to businesses lies in documents that are not accessible online: PDFs, annual reports, slide decks.

Docling streamlines the process of turning unstructured documents into JSON and Markdown files that are easy for large language models (LLMs) and other foundation models to digest.

Since its release, Docling has gained traction, earning more than 23,000 stars on GitHub. When combined with retrieval-augmented generation (RAG) techniques, Docling improves LLM outputs. “Docling can make the LLMs answer much better and much more specific to their needs,” says Topol. In addition to gaining traction in the open-source community, Docling helps power Red Hat® Enterprise Linux® AI, where it enables context aware chunking and supports the platform’s new data ingestion pipeline.

Of course, another critical step in deploying AI is data preparation. IBM’s Data Prep Kit, which was released in 2024, has also gained popularity: it helps clean, transform and enrich unstructured data for pre-training, fine-tuning and RAG use cases.

Unstructured data—such as databases, web pages and audio files which are more complex to parse and extract insights—accounts for 90% of all enterprise-generated data, according to IDC. LLMs can analyze vast amounts of unstructured data and extract relevant insights to generate and test new product or service ideas, for instance, in hours rather than months.

Data Prep Kit is designed to simplify data prep for LLM applications—currently focused on code and language models—supporting pre-training, fine-tuning and RAG use cases. Built on familiar distributed processing frameworks like Spark and Ray, it gives developers the flexibility to create custom modules that scale easily, whether running on a laptop or across an entire data center.

“We used to say, garbage in, garbage out. You definitely want good data going in,” Topol says. “This is not a glamorous project compared to some of the other parts of the LLM life cycle, but it’s incredibly critical, incredibly valuable and a definite must-have.” Data Prep Kit is beginning to power IBM offerings and is now in IBM’s TechPreview of IBM Data Integration for Unstructured Data.

Finally, as agents are gaining traction, IBM released BeeAI. BeeAI can be used by developers to discover, run and compose AI agents from any framework, including CrewAI, LangGraph, and AutoGen. The project includes the Agent Communication Protocol, which powers agent discoverability and interoperability, and the BeeAI-framework, its native framework for building agents in Python or TypeScript, optimized for open source models.

“There are other frameworks for building agents,” says Topol. “But what’s nice about BeeAI is that it provides a platform where you can also plug in agents from those other technologies. BeeAI doesn’t just work with its own agents.”

By contributing these projects to the Linux Foundation, IBM aims to expand their reach and attract new contributors and users. “The projects are in a wonderful spot where people can invest their resources. It makes a huge difference,” says Topol. “It’s like an insurance policy. The open governance also makes people feel better that if they contribute, over time, they’re going to earn their stripes through what we call meritocracy and earn a more influential role in the project. They can also feel secure that the project won’t make any drastic open-source license changes that could dramatically impede future use of the project.”

Pointing to Kubernetes—an open-source container orchestration system originally developed by Google and later donated to the Cloud Native Computing Foundation—Topol notes how its adoption surged after becoming part of an open governance model, ultimately turning it into an industry standard.

He has bold ambitions for these projects.

“An open-source project with a powerful ecosystem is, frankly, unstoppable,” he says.

Learn more about projects like Docling, Data Prep Kit and BeeAI at the IBM TechXchange Conference October 6-9, 2025, in Orlando, FL. Experts, including project committers and contributors, will be on-site for presentations, hands-on learning and networking opportunities, with more than 30 open-source projects showcased. Registration opens April 4.

By: Anabelle Nicoud (Tech Reporter, IBM)
Originally published at: IBM

Source: zedreviews.com

Total
0
Shares
Share
Tweet
Share
Share
Related Topics
  • AI
  • Artificial Intelligence
  • BeeAI
  • Data Prep Kit
  • Docling
  • IBM
  • Linux
  • Linux Foundation
  • Open Source
dotlah.com

Previous Article
PiPiPi
  • Gears

The Unexpected Pi-Fect Deals This March 14

  • March 14, 2025
View Post
Next Article
  • Lah!

Tariffs, Trump, and Other Things That Start With T – They’re Not The Problem, It’s How We Use Them

  • March 25, 2025
View Post
You May Also Like
Red Hat OpenShift
View Post
  • Artificial Intelligence
  • Technology

Red Hat Further Drives Digital Sovereignty for the AI Era with Red Hat OpenShift on Google Cloud Dedicated

  • Dean Marc
  • April 21, 2026
View Post
  • Artificial Intelligence
  • Technology

Here’s how to get the $7 trillion AI hardware buildout right

  • dotlah.com
  • April 18, 2026
totus-technologies-cover
View Post
  • Business
  • Technology
  • World Events

The Transatlantic Tech Rift and Why Data Sovereignty Is the New Industrial Imperative

  • Ackley Wyndam
  • April 16, 2026
View Post
  • Technology

Hon Hai Technology Group (Foxconn) Recognized As Top 100 Global Innovators 2026

  • Dean Marc
  • April 9, 2026
View Post
  • Artificial Intelligence
  • Technology

Kioxia Announces New SSD Model Optimized for AI GPU-Initiated Workloads

  • Dean Marc
  • March 17, 2026
View Post
  • Artificial Intelligence
  • Technology

U.S. Ski & Snowboard and Google Announce Collaboration to Build an AI-Based Athlete Performance Tool

  • Dean Marc
  • February 8, 2026
View Post
  • Artificial Intelligence
  • Technology

IBM to Support Missile Defense Agency SHIELD Contract

  • Dean Marc
  • February 5, 2026
Smartphone hero image
View Post
  • Gears
  • Technology

Zed Approves | Smartphones for Every Budget Range

  • Ackley Wyndam
  • January 29, 2026


Trending
  • 1
    • Featured
    • Gears
    Bonjour, Swifties! Paris Awaits with the Eras Tour!
    • May 11, 2024
  • 2
    • Lah!
    • Technology
    Paving The Way For UV-Enabled Flexible Wearable Tech
    • July 29, 2021
  • 3
    • Lah!
    Park Hotel Group Secures Its First Green Loan Of S$237 Million Under The UOB Real Estate Sustainable Finance Framework
    • February 26, 2020
  • 4
    • Lah!
    • Technology
    Scientific Research Shows Customised Innovations Can Reduce The Risk Of COVID-19 Transmission
    • February 28, 2021
  • x.ai - Understand the universe 5
    • People
    • Technology
    Elon Musk’s New xAI Company Launches To ‘Understand The True Nature Of The Universe’
    • July 13, 2023
  • usa-flag-jason-leung-mmth0KV0oFQ-unsplash 6
    • Cities
    2022’s Most Independent States In America
    • July 5, 2022
  • remote-working-yasmina-h-p8DjPfqEhW0-unsplash 7
    • Features
    • People
    Should You Move If You Work Remotely?
    • October 6, 2021
  • 8
    • Cities
    • Lah!
    First Look: CapitaSpring, Singapore’s Newest Skyscraper
    • February 2, 2021
  • goswifties-taylor-swift-travis-kelce-fathers-daughters-super-bowl-bonding 9
    • Featured
    • People
    How Taylor Swift Unexpectedly Brought Fathers and Daughters Together Through Football
    • February 11, 2024
  • 10
    • Lah!
    Frasers Property Retail And Frasers Centrepoint Trust To Provide Tenants With Additional S$45 Million In Rental Rebates
    • April 1, 2020
  • 11
    • Technology
    ST Engineering Unveils Enabling Technologies And Innovations At Singapore Airshow 2020
    • February 10, 2020
  • 12
    • Cities
    • Lah!
    • Society
    Cashiers Need To Be Compensated With Wage Premium To Handle Cash Payments: NUS Study
    • August 27, 2021
Trending
  • Red Hat OpenShift 1
    Red Hat Further Drives Digital Sovereignty for the AI Era with Red Hat OpenShift on Google Cloud Dedicated
    • April 21, 2026
  • Illustration of data storage 2
    The Splinternet Comes for European Supply Chains Why Fragmentation Is Now a Boardroom Problem
    • April 21, 2026
  • 3
    Here’s how to get the $7 trillion AI hardware buildout right
    • April 18, 2026
  • totus-technologies-cover 4
    The Transatlantic Tech Rift and Why Data Sovereignty Is the New Industrial Imperative
    • April 16, 2026
  • 5
    What will it take to get ships going through the Strait of Hormuz again?
    • April 13, 2026
  • 6
    Hon Hai Technology Group (Foxconn) Recognized As Top 100 Global Innovators 2026
    • April 9, 2026
  • 7
    3 lessons on the energy transition in an age of crisis
    • April 7, 2026
  • 8
    Samsung Unveils Galaxy A57 5G and Galaxy A37 5G, Packing Pro-Level Features at Awesome Price
    • March 25, 2026
  • 9
    The global price tag of war in the Middle East
    • March 24, 2026
  • 10
    Kioxia Announces New SSD Model Optimized for AI GPU-Initiated Workloads
    • March 17, 2026
Social Links
dotlah! dotlah!
  • Cities
  • Technology
  • Business
  • Politics
  • Society
  • Science
  • About
Connecting Dots Across Asia's Tech and Urban Landscape

Input your search keywords and press Enter.