Wikidata & Wikibase – the mainstays of an open internet

Free Knowledge needs free software – because only if the tools used to collect, structure and share knowledge are openly accessible to everyone, everyone can really participate. Wikimedia Deutschland is therefore not only committed to open content, but also develops the technical foundations that make knowledge available worldwide. Whether through open interfaces, transparent development or collaborative cooperation, the software department at Wikimedia Deutschland has continued to work at full speed in 2024 to strengthen Free Knowledge as a common good.

Grafik

Wikidata is an impressively large collection of free knowledge data. Launched in 2012, it now consists of over 120 million structured items that are both human- and machine-readable. It’s not just Wikipedia that uses its data to automatically update information such as population figures or dates of birth; Wikidata also forms the backbone of numerous digital applications both inside and outside the Wikimedia world, from well-known voice assistants through platforms like Govdirectory to travel apps and games like Wikitrivia.

Knowledge to serve humanity

Wikidata is a mainstay of the open and transparent internet. Its importance was recognized with a special award in October 2025: The Digital Public Goods Alliance — an initiative supported by the United Nations — has recognized Wikidata as a digital public good. Only a few months previous, Wikipedia had been awarded the same distinction.

Wikidata shows that technology can serve people, not the other way around. It is an alternative to data monopolies and closed systems and a place where a global community works together to make knowledge openly, freely, and reliably available.
Lydia Pintscher Portfolio Lead for Wikidata at Wikimedia Deutschland

Worldwide, over 24,000 volunteers work to verify, update and expand Wikidata’s store of knowledge. In 2025, together with the development teams at Wikimedia, these volunteers have made significant contributions to shaping an internet that is more equitable, transparent and humane.

The Wikidata Embedding Project promotes fair and open AI solutions

While tech companies have been training AI systems on Wikidata’s open data for years, at Wikimedia Deutschland we’ve concentrated on funding and volunteer support for open and fair AI solutions.

A central component of this work is the Wikidata Embedding project, launched in 2024. Its goal is to make it easier, especially for open-source developers, to access Wikidata’s data and thus promote the development of innovative AI-based applications useful to the public. As part of the project, in the fall of 2025 we released a vector database, which processes Wikidata’s high-quality data and makes it directly usable in developing generative-AI applications. Above all this benefits smaller development teams and open-source initiatives, for whom such processing was often unaffordable in the past.

The launch of the Embedding Project shows that powerful AI doesn’t have to be controlled by a handful of companies — it can be created openly and collaboratively.
Philippe Saadé AI/ML Project Manager at Wikimedia Deutschland

In the first phase of the Embedding Project, approximately 23 million items from Wikidata were made available via the new vector database, with more to follow. Interest in the service is already high: since its launch, the vector database has seen between 100,000 and 150,000 accesses per month.

Community building across continents

At Wikimedia Deutschland, we have set ourselves the goal of enabling even more people to use and contribute to projects like Wikipedia and Wikidata. To that end, we intend to create easier, more equitable and accessible methods of connection and access

One example of new approaches to international cooperation is the Software Collaboration for Wikidata project, sponsored by the Arcadia Fund through June 2025. As part of this collaboration, teams of developers from Germany, Indonesia, Nigeria, Ghana and Brazil worked on developing practical tools for using the free knowledge repository with a particular emphasis on smaller language editions. Among other things, this led to the creation of the new Broomstick tool which helps editors to identify and fix issues in Wikidata.

Group photo
Participants in the Wiki Mentor Africa Hackathon in Cameroon

Another product of the Software Collaboration for Wikidata was the Wiki Mentor Africa project. This series of trainings was launched by a team from Nigeria that offered workshops and hackathons in various African countries. Through the program, many people were empowered to work with Wikidata and share their knowledge on the internet in their own language.

Award for a Software Developer from Germany

Thousands of people around the world volunteer their time to ensure that knowledge is freely accessible. The “Wikimedian of the Year” award is presented annually to recognize outstanding contributions and achievements. In 2025, the award went to Eugene Agbor Egbe, among others. A native of Cameroon, he lives in Paderborn and, as a software developer, is helping to shape the technological future of Wikimedia.

How Wikidata facilitates projects: Insights from WikidataCon

Each year, many events take place that serve to demonstrate the versatility of Wikidata’s data, such as Wikidata’s Data Reuse Days and the Wikidata and Sister Projects event. Another important gathering for the international Wikidata community is WikidataCon, where volunteers, developers, researchers and organizations who work with Wikidata gather annually to exchange ideas.

At the 2025 WikidataCon, which took place virtually at the end of October, one of the participants was a volunteer known as “Cookroach”. He presented Digitale Stolpersteine, a project he co-initiated. To date, over 116,000 memorial stones across Europe have been laid in dedication to the victims of National Socialism; the project compiles photos of the memorial stones, the biographies of those who were murdered and additional historical context, then links them all in Wikidata. This creates a unique, freely accessible digital network of remembrance.

Among the other projects presented at WikidataCon is an auditory training app, based on Wikidata lexemes, which provides support for people with difficulty hearing.

Work on the Wikibase ecosystem

In 2025, Wikimedia Deutschland also focused on the expansion of the Wikibase ecosystem. Wikibase is the software that underlies Wikidata. A diverse, decentralized network of data entities is emerging here. This network will allow tech communities, research groups, and libraries, among others, to operate their own knowledge bases along the lines of Wikidata –  instances of Wikibase – to maintain collections of specialized data independently.

One of many examples is a project by a group in Peru that uses Wikibase to document the indigenous Quechua language family and build a freely accessible dictionary. The National Library of Nigeria also uses Wikibase to make data on Nigerian editors more accessible, strengthening connections within Nigeria’s knowledge landscape.

The future is free and open

All these developments demonstrate the importance of free and open knowledge projects like Wikidata and Wikibase. Not only is the amount of accessible data continuously growing, but more and more people and initiatives are joining in on these projects. Going forward, Wikimedia Deutschland will continue providing support for these developments.

Franziska Heine, Portrait
Through open-source projects such as Wikidata, we can prevent knowledge monopolies and ensure that knowledge as a common good benefits all people in equal measure.
Franziska Heine Executive Director of Wikimedia Deutschland

Open Source Award for Wikidata

A great honor for open source and free knowledge: Lydia Pintscher was awarded the European Open Source Award in January 2025! As one of the driving forces behind Wikidata, she has played a key role in creating an open and interconnected database for the world. In this interview, she discusses the significance of Wikidata, the role of open source, and why this award is also an important milestone for the community.