Structure for separating index and services
Separating the index and the servicesallows for a multitude of services, whether existing as search engines or otherwise, to be run on a shared infrastructure. The figure shows how the public infrastructure is responsible for crawling the web, for indexing its content, and for providing an interface and application programming interface (API) to the services that are built upon the index. While services are allowed to do their own further indexing to prepare documents, some advanced indexing is also provided by the open infrastructure. Furthermore, as modern search engines rely heavily on usage data, these data (most prominently search queries routed to the index) are collected and made available for reuse.
Some possible applications
A web index puts unstructured and heterogeneous data of the Internet into a structured shape. This enables setting up Internet services, which process this data for different kinds of usage.A web index puts unstructured and heterogeneous data of the Internet into a structured shape. This enables setting up Internet services, which process this data for different kinds of usage.
N.B.: This list of ideas is far from being complete and only serves illustrative purposes.
– Web Search
– Vertical Search, e.g., video or scholarly content
Science / Research
– Trend analysis, e.g., political trends
– Language use on the Web
– Research evaluation, e.g., Altmetrics
– Data aggregation, e.g., company or person dossiers
– Opinion mining
– Market research
The OWI could build the foundation for large-scale AI applications, e.g.,
– Maschine translation
– Question answering
Who we are
We are a team of activists acting under the umbrella of the SUMA-EV, a nonprofit organisation to help secure free access to the knowledge and information embodied in the Internet.