Open Search

Experiments on crawling for an open web search

As part of the OpenWebSearch.eu project, the Chair of Data Science is crawling parts of the World Wide Web.

For this purpose, crawler experiments are carried out under the user-agent strings OSAlphaXCrawl or hgfAlphaXCrawl/1.0.
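Site operators who want to recognise these requests in their server logs could match the User-Agent header against the two names above. The following is only a minimal sketch: the function name is hypothetical, and it assumes the crawler sends one of the listed names somewhere in its User-Agent header, possibly with a version suffix or surrounded by other tokens.

```python
# Crawler names taken from the text above; the version suffix ("/1.0") is
# deliberately left out so that other versions would match as well.
CRAWLER_NAMES = ("OSAlphaXCrawl", "hgfAlphaXCrawl")


def is_openwebsearch_crawler(user_agent: str) -> bool:
    """Return True if the User-Agent header contains one of the crawler names.

    Hypothetical helper: a case-insensitive substring match, not an
    authoritative way to verify where a request came from.
    """
    lowered = user_agent.lower()
    return any(name.lower() in lowered for name in CRAWLER_NAMES)


if __name__ == "__main__":
    for header in ("hgfAlphaXCrawl/1.0", "Mozilla/5.0 (X11; Linux x86_64)"):
        print(header, "->", is_openwebsearch_crawler(header))
```

A User-Agent header can be set freely by any client, so a match of this kind only indicates that a request claims to come from one of these experiments.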

In addition to the content, some statistical data are also collected, such as the average size of web pages, the size of the net text content of each page, and the link structure between web pages (e.g. the number of outgoing links per page).
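To illustrate what such per-page statistics might look like, here is a minimal sketch using only the Python standard library. It is not the project's actual pipeline: the class and function names are hypothetical, "net text" is assumed to mean the text outside script and style elements with whitespace collapsed, and an outgoing link is assumed to be any a element with a non-empty href attribute.

```python
from html.parser import HTMLParser


class PageStats(HTMLParser):
    """Collects text content and outgoing links from one HTML page."""

    SKIPPED = {"script", "style"}  # assumption: not counted as net text

    def __init__(self):
        super().__init__()
        self.text_parts = []
        self.links = []
        self._skip_depth = 0

    def handle_starttag(self, tag, attrs):
        if tag in self.SKIPPED:
            self._skip_depth += 1
        elif tag == "a":
            href = dict(attrs).get("href")
            if href:  # ignore anchors without a target
                self.links.append(href)

    def handle_endtag(self, tag):
        if tag in self.SKIPPED and self._skip_depth > 0:
            self._skip_depth -= 1

    def handle_data(self, data):
        if self._skip_depth == 0:
            self.text_parts.append(data)


def page_statistics(html: str) -> dict:
    """Return page size, net text size (both in UTF-8 bytes) and link count."""
    parser = PageStats()
    parser.feed(html)
    parser.close()
    # Join the text fragments and collapse runs of whitespace.
    net_text = " ".join(" ".join(parser.text_parts).split())
    return {
        "page_bytes": len(html.encode("utf-8")),
        "net_text_bytes": len(net_text.encode("utf-8")),
        "outgoing_links": len(parser.links),
    }


if __name__ == "__main__":
    sample = '<p>Hello <a href="/about">world</a></p>'
    print(page_statistics(sample))
```

Averages such as the mean page size would then follow by aggregating these per-page values over all crawled pages.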

Further details about the OpenWebSearch.eu project and the crawling activities can be found at http://www.openwebsearch.eu
