Crawl the swissinfo page, and make the results available as a REST api.Demo
This proof of concept has been developed to crawl and index the English-language articles available on the swissinfo.ch website.
The PoC exposes a set of APIs to search for articles, and also to analyze which topics are most common among all indexed pages (= clustering).
Additional API endpoints demonstrate some strategies for search auto-completion and misspelling suggestions - both quite common features of search interfaces.
About 10 person-days
Which things could be build with this POC if you had more time:
- Integrate more third-party clustering algorithms
- Integrate Solr Semantic Knowledge Graph analysis