Text Mining

Extract information from rough texts

Computer programs have made great advances in interpreting the meaning of natural language. For example, sentiment detection understands the mood of a comment's author. Named entity recognition identifies the most important concepts in a text, for example to add as tags. A large number of texts such as emails or articles can be classified into different categories based on example data. They can also be clustered to reveal the most common topics. Some relatively standardized types of texts can also generated automatically.

Emanuelle Panzeri
Open Source Contributor & Maintainer, Software Developer
Thomas Ebermann
Lead Data Services
Sabine Maennel
Software Engineer

Prototypes and Proof of Concepts

Experimental applications that we build to prove feasibility or to test user acceptance.

Inspiration

Collection of innovative ideas and best practices which inspire us.

Tools and Frameworks

A curated collection of tools and frameworks relevant for Text Mining systems. More under http://datasciencestack.liip.ch
Python NLTK
Lib for text processing in Python
Spacy
Industrial strength NLP with Python
Pattern
Pattern recognition package in Go lang
Lexalitics
NLP as a Service
OpenCalais
Named Entities from Text
Cortical
Retina: an Saas performing complex NLP operations
CoreNLP
Integrated NLP Toolkit
OpenNLP
Open NLP Toolkit from Apache with bindings
Monkeylearn
NLP as a Service
SAGA
GATE Sentiment plugin
SEAS
GATE Sentiment plugin
GATE
General architecture for text engineering
Comprehend
NLP from Amazon Saas
Azure Text...
Microsoft Text Analytics
TextBlob
Simplified text processing
Prodigy
Make training and anotation super simple for texts
LDAjs
Topic Modeling
FuzzyWuzzy
Fuzzy Stringmatching in Ruby
Texacy
Highlevel Spacy
Natural.js
General Language utilities for node
Gensim
Topic Modeling
TM
Text mining for R
Google NLP
NLP Deep learning Models from google
Aylen
Full Service NLP
Thinc
AssemblyAI
Customizable Speech recognition
DeepSpeech
Voice Recognition without the Middle Man
Arria
Natural Language Generation
Spark NLP
Production ready NLP
Tone
IMBs tone analyzer
Einstein
NLP and image recognition on heroku
Dandelion
NLP Saas
Vader
Sentiment analyzer