information retrieval, OCR, text classification, database management systems, web-based applications, ...