6CCS3TSP/7CCSMTSP - Text Searching and Processing

January-March 2015, Monday 3-6pm, room S-3.20

Lecturers

Maxime Crochemore,   Email for an appointment,   Office: Strand Building, S6.33
Robert Mercas (tutorials), Email for a question

Aims

This unit is devoted to algorithms processing strings and texts efficiently. These types of algorithms are used for software design in the domains of operating systems utilities, search engines on the Internet, data retrieval systems, analysis of genetic sequences, and natural language processing, for example.

Syllabus

Lectures include the following topics:


Related textbooks

algo on strings M. Crochemore, C. Hancart, and T. Lecroq, Algorithms on Strings, Cambridge University Press, New York, 2007, 383 pages.
M. Crochemore and T. Lecroq, Text searching and indexing. In Z. Ésik, C. Martin-Vide, and V. Mitrana, editors, Recent Advances in Formal Languages and Applications, Chapter 2, pages 43-80. Springer-Verlag, 2006.

Other books

A. Apostolico and Z. Galil, editors, Pattern Matching Algorithms, Oxford University Press, New York, 1997, 377 pages.
M. Crochemore and T. Lecroq, Pattern matching and text compression algorithms, in A.B. Tucker Jr, ed., The Computer Science and Engineering Handbook, CRC Press, 2012.
M. Crochemore and W. Rytter, Jewels of Stringology, World Scientific, 2002, 310 pages.
M. Crochemore and W. Rytter, Text Algorithms, Oxford University Press, New York, 1994, 412 pages.
D. Gusfield, Algorithms on Strings, Trees, and Sequences, Cambridge University Press, New York, 1997, 534 pages.
G. Navarro and M. Raffinot, Flexible Pattern Matching in Strings, Cambridge University Press, Cambridge, 2002, 221 pages.
G.A. Stephen, String Searching Algorithms, World Scientific Publishing Co., 1994, 243 pages.


Pointers

SMART (S. Faro, T. Lecroq)

Bibliographies

Visualization of algorithms


Tutorials by Robert Mercas


Documents

Strings

Text Searching---Overview

Sequential String Matching

Table of Prefixes

Dictionary Matching

Practically efficient String Matching

Searching a list of strings

Suffix Arrays

Suffix Trees

Suffix Automata

Regular Pattern Matching

Past MSc exams

Past BSc exams


January 2011, Maxime Crochemore