Área do cabeçalho
Portal da UFC Acesso a informação da UFC Ouvidoria Conteúdo disponível em: PortuguêsEnglishEspañol

Universidade Federal do Ceará
Programa de Pós-Graduação em Estudos da Tradução

Área do conteúdo

HGP9029 Text Analysis with Python and NLTK

Text Analysis with Python and NLTK (04 créditos)


Text analysis through the computational handling of text corpora is an indispensable skill for any researcher interested in conducting research in the field of Discourse Analysis, Machine Translation, Quantitative Methods Analysis in Linguistics, etc. which intersect with Translation Studies. The most widely used platform for conducting text analysis today is the Natural Language Tool Kit (NLTK) combined with Python programming languages. Both are open access and used and implemented in the most different projects in the field of computational linguistics. Researchers and students in Translation Studies usually need to deal with text corpora, as well as perform statistical analysis and other types of manipulation of linguistic and textual data, so that the acquisition of programming skills comes to fill a gap in their training.


– To provide students with Python techniques to handle, wrangle, extract, and analyse text stored in different corpora;
–  To introduce and train students in the Natural Language Toolkit (NLTK) framework.


This course will introduce students to Python techniques necessary to access, wrangle, and analyse different text corpora using Python and NLTK.


BENGFORT, B.; BILBRO, R.; OJEDA, T. Applied Text Analysis with Python: Enabling Language-Aware Data Products with Machine Learning. O’Reilly Media, 2018. I BIRD, S.; KLEIN, E.; LOPER, E. Natural Language Processing with Python: Analyzing Text with the Natural Language Toolkit. O’Reilly Media, 2009. ISBN 978059655 GUTTAG, J. Introduction to Computation and Programming Using Python: With Application to Understanding Data. MIT Press, 2016. ISBN 9780262529624. HAMMOND, M. Python for Linguists. Cambridge University Press, 2020. ISBN 9781108493444. HARDENIYA, N. et al. Natural Language Processing: Python and NLTK. Packt Publishing, 2016. ISBN 9781787287846. HARRISON, M. Tiny Python 3.6 Notebook: Curated Examples. CreateSpace Independent Publishing Platform, 2017. ISBN 9781542883252. MATTHES, E. Python Crash Course, 2nd Edition: A Hands-On, Project-Based Introduction to Programming. No Starch Press, 2019. ISBN 9781593279295. VASILIEV, Y. Natural Language Processing with Python and spaCy: A Practical Introduction. No Starch Press, 2020. ISBN 9781718500532.

Acessar Ir para o topo