Tasks and projects carried out as part of the course "Qualitative data analysis and text mining".
Topics covered during the classes by branch:
lab_1"Regular Expression" - building regular expressions usingrelibrarylab_2"Stemming and Lemmatization" - cleaning and processing of selected textlab_3"WordCloud" - building word cloud from csv filelab_4"Tokenization and vectorization of text"lab_5"Term-Document Matrix" - operations on matrixlab_6"Visualizations" - visualization based on matrix operations (bar charts, prettytable)classification"Classification" - simple news classificationentity_matching"Distance and similarity between documents"kolokwium- testproject"Final project" - analysis and classification of coronavirus tweets
