Master's Thesis · Information Studies (Data Science)
Using transfer learning to increase performance of multinomial text classification on documents of different municipalities
Master's thesis by Tobias J. Beers at the University of Amsterdam (Information Studies – Data Science, 2018).
The work investigates how transfer learning can stabilise performance when each municipality writes
in its own bureaucratic dialect.
Thesis details
This thesis explores transfer learning techniques, in particular TrAdaBoost, for multinomial text
classification on municipal document collections. Because each municipality uses different terminology
and stylistic conventions, naive models trained on one city often fail on another. The work studies how
knowledge can be transferred between domains to improve robustness while using relatively simple models
such as Naive Bayes.
Author
Tobias J. Beers
Institution
University of Amsterdam — Information Studies (Data Science)
Year
2018
Transfer learning
TrAdaBoost
Text classification
Municipal documents
Naive Bayes
PDF
Download thesis
← Back to research overview