Master's Thesis · Information Studies (Data Science)

Using transfer learning to increase performance of multinomial text classification on documents of different municipalities

Master's thesis by Tobias J. Beers at the University of Amsterdam (Information Studies – Data Science, 2018). The work investigates how transfer learning can stabilise performance when each municipality writes in its own bureaucratic dialect.

Thesis details

This thesis explores transfer learning techniques, in particular TrAdaBoost, for multinomial text classification on municipal document collections. Because each municipality uses different terminology and stylistic conventions, naive models trained on one city often fail on another. The work studies how knowledge can be transferred between domains to improve robustness while using relatively simple models such as Naive Bayes.

Author Tobias J. Beers
Institution University of Amsterdam — Information Studies (Data Science)
Year 2018
Transfer learning TrAdaBoost Text classification Municipal documents Naive Bayes
PDF Download thesis
← Back to research overview