GENDER BIAS IN THE NEWS: TOPIC MODELLING AND VISUALIZATION FRAMEWORK
Keywords:
gender bias, news media, topic modelling, natural language processingAbstract
We present a topic modelling and data visualization methodology to examine genderbased disparities in news articles by topic. Existing research in topic modelling is largely focused on the text mining of closed corpora, i.e., those that include a fixed collection of composite texts. We showcase a methodology to discover topics via Latent Dirichlet Allocation, which can reliably produce human-interpretable topics over an open news corpus that continually grows with time. Our system generates topics, or distributions of keywords, for news articles on a monthly basis, to consistently detect key events and trends aligned with events in the real world.