We would like to build a user-friendly interface where users can import documents in .txt format and run a topic modeling analysis based on Latent Dirichlet Allocation (LDA). Users should be able to change the following parameters: number of topics, number of words shown per topic in the output (please refer to the attached image), alpha, and beta.
We need two outputs: the distribution over topics (on the left side of the attached picture) and a list of words grouped by topic (on the right side; the topic title in quotation marks is not required).
We would like to use the gensim package. Please check the following website for more information: [url removed, login to view]
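To make the request more concrete, here is a rough sketch of the analysis step we have in mind, without the interface. It is only an illustration under some assumptions: the helper names (`tokenize`, `load_txt_documents`, `run_lda`), the simple regex tokenizer, the default values, and the `passes` and `seed` settings are ours and open to change. Note that gensim calls the beta parameter `eta`. We also assume that "distribution over topics" means one distribution per document.

```python
import re
from pathlib import Path


def tokenize(text):
    # Assumption: lowercase ASCII words of 2+ letters; no stop-word removal.
    return re.findall(r"[a-z]{2,}", text.lower())


def load_txt_documents(folder):
    # Read every .txt file in the folder (sorted by name) and tokenize it.
    paths = sorted(Path(folder).glob("*.txt"))
    return [tokenize(p.read_text(encoding="utf-8", errors="ignore")) for p in paths]


def run_lda(texts, num_topics=5, words_per_topic=10,
            alpha="symmetric", beta=None, seed=0):
    # gensim is imported here so the helpers above work without it installed.
    from gensim import corpora
    from gensim.models import LdaModel

    dictionary = corpora.Dictionary(texts)
    corpus = [dictionary.doc2bow(tokens) for tokens in texts]

    lda = LdaModel(
        corpus=corpus,
        id2word=dictionary,
        num_topics=num_topics,
        alpha=alpha,        # document-topic prior
        eta=beta,           # topic-word prior ("beta" in this brief)
        passes=10,          # assumption: a few passes for small corpora
        random_state=seed,  # fixed seed so repeated runs are comparable
    )

    # Output 1: topic distribution for each document, as (topic_id, probability) pairs.
    doc_topics = [lda.get_document_topics(bow, minimum_probability=0.0)
                  for bow in corpus]

    # Output 2: the top words for each topic.
    topic_words = {k: [word for word, _ in lda.show_topic(k, topn=words_per_topic)]
                   for k in range(num_topics)}
    return doc_topics, topic_words
```

A call could look like `doc_topics, topic_words = run_lda(load_txt_documents("my_docs"), num_topics=4, words_per_topic=8, alpha=0.1, beta=0.01)`, where `my_docs` is a hypothetical folder name. The interface would then only need to collect these four parameters and display the two returned outputs.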
For further questions, please contact us!