Topic Modelling Experiment

Ryan Nichols and Ted Slingerland, both longtime readers of this blog, write with an invitation to blog readers to help them out by participating in an experiment. Read on!

Dear Warp, Weft, and Way users,

As affiliates of the University of British Columbia’s Cultural Evolution of Religion Consortium (CERC), we write to invite Warp, Weft, & Way users with some training in classical Chinese to participate in an experiment. Several years ago we embarked on a project to use quantitative methods of analysis, including statistical testing and unsupervised data mining, in order to gain new insights into classical Chinese texts. Our corpus, drawn from Donald Sturgeon’s ctext.org, a resource we all know and enjoy, contains over 5 million characters from texts that date from pre-Warring States through the Tang.

After many pilot studies and preliminary testing, we generated a topic model for our classical Chinese corpus. In other words, we have compiled the corpus using an algorithm that allocates statistical probabilities to the associations between characters in the texts. One of several results of this process is the generation of 100 lists of characters that appear statistically closely related to one another. (Other results include data on how a given list of characters, or ‘topic’, is represented in each text in our corpus, for example.)

In contrast to much computer modeling and text mining on humanistic sources, we do not want to stop with our algorithm’s outputs. Rather, we want to seek the opinions of experts in an effort to assist us in fixing the meanings of these lists of terms. For this reason we have converted each of 100 topics produced by our model into a word cloud containing the ten characters in each topic with the highest statistical loading. We hope Warp, Weft, and Way users will find the potential of such techniques intriguing enough to participate in our experiment by using about 15-20 minutes of your time as experts on Chinese thought. After clicking the link below, you will be taken to a consent form and will then be asked to view a random assortment of 15 word clouds. After each word cloud, you will be asked a handful of brief questions, almost all multiple choice, about its content. At the end of this survey, we will collect some standard demographic information from you necessary for statistical analyses. Here is the link to the survey:

https://csufedu.qualtrics.com/SE/?SID=SV_42gaPogxqF0EoSN

Thank for your participation. This would not be possible without you.

Best,

Ryan & Ted

3 replies on “Topic Modelling Experiment”

I meant to add that we will be sharing all the results of our research with the public, including participants of course, when it is complete.

I’ve now done this, and it was kind of like playing a game — I was actually a bit disappointed when I got the the end and couldn’t play anymore. (Maybe I’ll do it again!)

I enjoyed it too!

https://thezenithblog.wordpress.com/2015/02/25/the-proverbs-of-sun-wu/

Leave a Reply Cancel reply

Ryan says:

February 25, 2015 at 6:34 pm

I meant to add that we will be sharing all the results of our research with the public, including participants of course, when it is complete.

Steve Angle says:

February 26, 2015 at 8:25 am

I’ve now done this, and it was kind of like playing a game — I was actually a bit disappointed when I got the the end and couldn’t play anymore. (Maybe I’ll do it again!)

JohnnyZenith says:

February 26, 2015 at 12:04 pm

I enjoyed it too!

https://thezenithblog.wordpress.com/2015/02/25/the-proverbs-of-sun-wu/

This site uses Akismet to reduce spam. Learn how your comment data is processed.

Jonah Dunch on State of the Field(s)?: “As a current philosophy MA student, I’m curious as to whether the relative distribution has changed over time. Was there…” Apr 13, 09:02

Bin Song on State of the Field(s)?: “Thank you for your significant contribution to the academy, Steve! This is truly a substantial effort, with much work involved.…” Apr 13, 05:57

Justin Tiwald on New Book: Zhuangzi: A New Translation of the Sayings of Master Zhuang as Interpreted by Guo Xiang: “A minor error: “Once Zhuang Zhou dreamt that he was a butterfly, a butterfly happy as can be, and was…” Apr 12, 15:49

J. Williams on New Book: Zhuangzi: A New Translation of the Sayings of Master Zhuang as Interpreted by Guo Xiang: “One further thought: It is mind-boggling that others were able to read, digest, and write reviews on the work so…” Apr 4, 01:41

J. Williams on New Book: Zhuangzi: A New Translation of the Sayings of Master Zhuang as Interpreted by Guo Xiang: “A follow up (“Worth the consideration of those to whom it may prove worth considering”): I have finished reading most…” Apr 4, 00:25

Warp, Weft, and Way

Chinese and Comparative Philosophy 中國哲學與比較哲學

Topic Modelling Experiment

February 24, 2015

3 replies on “Topic Modelling Experiment”

Leave a Reply Cancel reply