Content-related threads in MOOC discussions have distinct linguistic features. Linguistic modeling can reliably identify content-related starting posts and replies. Most top linguistic features appear to be unrelated to the course domain. The number of views and votes threads received were not helpful for classification.