by
Cheryl M Corcoran;
Vijay A Mittal;
Carrie E Bearden;
Raquel Gur;
Kasia Hitczenko;
Zarina Bilgrami;
Aleksander Savic;
Guillermo A Cecchi;
Phillip Wolff
Human ratings of conceptual disorganization, poverty of content, referential cohesion and illogical thinking have been shown to predict psychosis onset in prospective clinical high risk (CHR) cohort studies. The potential value of linguistic biomarkers has been significantly magnified, however, by recent advances in natural language processing (NLP) and machine learning (ML). Such methodologies allow for the rapid and objective measurement of language features, many of which are not easily recognized by human raters. Here we review the key findings on language production disturbance in psychosis. We also describe recent advances in the computational methods used to analyze language data, including methods for the automatic measurement of discourse coherence, syntactic complexity, poverty of content, referential coherence, and metaphorical language. Linguistic biomarkers of psychosis risk are now undergoing cross-validation, with attention to harmonization of methods. Future directions in extended CHR networks include studies of sources of variance, and combination with other promising biomarkers of psychosis risk, such as cognitive and sensory processing impairments likely to be related to language. Implications for the broader study of social communication, including reciprocal prosody, face expression and gesture, are discussed.
Causal composition allows people to generate new causal relations by combining existing causal knowledge. We introduce a new computational model of such reasoning, the force theory, which holds that people compose causal relations by simulating the processes that join forces in the world, and compare this theory with the mental model theory (Khemlani et al., 2014) and the causal model theory (Sloman et al., 2009), which explain causal composition on the basis of mental models and structural equations, respectively. In one experiment, the force theory was uniquely able to account for people's ability to compose causal relationships from complex animations of real-world events. In three additional experiments, the force theory did as well as or better than the other two theories in explaining the causal compositions people generated from linguistically presented causal relations. Implications for causal learning and the hierarchical structure of causal knowledge are discussed.
Subtle features in people’s everyday language may harbor the signs of future mental illness. Machine learning offers an approach for the rapid and accurate extraction of these signs. Here we investigate two potential linguistic indicators of psychosis in 40 participants of the North American Prodrome Longitudinal Study. We demonstrate how the linguistic marker of semantic density can be obtained using the mathematical method of vector unpacking, a technique that decomposes the meaning of a sentence into its core ideas. We also demonstrate how the latent semantic content of an individual’s speech can be extracted by contrasting it with the contents of conversations generated on social media, here 30,000 contributors to Reddit. The results revealed that conversion to psychosis is signaled by low semantic density and talk about voices and sounds. When combined, these two variables were able to predict the conversion with 93% accuracy in the training and 90% accuracy in the holdout datasets. The results point to a larger project in which automated analyses of language are used to forecast a broad range of mental disorders well in advance of their emergence.
We use big data methods to investigate how decision-making might depend on future sightedness (that is, on how far into the future people’s thoughts about the future extend). In study 1, we establish a link between future thinking and decision-making at the population level in showing that US states with citizens having relatively far future sightedness, as reflected in their tweets, take fewer risks than citizens in states having relatively near future sightedness. In study 2, we analyze people’s tweets to confirm a connection between future sightedness and decision-making at the individual level in showing that people with long future sightedness are more likely to choose larger future rewards over smaller immediate rewards. In study 3, we show that risk taking decreases with increases in future sightedness as reflected in people’s tweets. The ability of future sightedness to predict decisions suggests that future sightedness is a relatively stable cognitive characteristic. This implication was supported in an analysis of tweets by over 38,000 people that showed that future sightedness has both state and trait characteristics (study 4). In study 5, we provide evidence for a potential mechanism by which future sightedness can affect decisions in showing that far future sightedness can make the future seem more connected to the present, as reflected in how people refer to the present, past, and future in their tweets over the course of several minutes. Our studies show how big data methods can be applied to naturalistic data to reveal underlying psychological properties and processes.