Thank you for visiting nature.com. You are using a browser version with limited support for CSS. To obtain the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles and JavaScript.

  • View all journals
  • Explore content
  • About the journal
  • Publish with us
  • Sign up for alerts
  • Review Article
  • Published: 12 January 2022

A review of theories and methods in the science of face-to-face social interaction

  • Lauren V. Hadley   ORCID: orcid.org/0000-0001-6587-3644 1 ,
  • Graham Naylor   ORCID: orcid.org/0000-0003-1544-1944 1 &
  • Antonia F. de C. Hamilton   ORCID: orcid.org/0000-0001-8000-0219 2  

Nature Reviews Psychology volume  1 ,  pages 42–54 ( 2022 ) Cite this article

12k Accesses

16 Citations

54 Altmetric

Metrics details

  • Social behaviour

For most of human history, face-to-face interactions have been the primary and most fundamental way to build social relationships, and even in the digital era they remain the basis of our closest bonds. These interactions are built on the dynamic integration and coordination of verbal and non-verbal information between multiple people. However, the psychological processes underlying face-to-face interaction remain difficult to study. In this Review, we discuss three ways the multimodal phenomena underlying face-to-face social interaction can be organized to provide a solid basis for theory development. Next, we review three types of theory of social interaction: theories that focus on the social meaning of actions, theories that explain actions in terms of simple behaviour rules and theories that rely on rich cognitive models of the internal states of others. Finally, we address how different methods can be used to distinguish between theories, showcasing new approaches and outlining important directions for future research. Advances in how face-to-face social interaction can be studied, combined with a renewed focus on cognitive theories, could lead to a renaissance in social interaction research and advance scientific understanding of face-to-face interaction and its underlying cognitive foundations.

This is a preview of subscription content, access via your institution

Access options

Subscribe to this journal

Receive 12 digital issues and online access to articles

55,14 € per year

only 4,60 € per issue

Buy this article

  • Purchase on Springer Link
  • Instant access to full article PDF

Prices may be subject to local taxes which are calculated during checkout

social interaction theory research paper

Similar content being viewed by others

social interaction theory research paper

Loneliness trajectories over three decades are associated with conspiracist worldviews in midlife

social interaction theory research paper

A systematic review and multivariate meta-analysis of the physical and mental health benefits of touch interventions

social interaction theory research paper

The development of human causal learning and reasoning

Sparks, A. Tomorrow is Another Country: the Inside Story of South Africa’s Road to Change (University of Chicago Press, 1996).

Argyle, M. Social Interaction: Process and Products (Routledge, 2017).

Hartley, P. Group Communication (Routledge, 2006).

Whittaker, S., Terveen, L., Hill, W. & Cherny, L. in From Usenet to CoWebs. Computer Supported Cooperative Work (eds Lueg, C. & Fisher, D.) 79–91 (Springer, 2003).

Mondada, L. The multimodal interactional organization of tasting: practices of tasting cheese in gourmet shops. Discourse Stud. 20 , 743–769 (2018).

Google Scholar  

Stepanyan, K., Borau, K. & Ullrich, C. in 10th IEEE Int. Conf. Advanced Learning Technol . 70–72 (IEEE, 2010).

Eggins, S. & Slade, D. Analysing Casual Conversation (Equinox, 2004).

Heydon, G. The Language of Police Interviewing (Palgrave Macmillan, 2005).

Tschacher, W., Rees, G. & Ramseyer, F. Nonverbal synchrony and affect in dyadic interactions. Front. Psychol. 5 , 1323 (2014).

PubMed   PubMed Central   Google Scholar  

Remland, M. S. Nonverbal Communication in Everyday Life (Sage, 2016).

Guerrero, L. K. & Floyd, K. Nonverbal Communication in Close Relationships (Routledge, 2006).

Keltner, D., Gruenfeld, D. H. & Anderson, C. Power, approach, and inhibition. Psychol. Rev. 110 , 265–284 (2003).

PubMed   Google Scholar  

Hall, J. A. Nonverbal behavior, status, and gender: how do we understand their relations? Psychol. Women Q. 30 , 384–391 (2006).

Thibaut, J. W. & Kelley, H. H. The Social Psychology of Groups (Routledge, 2017).

Krakauer, J. W., Ghazanfar, A. A., Gomez-Marin, A., MacIver, M. A. & Poeppel, D. Neuroscience needs behavior: correcting a reductionist bias. Neuron 93 , 480–490 (2017).

Diamond, A. & Lee, K. Interventions shown to aid executive function development in children 4 to 12 years old. Science 333 , 959–964 (2011).

Ramseyer, F. & Tschacher, W. Nonverbal synchrony in psychotherapy: coordinated body movement reflects relationship quality and outcome. J. Consult. Clin. Psychol. 79 , 284–295 (2011).

Cuddy, A. Presence: Bringing your Boldest Self to Your Biggest Challenges (Hachette UK, 2015).

Friedman, H. S. in Nonverbal Communication Today (ed. Key, M. R.) 57–68 (De Gruyter Mouton, 1982).

Ritschel, H., Aslan, I., Sedlbauer, D. & André, E. in Proc. 18th Int. Conf. Autonomous Agents and MultiAgent Systems 86–94 (University Augsburg, 2019).

Sebanz, N. & Knoblich, G. Progress in joint-action research. Curr. Dir. Psychol. Sci. 30 , 138–143 (2021).

Konvalinka, I., Vuust, P., Roepstorff, A. & Frith, C. Follow you, follow me: continuous mutual prediction and adaptation in joint tapping. Q. J. Exp. Psychol. 63 , 2220–2230 (2010).

Hamilton, A. F. D. C. & Lind, F. Audience effects: what can they tell us about social neuroscience, theory of mind and autism? Cult. Brain 4 , 159–177 (2016).

Pinti, P. et al. The present and future use of functional near‐infrared spectroscopy (fNIRS) for cognitive neuroscience. Ann. NY Acad. Sci. 40 , 1–25 (2018).

Hecht, M. A. & Ambady, N. Nonverbal communication and psychology: past and future. Atl. J. Commun. 7 , 156–170 (1999).

Babbie, E. R. The Practice of Social Research (Cengage Learning, 2020).

Argyle, M. & Kendon, A. in Advances in Experimental Social Psychology (ed. Berkowitz, L.) 55–98 (Academic, 1967).

Duncan, S. Some signals and rules for taking speaking turns in conversations. J. Pers. Soc. Psychol. 23 , 283–292 (1972).

Kendon, A. Some functions of gaze-direction in social interaction. Acta Psychol. 26 , 22–63 (1967).

Tickle-Degnen, L. & Rosenthal, R. The nature of rapport and its nonverbal correlates. Psychol. Inq. 1 , 285–293 (1990).

LaFrance, M. & Broadbent, M. Group rapport: posture sharing as a nonverbal indicator. Gr. Organ. Stud. 1 , 328–333 (1976).

Witkower, Z., Tracy, J. L., Cheng, J. T. & Henrich, J. Two signals of social rank: prestige and dominance are associated with distinct nonverbal displays. J. Pers. Soc. Psychol. 118 , 89–120 (2020).

Carney, D. R. The nonverbal expression of power, status, and dominance. Curr. Opin. Psychol. 33 , 256–264 (2020).

Sanborn, F. W. & Harris, R. J. A Cognitive Psychology of Mass Communication (Routledge, 2019).

Meikle, G. Social Media: Communication, Sharing and Visibility (Routledge, 2016).

Fitzpatrick, M. A. & Noller, P. Marital communication in the eighties. J. Marriage Fam. 52 , 832–843 (1990).

Heerey, E. A. Decoding the dyad: challenges in the study of individual differences in social behavior. Curr. Dir. Psychol. Sci. 24 , 285–291 (2015).

Hirvenkari, L. et al. Influence of turn-taking in a two-person conversation on the gaze of a viewer. PLoS ONE 8 , e71569 (2013).

Hazan, V. & Baker, R. Acoustic–phonetic characteristics of speech produced with communicative intent to counter adverse listening conditions. J. Acoust. Soc. Am. 130 , 2139–2152 (2011). This innovative study manipulates the acoustic environment of talker and listener separately, demonstrating the importance of communicative intent on speech adjustments .

Lee Masson, H. & Op de Beeck, H. Socio-affective touch expression database. PLoS ONE 13 , e0190921 (2018).

Tsao, D. Y. & Livingstone, M. S. Mechanisms of face perception. Annu. Rev. Neurosci. 31 , 411–437 (2008).

Sauter, D. A., Eisner, F., Ekman, P. & Scott, S. K. Cross-cultural recognition of basic emotions through nonverbal emotional vocalizations. Proc. Natl Acad. Sci. USA 107 , 2408–2412 (2010).

Cascio, C. J., Moore, D. & McGlone, F. Social touch and human development. Dev. Cogn. Neurosci. 35 , 5–11 (2019).

Poppe, R., Truong, K. P. & Heylen, D. in Int. Workshop on Intelligent Virtual Agents (eds Vilhjálmsson, H. H., Kopp, S., Marsella, S. & Thórisson K.R.) 228–239 (Springer, 2011).

Kessous, L., Castellano, G. & Caridakis, G. Multimodal emotion recognition in speech-based interaction using facial expression, body gesture and acoustic analysis. J. Multimodal User Interfaces 3 , 33–48 (2010).

Patterson, M. L. Nonverbal Behavior: A Functional Perspective (Springer Science & Business Media, 2012).

Friederici, A. D. The brain basis of language processing: from structure to function. Physiol. Rev. 91 , 1357–1392 (2011).

Clark, H. Using Language (Cambridge Univ. Press, 1996).

Botvinick, M. M., Braver, T. S., Barch, D. M., Carter, C. S. & Cohen, J. D. Conflict monitoring and cognitive control. Psychol. Rev. 108 , 624–652 (2001).

Darda, K. M. & Ramsey, R. The inhibition of automatic imitation: a meta-analysis and synthesis of fMRI studies. Neuroimage 197 , 320–329 (2019).

Volman, I., Roelofs, K., Koch, S., Verhagen, L. & Toni, I. Anterior prefrontal cortex inhibition impairs control over social emotional actions. Curr. Biol. 21 , 1766–1770 (2011).

Rizzolatti, G. & Craighero, L. The mirror-neuron system. Annu. Rev. Neurosci. 27 , 169–192 (2004).

Hamilton, A. F. D. C. in Shared Representations: Sensorimotor Foundations of Social Life (eds Obhi, S. S. & Cross, E. S.) 313–331 (Cambridge Univ. Press, 2015).

Rizzolatti, G. & Sinigaglia, C. The functional role of the parieto-frontal mirror circuit: interpretations and misinterpretations. Nat. Rev. Neurosci. 11 , 264–274 (2010).

Caspers, S., Zilles, K., Laird, A. R. & Eickhoff, S. B. ALE meta-analysis of action observation and imitation in the human brain. NeuroImage 50 , 1148–1167 (2010).

Chartrand, T. L. & Bargh, J. A. The chameleon effect: the perception–behavior link and social interaction. J. Pers. Soc. Psychol. 76 , 893–910 (1999).

Heyes, C., Bird, G., Johnson, H. & Haggard, P. Experience modulates automatic imitation. Cogn. Brain Res. 22 , 233–240 (2005).

Adank, P., Nuttall, H., Bekkering, H. & Maegherman, G. Effects of stimulus response compatibility on covert imitation of vowels. Attent. Percept. Psychophys. 80 , 1290–1299 (2018).

Polyanskaya, L., Samuel, A. G. & Ordin, M. Speech rhythm convergence as a social coalition signal. Evol. Psychol. https://doi.org/10.1177/1474704919879335 (2019).

Article   PubMed   Google Scholar  

Menenti, L., Pickering, M. J. & Garrod, S. Toward a neural basis of interactive alignment in conversation. Front. Hum. Neurosci. 6 , 185 (2012).

Heyes, C. What’s social about social learning? J. Comp. Psychol. 126 , 193–202 (2011).

Virhia, J., Kotz, S. A. & Adank, P. Emotional state dependence facilitates automatic imitation of visual speech. Q. J. Exp. Psychol. 72 , 2833–2847 (2019).

Kuhbandner, C., Pekrun, R. & Maier, M. A. The role of positive and negative affect in the “mirroring” of other persons’ actions. Cogn. Emot. 24 , 1182–1190 (2010).

Smith, J. M. & Harper, D. Animal Signals (Oxford Univ. Press, 2003).

Csibra, G. in Sensorimotor Foundations of Higher Cognition: Attention and Performance (eds Haggard, P., Rosetti, Y. & Kawato., M.) 461–479 (Oxford Univ. Press, 2008).

Seyfarth, R. M., Cheney, D. L. & Marler, P. Vervet monkey alarm calls: semantic communication in a free-ranging primate. Anim. Behav. 28 , 1070–1094 (1980).

Burgoon, J. K. Relational message interpretations of touch, conversational distance, and posture. J. Nonverbal Behav. 15 , 233–259 (1991).

Argyle, M. Bodily Communication (International Universities Press, 1975).

Ekman, P., Sorenson, E. R. & Friesen, W. V. Pan-cultural elements in facial displays of emotion. Science 164 , 86–88 (1969).

Henley, N. Body Politics: Power, Sex, and Nonverbal Communication (Prentice Hall, 1977).

Henley, N. M. in Gender, Power, and Communication in Human Relationships (eds Kalbfleisch, P. J. & Cody, M. J.) 27–61 (Psychology Press, 1995).

Kampe, K. K., Frith, C. D. & Frith, U. “Hey John”: signals conveying communicative intention toward the self activate brain regions associated with “mentalizing,” regardless of modality. J. Neurosci. 23 , 5258–5263 (2003). This work is one of the few functional MRI studies to examine social signals across different modalities, showing common activation in the prefrontal cortex for different communicative signals .

Frith, C. & Frith, U. Theory of mind. Curr. Biol. 15 , R644–R645 (2005).

Amodio, D. M. & Frith, C. D. Meeting of minds: the medial frontal cortex and social cognition. Nat. Rev. Neurosci. 7 , 268–277 (2006).

Hagoort, P. & Levinson, S. C. in The Cognitive Neurosciences 5th edn (eds Gazzaniga, M. S. & Mangun, G. R.) 667–674 (MIT Press, 2014).

Van Berkum, J. J. in Semantics and Pragmatics: From Experiment to Theory (eds Sauerland, U. & Yatsushiro, K.) 276–316 (Palgrave Macmillan, 2009).

Stegmann, U. Animal Communication Theory: Information and Influence (Cambridge Univ. Press, 2013).

Seyfarth, R. M., Cheney, D. L. & Marler, P. Monkey responses to three different alarm calls: evidence of predator classification and semantic communication. Science 210 , 801–803 (1980).

Prat, Y., Taub, M. & Yovel, Y. Everyday bat vocalizations contain information about emitter, addressee, context, and behavior. Sci. Rep. 6 , 1–10 (2016). This paper shows how large datasets and machine learning approaches can help us understand the social meanings of animal communications .

Mehu, M. & Scherer, K. R. A psycho-ethological approach to social signal processing. Cogn. Process. 13 , 397–414 (2012).

Burgoon, J. K., Magnenat-Thalmann, N., Pantic, M. & Vinciarelli, A. Social Signal Processing (Cambridge Univ. Press, 2017).

Remland, M. S. Leadership impressions and nonverbal communication in a superior–subordinate interaction. Commun. Q. 32 , 41–48 (1984).

Burgoon, J. K. & Newton, D. A. Applying a social meaning model to relational message interpretations of conversational involvement: comparing observer and participant perspectives. South. J. Commun. 56 , 96–113 (1991).

Hall, J. A., Coats, E. J. & LeBeau, L. S. Nonverbal behavior and the vertical dimension of social relations: a meta-analysis. Psychol. Bull. 131 , 898–924 (2005).

Hessels, R. S. How does gaze to faces support face-to-face interaction? A review and perspective. Psychon. Bull. Rev. 27 , 856–881 (2020).

Kendrick, K. H. & Holler, J. Gaze direction signals response preference in conversation. Res. Lang. Soc. Interact. 50 , 12–32 (2017).

Burgoon, J. K. & Walther, J. B. Nonverbal expectancies and the evaluative consequences of violations. Hum. Commun. Res. 17 , 232–265 (1990).

Burgoon, J. K., Coker, D. A. & Coker, R. A. Communicative effects of gaze behavior: a test of two contrasting explanations. Hum. Commun. Res. 12 , 495–524 (1986).

Burgoon, J. K. A communication model of personal space violations: explication and an initial test. Hum. Commun. Res. 4 , 129–142 (1978).

Cappella, J. N. & Greene, J. O. A discrepancy‐arousal explanation of mutual influence in expressive behavior for adult and infant–adult interaction. Commun. Monogr. 49 , 89–114 (1982).

Hildenbrandt, H., Carere, C. & Hemelrijk, C. Self-organized aerial displays of thousands of starlings: a model. Behav. Ecol. 21 , 1349–1359 (2010).

Huth, A. & Wissel, C. The simulation of the movement of fish schools. J. Theor. Biol. 156 , 365–385 (1992).

Couzin, I. D. Collective cognition in animal groups. Trends Cogn. Sci. 13 , 36–43 (2009).

Moussaïd, M. et al. Experimental study of the behavioural mechanisms underlying self-organization in human crowds. Proc. R. Soc. B Biol. Sci. 276 , 2755–2762 (2009).

Hale, J., Ward, J. A., Buccheri, F., Oliver, D. & Hamilton, A. F. D. C. Are you on my wavelength? Interpersonal coordination in dyadic conversations. J. Nonverbal Behav. 44 , 63–83 (2020). This study of head nodding behaviour in conversation identifies two different types of nods using motion capture .

Wilson, M. & Wilson, T. An oscillator model of the timing of turn-taking. Psychon. Bull. Rev. 12 , 957–968 (2005).

Takahashi, D., Narayanan, D. & Ghazanfar, A. A. Coupled oscillator dynamics of vocal turn-taking in monkeys. Curr. Biol. 23 , 2162–2168 (2013).

Pickering, M. J. & Garrod, S. The interactive-alignment model: developments and refinements. Behav. Brain Sci. 27 , 212–225 (2004).

Holler, J. & Wilkin, K. Co-speech gesture mimicry in the process of collaborative referring during face-to-face dialogue. J. Nonverbal Behav. 35 , 133–153 (2011).

Heyes, C. Where do mirror neurons come from? Neurosci. Biobehav. Rev. 34 , 575–583 (2010).

Heyes, C. Cognitive Gadgets (Harvard Univ. Press, 2018).

Giles, H. Accent mobility: a model and some data. Anthropol. Linguist. 15 , 87–109 (1973).

Dragojevic, M., Gasiorek, J. & Giles, H. in The International Encyclopedia of Interpersonal Communication (eds Berger, R. C. & Roloff, E. M.) 1–21 (Wiley Blackwell, 2015).

Bailenson, J. N. & Yee, N. Digital chameleons: automatic assimilation of nonverbal gestures in immersive virtual environments. Psychol. Sci. 16 , 814–819 (2005).

Pomerantz, A. in Structures of Social Action: Studies in Conversation Analysis (eds Atkinson, J. & Heritage, J.) 57–101 (Cambridge Univ. Press, 1984).

Sperber, D. & Wilson, D. Relevance: Communication and Cognition (Harvard Univ. Press, 1986).

Butterworth, G. & Morissette, P. Onset of pointing and the acquisition of language in infancy. J. Reprod. Infant. Psychol. 14 , 219–231 (1996).

Morissette, P., Ricard, M. & Décarie, T. G. Joint visual attention and pointing in infancy: a longitudinal study of comprehension. Br. J. Dev. Psychol. 13 , 163–175 (1995).

Southgate, V., Van Maanen, C. & Csibra, G. Infant pointing: communication to cooperate or communication to learn? Child. Dev. 78 , 735–740 (2007).

Tomasello, M., Carpenter, M. & Liszkowski, U. A new look at infant pointing. Child. Dev. 78 , 705–722 (2007).

Begus, K. & Southgate, V. Infant pointing serves an interrogative function. Dev. Sci. 15 , 611–617 (2012).

Wyman, E., Rakoczy, H. & Tomasello, M. Non-verbal communication enables children’s coordination in a “stag hunt” game. Eur. J. Dev. Psychol. 10 , 597–610 (2013).

Clark, H. H. & Murphy, G. L. Audience design in meaning and reference. Adv. Psychol. 9 , 287–299 (1982).

Trujillo, J., Özyürek, A., Holler, J. & Drijvers, L. Evidence for a multimodal Lombard effect: speakers modulate not only speech but also gesture to overcome noise. Sci. Rep. 11 , 16721 (2020).

Beechey, T., Buchholz, J. M. & Keidser, G. Hearing impairment increases communication effort during conversations in noise. J. Speech Lang. Hear. Res. 63 , 305–320 (2020). This study shows how talkers spontaneously modify their speech according to the acoustic environment and their partner’s hearing ability .

Krishnan-Barman, S. & Hamilton, A. F. D. C. Adults imitate to send a social signal. Cognition 187 , 150–155 (2019).

Fridlund, A. Sociality of solitary smiling: potentiation by an implicit audience. J. Pers. Soc. Psychol. 60 , 229–240 (1991).

Senju, A., Southgate, V., White, S. & Frith, U. Mindblind eyes: an absence of spontaneous theory of mind in Asperger syndrome. Science 325 , 883–885 (2009).

Nadig, A., Vivanti, G. & Ozonoff, S. Adaptation of object descriptions to a partner under increasing communicative demands: a comparison of children with and without autism. Autism Res. 2 , 334–347 (2009).

Georgescu, A. L. et al. Reduced nonverbal interpersonal synchrony in autism spectrum disorder independent of partner diagnosis: a motion energy study. Mol. Autism 11 , 1–14 (2020).

Freeth, M. & Bugembe, P. Social partner gaze direction and conversational phase; factors affecting social attention during face-to-face conversations in autistic adults? Autism 23 , 503–513 (2019).

Cisek, P. & Kalaska, J. F. Neural mechanisms for interacting with a world full of action choices. Annu. Rev. Neurosci. 33 , 269–298 (2010).

Wang, Y. & Hamilton, A. F. D. C. Social top-down response modulation (STORM): a model of the control of mimicry in social interaction. Front. Hum. Neurosci. 6 , 153 (2012).

Cornejo, C., Cuadros, Z., Morales, R. & Paredes, J. Interpersonal coordination: methods, achievements, and challenges. Front. Psychol. 8 , 1685 (2017).

Onnela, J. P., Waber, B. N., Pentland, A., Schnorf, S. & Lazer, D. Using sociometers to quantify social interaction patterns. Sci. Rep. 4 , 1–9 (2014).

Baltrušaitis, T., Robinson, P. & Morency, L. P. in 2016 IEEE Winter Conf. Applications of Computer Vision (WACV) 1–10 (IEEE, 2016).

Issartel, J., Bardainne, T., Gaillot, P. & Marin, L. The relevance of the cross-wavelet transform in the analysis of human interaction — a tutorial. Front. Psychol. 5 , 1566 (2015).

Gatica-Perez, D. Automatic nonverbal analysis of social interaction in small groups: a review. Image Vis. Comput. 27 , 1775–1787 (2009).

Richardson, M. J., Dale, R. & Marsh, K. L. in Handbook of Research Methods in Social and Personality Psychology (eds Reis, H. T. & Judd, C. M.) 253–282 (Cambridge Univ. Press, 2014).

Wykowska, A., Chaminade, T. & Cheng, G. Embodied artificial agents for understanding human social cognition. Philos. Trans. R. Soc. B Biol. Sci. 371 , 20150375 (2016).

Parry, R. in Sage Handbook of Qualitative Methods in Health Research (Bourgeault, I., Dingwall, R. & De Vries, R.) 373–396 (Sage, 2010).

Wild, K. S., Poliakoff, E., Jerrison, A. & Gowen, E. The influence of goals on movement kinematics during imitation. Exp. Brain Res. 204 , 353–360 (2010).

McEllin, L., Sebanz, N. & Knoblich, G. Identifying others’ informative intentions from movement kinematics. Cognition 180 , 246–258 (2018).

Schegloff, E. A. in Talk and Social Structure (eds Boden, D. & Zimmerman, D.) 44–70 (Univ. of California Press, 1991).

Schegloff, E. A. Reflections on quantification in the study of conversation. Res. Lang. Soc. Interact. 26 , 99–128 (1993).

Kendrick, K. H. Using conversation analysis in the lab. Res. Lang. Soc. Interact. 50 , 1–11 (2017).

de Ruiter, J. P. & Albert, S. An appeal for a methodological fusion of conversation analysis and experimental psychology. Res. Lang. Soc. Interact. 50 , 90–107 (2017). This work is a helpful review that draws together ideas from the conversation analysis tradition and cognitive science .

Gomez-Marin, A. & Ghazanfar, A. A. The life of behavior. Neuron 104 , 25–36 (2019). This paper explores the importance of context in studying and understanding behaviour .

Baxter, L. A. & Babbie, E. R. in The Basics of Communication Research (eds Baxter L. A. & Babbie, E.) 296–380 (Cengage Learning, 2003).

Hömke, P., Holler, J. & Levinson, S. C. Eye blinking as addressee feedback in face-to-face conversation. Res. Lang. Soc. Interact. 50 , 54–70 (2017).

Hömke, P., Holler, J. & Levinson, S. C. Eye blinks are perceived as communicative signals in human face-to-face interaction. PLoS ONE 13 , e0208030 (2018). This study elucidates the role of blinks in conversation and communication .

Chen, P. H. A. et al. Socially transmitted placebo effects. Nat. Hum. Behav. 3 , 1295–1305 (2019). This high-resolution face-tracking study demonstrates the power of detailed analyses of structured social interactions .

van der Steen, M. C. & Keller, P. E. The ADaptation and Anticipation Model (ADAM) of sensorimotor synchronization. Front. Hum. Neurosci. 7 , 253 (2013).

van der Steen, M. C., Jacoby, N., Fairhurst, M. T. & Keller, P. E. Sensorimotor synchronization with tempo-changing auditory sequences: modeling temporal adaptation and anticipation. Brain Res. 1626 , 66–87 (2015).

Gratch, J., Wang, N., Gerten, J., Fast, E. & Duffy, R. in Intelligent Virtual Agents (eds Pelachaud, C. et al.) 125–138 (Springer, 2007). This study investigates how artificial agents can create rapport, demonstrating the possibilities and limitations of this technology .

Cooke, M., King, S., Garnier, M. & Aubanel, V. The listening talker: a review of human and algorithmic context-induced modifications of speech. Comput. Speech Lang. 28 , 543–571 (2014).

McEllin, L., Knoblich, G. & Sebanz, N. Distinct kinematic markers of demonstration and joint action coordination? Evidence from virtual xylophone playing. J. Exp. Psychol. Hum. Percept. Perform. 44 , 885–897 (2018). This study shows how people use subtle variations in action kinematics to communicate to a partner in different contexts .

Nunamaker, J. F. Jr, Chen, M. & Purdin, T. D. Systems development in information systems research. J. Manag. Inf. Syst. 7 , 89–106 (1990).

Wallace, W. L. The Logic of Science in Sociology (Routledge, 2017).

Marsella, S. & Gratch, J. in Handbook of Emotions (eds Barrett, L. F., Lewis, M. & Haviland-Jones, J. M.) 113–132 (The Guilford Press, 2016).

Mills, P. F., Harry, B., Stevens, C. J., Knoblich, G. & Keller, P. E. Intentionality of a co-actor influences sensorimotor synchronisation with a virtual partner. Q. J. Exp. Psychol. 72 , 1478–1492 (2019).

van der Steen, M. C., Schwartze, M., Kotz, S. A. & Keller, P. E. Modeling effects of cerebellar and basal ganglia lesions on adaptation and anticipation during sensorimotor synchronization. Ann. NY Acad. Sci. 1337 , 101–110 (2015).

Tracy, L. F., Segina, R. K., Cadiz, M. D. & Stepp, C. E. The impact of communication modality on voice production. J. Speech Lang. Hear. Res. 63 , 2913–2920 (2020).

Cañigueral, R., Ward, J. A. & Hamilton, A. F. D. C. Effects of being watched on eye gaze and facial displays of typical and autistic individuals during conversation. Autism 25 , 210–226 (2021).

Pika, S., Wilkinson, R., Kendrick, K. H. & Vernes, S. C. Taking turns: bridging the gap between human and animal communication. Proc. R. Soc. B 285 , 20180598 (2018).

Henrich, J., Heine, S. J. & Norenzayan, A. Most people are not WEIRD. Nature 466 , 29 (2010).

Stivers, T. et al. Universals and cultural variation in turn-taking in conversation. Proc. Natl Acad. Sci. USA 106 , 10587–10592 (2009). This cross-linguistic study of question–answer pairs in spontaneous conversation shows the speed and accuracy with which people take turns .

Dingemanse, M., Torreira, F. & Enfield, N. J. Is “Huh?” a universal word? Conversational infrastructure and the convergent evolution of linguistic items. PLoS ONE 8 , e78273 (2013).

Bandura, A. & Walters, R. H. Social Learning Theory (Prentice Hall, 1977).

Haensel, J. X., Smith, T. J. & Senju, A. Cultural differences in mutual gaze during face-to-face interactions: a dual head-mounted eye-tracking study. Vis. Cogn. https://doi.org/10.1080/13506285.2021.1928354 (2021).

Article   Google Scholar  

De Lillo, M. et al. Tracking developmental differences in real-world social attention across adolescence, young adulthood and older adulthood. Nat. Hum. Behav. 5 , 1381–1390 (2021).

Eaton, L. G. & Funder, D. C. The creation and consequences of the social world: an interactional analysis of extraversion. Eur. J. Pers. 17 , 375–395 (2003).

Back, M. D., Schmukle, S. C. & Egloff, B. Predicting actual behavior from the explicit and implicit self-concept of personality. J. Pers. Soc. Psychol. 97 , 533–548 (2009).

Uekermann, J. et al. Social cognition in attention-deficit hyperactivity disorder (ADHD). Neurosci. Biobehav. Rev. 34 , 734–743 (2010).

Heerey, E. A. & Kring, A. M. Interpersonal consequences of social anxiety. J. Abnorm. Psychol. 116 , 125–134 (2007).

McNaughton, K. A. & Redcay, E. Interpersonal synchrony in autism. Curr. Psychiatry Rep. 22 , 1–11 (2020).

Barzy, M., Ferguson, H. J. & Williams, D. M. Perspective influences eye movements during real-life conversation: mentalising about self versus others in autism. Autism 24 , 2153–2165 (2020).

Zhao, Z. et al. Random and short-term excessive eye movement in children with autism during face-to-face conversation. J. Autism Dev. Disord. https://doi.org/10.1007/s10803-021-05255-7 (2021).

Cañigueral, R. & Hamilton, A. F. D. C. The role of eye gaze during natural social interactions in typical and autistic people. Front. Psychol. 10 , 560 (2019).

Download references

Acknowledgements

L.V.H is supported by a UKRI Future Leaders Fellowship (MR/T041471/1). G.N. is supported by the Medical Research Council (MR/S003576/1) and the Chief Scientist Office of the Scottish Government. A.F.C.H. was supported by the Leverhulme Trust (RPG-20160-251).

Author information

Authors and affiliations.

Hearing Sciences — Scottish Section, School of Medicine, University of Nottingham, Glasgow, UK

Lauren V. Hadley & Graham Naylor

Institute of Cognitive Neuroscience, Division of Psychology and Language Sciences, University College London, London, UK

Antonia F. de C. Hamilton

You can also search for this author in PubMed   Google Scholar

Contributions

L.V.H and A.F.C.H. contributed substantially to discussion of article content and wrote the article. All authors reviewed and/or edited the manuscript before submission.

Corresponding author

Correspondence to Lauren V. Hadley .

Ethics declarations

Competing interests.

The authors declare no competing interests.

Additional information

Peer review information.

Nature Reviews Psychology thanks Andrew Bayliss, who co-reviewed with Kristina Veranic; Judee Burgoon; and the other, anonymous, reviewer(s) for their contribution to the peer review of this work.

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Cite this article.

Hadley, L.V., Naylor, G. & Hamilton, A.F.d.C. A review of theories and methods in the science of face-to-face social interaction. Nat Rev Psychol 1 , 42–54 (2022). https://doi.org/10.1038/s44159-021-00008-w

Download citation

Accepted : 25 October 2021

Published : 12 January 2022

Issue Date : January 2022

DOI : https://doi.org/10.1038/s44159-021-00008-w

Share this article

Anyone you share the following link with will be able to read this content:

Sorry, a shareable link is not currently available for this article.

Provided by the Springer Nature SharedIt content-sharing initiative

This article is cited by

Psychometric properties of the arabic versions of the long (27 items) and short (13 items) forms of the interpersonal mindfulness scale (ims).

  • Feten Fekih-Romdhane
  • Diana Malaeb
  • Souheil Hallit

BMC Psychiatry (2024)

A levels-of-analysis framework for studying social emotions

  • Xiaoxue Gao
  • Xiaolin Zhou

Nature Reviews Psychology (2024)

Time to Smile: How Onset Asynchronies Between Reciprocal Facial Expressions Influence the Experience of Responsiveness of a Virtual Agent

  • Leon O. H. Kroczek
  • Andreas Mühlberger

Journal of Nonverbal Behavior (2023)

The fundamental importance of method to theory

  • Anne S. Warlaumont
  • Kerri L. Johnson

Nature Reviews Psychology (2022)

Interpersonal Mindfulness Scale-Short Form Development Using Rasch Analyses

  • Steven D. Pratscher
  • Danielle L. Oyler
  • Oleg N. Medvedev

Mindfulness (2022)

Quick links

  • Explore articles by subject
  • Guide to authors
  • Editorial policies

Sign up for the Nature Briefing newsletter — what matters in science, free to your inbox daily.

social interaction theory research paper

REVIEW article

Studying social interactions through immersive virtual environment technology: virtues, pitfalls, and future challenges.

\r\nDario Bombari*

  • 1 Department of Organizational Behavior, Faculty of Business and Economics, University of Lausanne, Lausanne, Switzerland
  • 2 Bern University of Applied Sciences, Bern, Switzerland

The goal of the present review is to explain how immersive virtual environment technology (IVET) can be used for the study of social interactions and how the use of virtual humans in immersive virtual environments can advance research and application in many different fields. Researchers studying individual differences in social interactions are typically interested in keeping the behavior and the appearance of the interaction partner constant across participants. With IVET researchers have full control over the interaction partners, can standardize them while still keeping the simulation realistic. Virtual simulations are valid: growing evidence shows that indeed studies conducted with IVET can replicate some well-known findings of social psychology. Moreover, IVET allows researchers to subtly manipulate characteristics of the environment (e.g., visual cues to prime participants) or of the social partner (e.g., his/her race) to investigate their influences on participants’ behavior and cognition. Furthermore, manipulations that would be difficult or impossible in real life (e.g., changing participants’ height) can be easily obtained with IVET. Beside the advantages for theoretical research, we explore the most recent training and clinical applications of IVET, its integration with other technologies (e.g., social sensing) and future challenges for researchers (e.g., making the communication between virtual humans and participants smoother).

Humans spend between 32 and 75% of their waking time in social interactions ( Mehl and Pennebaker, 2003 ). To understand how we behave in social interactions, how we draw conclusions about our social interaction partners, or how the outcome of the social interaction will shape us and our social relationships, we need to observe and study humans engaged in a wide variety of different social contexts. Given the frequency of its occurrence and the importance of social interactions for understanding humans and for bringing about change for individuals and society, the lack of research using direct behavioral observation is surprising ( Baumeister et al., 2007 ). One reason for this gap is that if we focus on natural observation, we may have to wait long periods of time before a desired social situation occurs naturally with us being present to observe it. In an attempt to overcome these constraints, researchers typically use simulations, meaning that people are put in a specific social situation from which their behavior is observed and the interaction outcomes assessed. In the present review, we describe how such simulations can take place in an immersive virtual environment (IVE) with virtual humans as social interaction partners and we discuss the distinct advantages and challenges of this method.

In this article, we focus on social interactions with virtual humans in the IVEs and their use for research and training. While IVET has been around for several decades, the use of this technology for the social sciences is still relatively new ( Fox et al., 2009 ) and particularly the aspect of including virtual humans as social interaction partners to simulate interpersonal encounters is still in its infancy. It is the latter aspect on which we will shed light by describing the state of the art in this domain, some of the main findings, and the existing challenges and future directions of this line of research. Our contribution is at the same time an update of the earlier review by Fox et al. and a focalization on the simulation of social interactions with virtual humans.

The Need for Standardized Social Interaction Partners

For researchers studying how people behave in social interactions, one of the biggest challenges is that the behavior of one person is always, at least in part, a function of the behavior of his/her social interaction partner. If my social interaction partner smiles a lot, then I tend to respond in kind ( Hatfield et al., 1992 ). Typically, social scientists studying interpersonal behavior are interested in investigating why one person behaves differently from another person – known as the study of individual differences. Such differences become hard to interpret if they are affected by what the social interaction partner does. There are different solutions to this problem of non-independence of the observational data in social interactions. One possibility is to include the interaction partner’s behavior as a control variable in the statistical analysis. This is not an optimal solution because the “contamination” of a person’s behavior by another person’s behavior occurs simultaneously through different channels (e.g., verbal and non-verbal) and the behavioral cues are often very subtle and hard to observe and measure. Moreover, it is unclear which out of the many different behaviors a person shows would have to be assessed in order to be able to control for.

The optimal solution is the standardization of the social interaction partner, meaning that the social interaction partner behaves exactly in the same way with each and every participant. With the standardization of the social interaction partner, differences in the behavior of a series of participants can be attributed entirely too actual differences among these people and not to anything their social interaction partner did.

One approach to standardization is the use of trained confederates. These are actors that are instructed and trained to maintain the same verbal and non-verbal reactions across participants and across conditions. Interacting with confederates (that the participants believe to be regular other participants) has high ecological validity because it is an interaction between two humans. However, in terms of standardization, it does not ensure that all behaviors are entirely controlled, especially if one considers non-verbal behavior (e.g., facial mimicry) that is much less under conscious control than, for instance, verbal behavior. Indeed, research ( Congdon and Schober, 2002 ; Topal et al., 2008 ) shows that confederates still behave slightly differently depending on whom they are interacting with and this has an influence on participants’ behavior (see Kuhlen and Brennan, 2013 for a discussion on this topic).

Another experimental setting used to circumvent the issues associated with the inter-dependence of the behavior in a social interaction involves the use of vignettes. In vignette studies, participants are provided with a cover story or with cues (e.g., a picture) describing an interaction partner in a particular situation. Participants are asked to imagine being in an interaction with that partner. This setting has the advantage of maximally controlling the behavior of the social interaction partner (maximal standardization) to the detriment of ecological validity. These studies are quite far removed from real-life interactions and might thus find results that cannot be generalized to or might not be valid for real-life situations.

Typically, the methods high in ecological validity (e.g., social interactions with confederates) are low on standardization and the methods high in standardization (e.g., social interactions with a person described in a vignette) are low on ecological validity. Using virtual humans in an IVE provides us with the best of both worlds: high ecological validity and high standardization ( Blascovich et al., 2002 ). Thus, IVEs presents a valuable possibility to overcome the issues we discussed above. In addition, using a virtual simulation of an interaction enables researchers to easily replicate the studies, which is important especially for those domains, such as social psychology, in which replication is lacking ( Blascovich et al., 2002 ).

Virtual Humans in IVEs

A virtual human is a computer-generated three-dimensional digital representation that looks and acts like a real human. Blascovich et al. (2002) differentiates between human-avatars (virtual humans controlled by humans) and agent-avatars (virtual humans controlled by computers). In the present article, we use the generic term virtual humans.

The first attempts of using virtual humans as social interaction partners became possible in the 90s. These technologies consisted of a desktop computer in which one or more virtual human interaction partners were displayed and could interact with the participant (e.g., provides information, answer standardized questions). Whereas this method constituted an improvement in terms of standardization, realism was still quite low and, as a consequence, the implications of any findings obtained were limited. This changed at the turn of the new millennium with the advancement of technology and the increased processing power of computers, making it possible to incorporate virtual humans in IVEs.

Immersion in the Virtual

Immersive virtual environment technology means that a person is fully immersed in a virtual world in which he or she can walk and look around as in the real world. The basic setup of IVET is the following: (1) the physical movement (e.g., head turning) of a participant is tracked (e.g., via an infrared camera), (2) the perceptual information of the virtual world is updated according to those movements through computer-based calculations, and (3) the perceptual information (e.g., visual information displayed through head-mounted displays) is sent back to the participant ( Blascovich et al., 2002 ). Even though in principle any kind of sensory feedback can be provided to participants, most of the studies on social interactions focused on visual and auditory information, which is typically sent through the head-mounted display (or projected to the physical walls of a room, as in so-called CAVE systems) and headphones or speakers.

We refer to immersion as the objective amount and quality of the perceptual input provided to participants through technological instruments ( Mantovani and Castelnuovo, 2003 ), such as the 3D visual input. Also, the degree of immersion in the virtual world and in the interaction with virtual humans can be manipulated by providing more or less sensorial information to the participants. As an example, IVET is more immersive than desktop virtual reality because it provides more sensorial inputs. We use presence as it refers to the participants’ subjective feeling of “being there,” interacting with their own body in a virtual world that is perceived as real ( Heeter, 1992 ; Ijsselsteijn et al., 2001 ; Schuemie et al., 2001 ). It can be operationalized as the correspondence of participants’ reactions and emotions between a real and a virtual situation and can be measured in different ways (e.g., physiological responses, behavioral measures, and self-assessment). The literature is quite inconsistent in terms of the different definitions of presence and immersion. Some authors refer to the former as “psychological immersion” ( Palmer, 1995 ) and to the latter as “perceptual immersion” ( Biocca and Delaney, 1995 ). Other authors define immersion as a subjective feeling ( Fox et al., 2009 ), as the degree of “realness” of participants’ behavior (which, as explained above, we rather consider as an operationalization of presence), or use the terms presence and immersion interchangeably. In our view, immersion is a determinant of feeling of presence. In Freeman et al.’s (1999) study, participants watching motion scenes in 3D reported higher feelings of presence when compared to 2D. Kober et al. (2012) found that EEG activity in parietal areas of the brain was correlated with feelings of presence and was higher when participants were involved in a highly immersive virtual reality environment compared to a desktop version of the same task. Even though research has shown that virtual reality can evoke a strong feeling of presence, and especially so in immersive virtual environments, the intensity of those reactions are not as pronounced as in real world situations ( Jacobson, 2001 ). Importantly, feeling of presence in IVEs can be improved by using virtual humans as social interaction partners ( Slater et al., 2006b ). Copresence is an aspect of presence that implies the feeling of being there, in the same virtual space, together with virtual humans. As a consequence, individuals feel that virtual partners are “available” and can either influence or be influenced by them ( Lee, 2004 ). Social presence is a broader concept than copresence as it does not require sharing the same virtual space ( Lee, 2004 ). As we will show in the next sections, the use of virtual humans in IVEs represents a powerful social interaction simulation method.

Realistic Looking Virtual Humans

High ecological validity can also be achieved by using virtual humans that look realistic and behave in a realistic way. Technological advances have improved the graphic quality and the motion animation of virtual humans dramatically over the past decade. The virtual humans available to date are very convincing. Typically, the better the esthetic representation of a human and the closer to a real human the human representation comes, the more acceptable the human representation is to an observer, engendering more natural reactions from the observer ( Blascovich, 2002 ; Slater and Steed, 2002 ). However, at a certain point of similarity, an observer’s reaction can be of revulsion, only to return to something more positive when the virtual human becomes more distinguishable from a real human. This is called the uncanny valley effect ( Mori, 1970 ). With the increased realism in virtual humans we become less likely to accept features that deviate from actual human features. That is, unless the representation is absolutely “perfect,” we will pick up on subtle abnormalities in the representation which makes us respond in an adverse way. Indeed, participants have an unpleasant impression of highly realistic (although not perfect) virtual humans as opposed to more caricature-based avatars ( Seyama and Nagayama, 2007 ). To illustrate, a brisk and unnatural hand movement in a very simplistic virtual human would be less surprising and can be attributed to the crudeness of the simulation of the virtual human. However, if an almost perfect virtual human shows the same gesture; observers are bothered and they try to find out what is wrong with the virtual human, which then reduces its perceived realism and participants’ copresence. Even though there are many anecdotal examples about the uncanny valley , the effect has not been systematically studied in an IVE. Overall, studies using IVET and other methodologies (e.g., videoclips, desktop virtual reality) show that virtual humans are reported as odd or eerie when there is a perceived mismatch between their high-quality “physical” appearance and their behavior, such as their gaze behavior ( Garau et al., 2003 ) or their facial expressions ( Tinwell et al., 2011 ).

Are Virtual Social Interactions Similar to Real Social Interactions?

Despite the relatively high ecological validity of IVET-based social interactions, they still remain virtual. One might therefore wonder whether social interaction behavior shown with virtual humans in IVEs is similar to what people would do in real world interactions. Bailenson et al. (2003) measured the interpersonal distance that participants maintained while approaching a virtual human who engaged them in mutual gaze as compared to a virtual human who did not look at the participants. Results show the same behavioral pattern found in real social interactions ( Argyle and Dean, 1965 ; Patterson et al., 2002 ): when the social interaction partner (the virtual human) looked at the participants, the latter maintained greater interpersonal distance than when the social interaction partner was not looking at them.

In the same vein, Hoyt et al. (2003) used IVET to replicate classic social psychology findings on social inhibition. They trained a group of participants in a specific task and subsequently asked them to perform it either in the presence of virtual humans or alone. In accordance with the classic social inhibition finding ( Buck et al., 1992 ), participants performed worse when in the presence of virtual humans. Relatedly, the presence of a social interaction partner often increases arousal in real social interactions ( Patterson, 1976 ) and the same was true in an IVE. Slater et al. (2006b) found that participants had higher arousal, measured through physiological responses such as heart-rate and galvanic skin response, when they were in a virtual environment with virtual humans present (i.e., a bar) compared to a lone training session in the IVE. Also, the closer the virtual human approached participants, the higher their physiological arousal ( Llobera et al., 2010 ).

Giannopoulos et al. (2010) investigated handshakes by asking participants to take part in a virtual cocktail party. They had to shake virtual humans’ hands by using a haptic device controlled either by an algorithm created to produce realistic movements or by a real human. Results showed that virtual handshakes operated by a robot were rated similarly as handshakes operated by humans. Dyck et al. (2008) used the Facial Action Coding System ( Ekman and Friesen, 1978 ) to artificially create facial expressions of six basic emotions on virtual humans that closely matched those displayed by real actors. Specific facial action units used in natural expressions were implemented in virtual humans. Results showed that virtual facial expressions of emotions displayed by virtual humans were overall recognized as accurately, and for some emotions (i.e., sadness and fear) even more accurately, as natural expressions displayed by real human actors. This study suggests that virtual humans can be reliably used to communicate emotions, although some technical advancement is needed to improve the perceived quality of some specific emotions (e.g., disgust). In the same vein, Qu et al. (2014) asked participants to have a conversation with a virtual woman who displayed either positive or negative facial expressions both while speaking and listening to the participants. Results showed that the emotions (positive or negative) displayed by the virtual woman during the interaction, and especially in the speaking phase, evoked a congruent emotional state in the participants. The same effect was observed in real social interactions ( Hess and Blairy, 2001 ; Hess and Fischer, 2013 ). Santos-Ruiz et al. (2010) adapted the Trier Social Stress Test (TSST; Kirschbaum et al., 1993 ), a task typically used to induce acute social stress, to an IVE. As in the original version of the TSST, participants had to deliver a speech addressing their own good and bad qualities. The virtual human audience changed attitude from interested to restless. Following the speech participants performed an arithmetic task (to continuously subtract 13 starting from a given number) and were informed that after an error they would have to start over. Electrodermal responses and increased salivary cortisol levels in the participants were in line with those found in previous research outside IVEs ( Kelly et al., 2007 ).

The engagement in the virtual situation and the extent to which participants perceive the virtual social interactions as real differ among individuals. Typically, the feeling of presence is measured in participants in order to check whether it affects the results obtained. This could be used to discard participants who were for one reason or another not engaged enough in the virtual world or did not have the feeling of being there, which, based on our decade long experience in virtual reality, has very rarely happened. For correlational research it is, however, important to assure that the findings are not due to the fact that some people felt more presence than others. Research shows that individual differences in feelings of presence typically do not affect the results. For instance, in a scenario in which participants were in the role of a patient ( Schmid Mast et al., 2008 ), they behaved differently when interacting with a dominant vs. a non-dominant physician. Importantly, the degree to which they were engaged in the virtual encounter – their feeling of presence – did not affect the results. In the same vein, Hartanto et al. (2014) used IVET to induce social stress in participants through job interviews with two virtual humans. They reported that differences in presence among participants did not affect feelings of stress.

In summary, there is evidence that subjective feelings, behavioral, and physiological reactions during interactions with virtual humans are very similar to those shown during interactions with real humans. IVET-simulated interactions are therefore a dependable manipulation that can be considered a proxy of real life interactions. In the next section, we discuss some of the main advantages of using virtual humans and IVEs for studying social interactions.

Why Use Virtual Humans in IVEs?

The standardization of the social interaction partner is useful for social psychology studies because all the observed variance among participants can fully be attributed to them, or to a previous manipulation, and is not due to or affected by the social interaction partner’s behavior. Interacting with virtual humans in IVEs has also three other distinct advantages. First, it enables the researcher to manipulate something in the environment or about the virtual social interaction partner and then to observe how this manipulation affects the participant’s interaction behavior and/or interaction outcomes. Second, IVEs provide a means of exposing the participant to social interactions that may well be impossible in real life. Third, virtual humans in IVEs are a relatively low-cost and effective solution to train participants or clinical populations in different tasks.

Manipulation of the Virtual Environment and the Virtual Human

Using a standardized simulation of a social interaction with virtual humans and IVEs provide the opportunity to subtly manipulate something in the virtual environment or the virtual human to test the effect of this change on the social interaction. Creating such controlled conditions are crucial for the discovery of causal relationships among variables and for disentangling the single or joint effects of different aspects of the environment or the social interaction partner on the way a social interaction unfolds. To illustrate, Latu et al. (2013) asked participants to deliver a persuasive speech in front of a group of virtual humans. The experimental manipulation centered around a picture hanging on a wall of the virtual room facing the speaker. Female participants showed improved speech performance when the picture displayed a female role model (i.e., Hillary Clinton, Angela Merkel) compared to a male role model, or when no picture was presented. Importantly, the virtual humans maintained the same non-verbal behavior across all participants, which enabled the researchers to conclude that the obtained effect was based solely on the experimental manipulation.

Moreover, the reaction of the public itself can be manipulated in order to study the effect on participant’s behavior. Pertaub et al. (2002) involved participants in a public speaking situation in which they had to deliver a speech in front of a neutral, a positive, or a bored audience composed of eight virtual humans. Unsurprisingly, they found that the negative/bored audience provoked higher levels of anxiety in participants. Overall, in studies involving a public speaking situation, IVEs are a worthy option not only because of the experimental control they afford but also because recruiting a group of actual humans would be time and cost intensive.

Alternative manipulations to virtual scenarios could involve changes to the virtual humans so as to test whether this manipulation affects the participant’s behavior in a social interaction. The use of virtual humans in IVEs enables us to disentangle variables that, in real life, are often interwoven and to study their respective effect on an outcome variable. For example, female doctors typically have a more caring and empathic communication style when interacting with their patients than male doctors ( Roter et al., 2002 ). If we want to test the effect of women doctors and of a caring and empathic communication style independent of each other, we have to be able to vary them independently. We did so in a study in which we had female and male virtual doctors use either a caring or non-caring combined with either a dominant or non-dominant communication style and measured the participants’ satisfaction with the (virtual) consultation ( Schmid Mast et al., 2008 ). Results showed that female patients were particularly satisfied with female doctors who adopted a gender-congruent, thus caring communication style whereas patient satisfaction for female doctors was unaffected by the dominance dimension. Satisfaction with the male doctors was unaffected by either communication style.

In a social situation, we react to the other person’s verbal and non-verbal behavior and also to the other person’s appearance. The effect of these different pieces of information can also be varied independent of each other when virtual humans are used. The same virtual human can, for instance, provide the same spoken information to all participants but differ in the non-verbal information depending on the condition participants are in. For instance, there could be two versions of the virtual human, one that has an expansive and animated body posture and one that has a constricted and rather immobile posture, while holding the spoken information the virtual human delivers constant. In such a setting, researchers could investigate how body language, specifically, affects the social interaction partner. This manipulation would be extremely difficult to obtain when using trained confederates. Indeed, Bailenson and Yee (2005) used a similar paradigm to study the effect of body posture mimicry of virtual humans on participants’ ratings of verbal information and of the general impression made by the virtual humans. Virtual humans delivered a persuasive speech to participants while either mimicking the participant’s body position with a delay of 4 s or while performing prerecorded body movements. Participants rated mimicking virtual humans more positively and their speeches as more persuasive compared to non-mimicking virtual humans. Likewise, Vinayagamoorthy et al. (2008) found that the body posture position of a virtual human providing information to participants played an important role on the perception of affective states of the virtual human. Participants interacting with virtual humans displaying anger reported that their body posture was the primary source of information to detect their emotional state.

Moreover, while the verbal and non-verbal behavior is kept constant, researchers can manipulate the physical appearance of a virtual human in order to test its influence on participants’ behavior. In Dotsch and Wigboldus (2008) ’s study, Caucasian participants approached virtual humans with either White or Moroccan facial features. Participants maintained a bigger interpersonal distance to Moroccan-like virtual humans and the effect was moderated by their implicit negative associations toward this group.

Impossible Real-World Social Interactions in the Virtual

Another advantage of using IVET to study interactions is that situations and manipulations that would be impossible in real life can be created. Although ecological validity of such experiments are by definition low, they can help to understand how different variables interact with each other and advance our theoretical understanding of human cognition and behavior. To illustrate, participants can be embodied (i.e., own or control a virtual body from a first person perspective) in any virtual human with any specific characteristics and this can have an effect on interaction outcomes. The psychological and behavioral effects due to the embodiment of people in a particular virtual human are known as the Proteus effect ( Yee and Bailenson, 2007 ). Yee and Bailenson (2007) made participants adopt more or less attractive virtual humans and found that participants assigned to attractive virtual humans approached more closely other virtual humans. In a second study, participants performed a negotiation task while embodying taller or shorter virtual humans. Participants assigned to taller avatars behaved in a more confident way during the interaction. The method researchers typically use to provide visual feedback about the physical appearance of the virtual human that participants embody is to locate a virtual mirror in the IVE ( Yee and Bailenson, 2007 ). The virtual mirror reflects the real body movements of the participants while the appearance can be rendered in any form.

Many physical appearance manipulations of the virtual human are possible, including gender, race, age, and body size. Importantly, manipulating people’s appearance changes their cognitions, possibly by associating the self with concepts related to other groups ( Maister et al., 2015 ). In this sense, virtual embodiment could be used as an alternative to priming manipulations. As an example, Peck et al. (2013) showed that embodying white participants into dark-skinned avatars reduced their implicit racial bias. Kilteni et al. (2013) found that participants embodied in a dark-skinned and casual-dressed virtual human improved their drumming skills. Given the rather explicit nature of embodiment, some caution should be used in order to avoid social desirability effects (e.g., participants might respond according to what they think it is expected from them).

Another example of manipulations that would be impossible to test in a real life situation is when extreme or complex social behaviors and cognitions are involved. For instance, Slater et al. (2006a) replicated the well-known study by Milgram (1963) in an IVE in which participants administer electric shocks to interaction partners. The results were comparable to the real world study, namely that participants tend to obey to orders from authority figures to the extent of administering severe electric shocks that could endanger another person’s life.

A collaborative virtual environment (CVE) is yet another example of how real world social scenarios can be incorporated into the virtual. In these settings the actual humans do not need to be in the same physical space but can remotely embody an avatar and interact with peers. This manipulation was used by Bailenson et al. (2005) in a study on augmented gaze in which three participants were present in the scenario. One of the participants read a persuasive message to the other two participants. Importantly, the gaze of the reader was manipulated in order to be perceived by the listeners as either natural or transformed. In the transformed condition, listeners perceived the reader as either looking always or never at them. When readers fixated the listeners, the latters rated their message as more persuasive and showed better recall of it. In Bente et al. (2007) ’s study, dyads of participants were involved in interactions while being embodied in virtual humans. Interaction partners were shown with the real partner’s gaze behavior or with a manipulated gaze, displaying either longer or shorter eye contact. Participants showing manipulated longer direct gaze were evaluated more positively by their interaction partners. The advantages of CVEs are that feeling of presence and copresence are high (i.e., participants are involved in an interaction with a human partner) and that very specific behaviors can be rendered non-realistically (the so-called transformed social interactions) and thus the consequences of these individual manipulations can be investigated.

Training with Virtual Humans in IVEs

Simulation of social interactions is not only important for research purposes but also for training. For instance, virtual humans can either function as tutors and give performance feedback or they can be used as specific social interaction partners necessary for training. For example, the virtual human can be a recruiter asking the participant job interview questions and the participant trains on giving good answers and making a favorable first impression. The great advantage of using virtual humans for training is that they are constantly available and do not need to be trained, scheduled, or paid. Bailenson et al. (2008 , Study 1), for instance, trained participants in Tai Chi movements using a virtual teacher. Participants reported a more enjoyable learning experience when they had the possibility to see themselves performing next to their teacher performing the movements compared to a condition in which they could see only the teacher. This finding indicates that some features of the interaction, such as having the possibility to compare one’s own movements to those of the teacher, play a crucial role in the learning outcome.

Poeschl and Doering (2012) modeled a virtual audience from real audience data that can be used to provide feedback in fear of public speaking training. Batrinca et al. (2013) also developed an audience composed of virtual humans that can provide feedback online to presenters about their performance. The advantage of using virtual humans is especially important for trainings such as learning how to speak in front of large audiences. It is now possible to simply program a large audience populated with virtual humans without having to recruit many people to be stooges as audience ( Harris et al., 2002 ; Pertaub et al., 2002 ; Thalmann, 2006 ). However, there are investment costs of setting up an IVE laboratory and the programming of the virtual humans and environments. The development of portable systems is a promising venue to make virtual reality more accessible to practitioners.

Immersive virtual environment technology-based training has already been used in clinical settings. Park et al. (2011) created an IVET version of the traditional social skills training based on role-playing. Schizophrenic patients assigned to the IVE condition improved their conversational skills and assertiveness more than patients in the traditional role-playing group, however, the latter was more effective in emotion expression skills. Perez-Marcos et al. (2012) proposed an approach of neurorehabilitation for patients with reduced mobility based on virtual interactions with healthcare providers who are not in the same physical space. Patients and healthcare providers communicate remotely through a multisensory IVE and through haptic devices located at both sites that enable them to interact (see, hear, and touch) as in a real consultation. Some of the proposed tasks are cooperative, meaning that the patients and the doctor need to perform an action together and simultaneously in order to achieve a goal (e.g., cooperate to lift a virtual object). This kind of task increases patients’ feelings of copresence. This system enables the doctors to evaluate patients with motor deficits (e.g., through force feedback) or with neuropathic pain in upper limbs. In addition, a person-to-person interaction with a real doctor, even though remote, could increase motivation of patients to pursue rehabilitation programs and could help patients who are often socially isolated because of their reduced mobility to meet other people (e.g., doctors, nurses, or other patients) in a virtual environment.

Communication with Virtual Humans

One of the biggest challenges in using virtual humans as social interaction partners is to achieve natural communication (e.g., free speech conversation) between participants and virtual humans. In most of the studies to date, the communication from the virtual human to the participant needs to be mediated by the experimenter. So the experimenter listens to what the participant says and then decides when and what the virtual human should respond. Moreover, the virtual human can only respond with behaviors or statements that have been programmed beforehand. Thus, virtual humans’ responses might not be precisely adjusted to participants’ utterances or to the tone of the conversation. As a result, the prosody, the syntax, or the word choice might not sound natural, hampering the flow of the communication. Even though research in IVEs on this topic is scarce, researchers studying interactions with confederates tried to address this issue by adapting scripts to real life conversations. Brown-Schmidt (2012) analyzed and coded conversations between two people who had to collaborate to correctly arrange pieces in a visual game. Based on occurring frequency of different types of answers (e.g., acknowledgment, repetitions) obtained through this analysis, confederates were instructed to use specific answer forms in a subsequent experiment. Likewise, in a picture description task, Branigan et al. (2007) instructed confederates to replicate errors (e.g., use of inappropriate verbs) that were made by naïve speakers in a previous similar task. Similar procedures inspired by real life conversations could be used to make conversations between virtual and real humans more smooth. Even though these methods might improve perceived realism of the communication, they do not assure an optimal adaptation to participants’ utterances.

Another possibility to achieve natural communication is to use confederates to embody virtual humans ( Bailenson et al., 2005 ). Confederates can control the body position of the avatar (non-verbal behavior of the avatar could be standardized to some extent) while communicating in a natural way with participants. This solution would improve communication realism but it is not optimal because vocal non-verbal behavior of confederates might change across participants and therefore influence them, the detrimental effects of which have already been highlighted above.

Part of the reasons why achieving a realistic communication with virtual humans is problematic is that participants can potentially address them with any kind of utterance. One possibility is to “script” the conversation and to provide the participant with prompts so that the conversation flows more naturally. As an example, Schmid Mast et al. (2008) investigated participants in the role of patients interacting with virtual doctors in a virtual medical consultation. Participants were briefed about their symptoms and there were 16 turns between the virtual doctor and the patient and for each turn, the patient had a prompt card instructing him/her what information to deliver to the virtual doctor (e.g., talk about your symptoms, for how long you have had them and how much they affect your daily life). This ensured a smooth flow of the conversation but it was unnatural because no spontaneous remarks or questions were allowed. Another approach was tested by Qu et al. (2013 , Study 2). They used a priming procedure to induce participants to use specific keywords when addressing virtual humans. They exposed participants to videos and pictures hanging on a wall in a virtual room, in which a virtual human asked them four questions on different topics. For example, when the topic was France, a picture of the Arc de Triomphe in Paris hung on a wall behind the virtual human in the priming condition, whereas only distractor pictures were displayed in the control condition. Results show that participants named the content of the videos and pictures significantly more often compared to a condition in which their content was not related to the question asked by the avatar. This priming procedure is promising because it could be combined with automatic keywords recognition and therefore enable virtual humans to respond in appropriate ways to human participants. For instance, when a participant is primed to use a specific keyword and he/she indeed says it during a virtual interaction, this keyword is automatically recognized by the system and triggers a specific response or behavior by the virtual human.

Automatic Extraction of Participant Interaction Behavior in IVEs

Participant interaction behavior in IVEs is sometimes the dependent variable because the behavioral observation is the goal. The use of IVET makes it possible to extract some interpersonal behavior data of participants directly from the simulation because the system uses that information to function. Another method to extract participant interaction behavior is to use social sensing technology, which will be outlined below.

Participant Interaction Behavior Extracted from IVET

There are some participant behaviors that can be measured directly by the IVE system that renders the virtual world. Interpersonal distance is a prime example for such automatic extraction of participant interaction behavior in a virtual encounter. This is because the IVE system constantly detects and monitors the location of the participant in order to render the virtual world in real time. Based on the location information of the participant and the virtual human, which is usually pre-defined by the programmer, interpersonal distance can be computed and registered during the entire social interaction. Interpersonal distance is an important social interaction behavior that can be indicative of approach-avoidance behavior or dominance ( Hall et al., 2005 ).

Another variable that can be recorded by IVET is the actual scene that is visualized by the participants, which might be an indicator of attentional strategies. This measure can be recorded by placing either visible or invisible markers in specific locations of the virtual scene. Given that participants can still move their eyes to focus on specific portions of the visual scene even without moving their heads, visualized scene can be a proxy of gaze direction but does not represent a precise measure.

Behavior Extraction Using Additional Equipment

In the previous section we discussed the use of visualized scene as a measure of attentional strategies within an IVE. The use of eye-tracking systems combined with the IVET allows more precise measures of attentional strategies. Wieser et al. (2010) involved a group of high and low socially anxious female participants in an IVE study in which they were approached by a virtual human. They measured participants’ eye movements and found that highly anxious participants avoided eye contact with male virtual humans.

Other measures, requiring additional equipment, include physiological data (e.g., heart rate, skin conductance response). Slater et al. (2006b) used an electrocardiogram to obtain measures of heart rate and recorded galvanic skin response while involving participants in a social interaction with five virtual humans. Results showed that the physiological measures changed significantly (i.e., faster heart rate and more pronounced skin conductance response) when virtual humans were present in the virtual world and when breaks in presence were elicited (i.e., short moments in time when participants’ subjective feeling of presence was interrupted by suddenly making the virtual world and the avatars vanish).

Given that this information about participant behavior is immediately available as the social interaction unfolds, these measures could be analyzed in real time and used to change or adapt subsequent behavior of a virtual human during an interaction. As an example, participant’s eye movements can be recorded and, for instance, the virtual human could then move to the location of the visual focus of the participant (or away from it, depending on the question under investigation). This data can also be complemented with information from social sensing to gather information about participant behavior.

Even though the devices outlined in this section are relatively non-invasive, the question remains whether their use interferes with participants’ feeling of presence. Indeed, one of the requirements for a virtual environment to be immersive is that information coming from the real world is shut out by a technological device (e.g., a head-mounted display) in order to enable individuals to focus on rendered information ( Slater and Wilbur, 1997 ) and feel presence. For instance, knowing that eye movements are recorded or feeling an electrode on the skin could remind participants that the virtual simulation is fictitious and as a consequence feeling of presence might be reduced. Future research might experimentally investigate whether indeed feelings of presence are influenced by the use of the external devices (e.g., eye-tracking, electrodes) we outline in this section. Sensing via ubiquitous computing (where the there is no direct input from the participant to the sensing device; the sensing is unobtrusive) is by definition non-invasive and might play a more important role for IVET in the future. There are still technological advancements needed in order to make such devices (e.g., a heart rate monitor watch) as accurate as more invasive standard recording methods (e.g., electrodes for heart rate measurement). One emerging field that will play an important role for the study of social interactions in IVET is social sensing.

Social Sensing of Participants in IVEs

Social sensing means the recording of interpersonal behavior from people engaged in social interactions via ubiquitous computing (i.e., no active computer input necessary, the environment is “smart” and registers people’s behavior) and computational models and algorithms for the automated extraction of social cues and for drawing social inferences ( Schmid Mast et al., 2015 ). Unobtrusive social sensing devices are cameras, microphones, and Kinect sensors, among others. Behavioral extraction algorithms are available for different verbal and non-verbal behaviors (e.g., nodding, gesturing, speech time, loudness of voice, interruptions). We predict that social sensing will play an important role in the future development of automatizing the communication between the participant and the virtual human and for training purposes. As an example, imagine that the computer can detect the quality of the speech a participant is delivering in front of a large audience via social sensing. If the quality of the speech is bad, the program will put the virtual humans in the audience gradually to sleep. If the quality of the speech improves, the virtual humans in the audience will start to pay more attention and signal interest by following the participants with their eyes and erecting their posture. This is the goal of Cicero ( Batrinca et al., 2013 ), a system that encompasses the automatic extraction of non-verbal behavior of a presenter through a Kinect device and gives a feedback (e.g., nodding, leaning forward) based on the evaluated (computed) performance (e.g., time spent gazing the audience, amount of pause fillers) through a virtual audience. Even though Cicero is not yet developed within IVET – only on a desktop virtual reality system - it is reasonable to assume that a similar system could be implemented in an IVE.

Another example in this direction comes from Zhang and Yap (2012) who studied automatic affect detection based on participants’ verbal (written) and non-verbal behavior during a virtual role-play. Affect detection in verbal information was performed through latent semantic analysis, which is an algorithm that automatically learns semantic information about words through their common use in natural language ( Landauer and Dumais, 1997 ). Emotional gesture recognition was based on a Kinect device, which extracted emotional content based on a skeleton tracking procedure. To illustrate, a participant placing his/her hand on the head was identified as a signal of confusion.

Virtual humans that show a human-like behavior (i.e., agents that are able to produce sentences and respond to interaction partners in natural conversations) are called embodied conversational agents. Some research has stressed the importance of implementing complex behavior on embodied conversational agents, like multimodal (e.g., facial expressions and body gestures) emotional expressions ( Pelachaud, 2009 ). Malatesta et al. (2009) developed a model to implement Scherer’s appraisal theory ( Scherer, 2001 ) for the elicitation of emotions in embodied conversational agents by using different intensities and timings. In the future, it could be possible to implement subtle facial mimicry responses on virtual humans and study their effect on participants’ behavior.

Conclusions and Future Challenges

As we illustrate in the present article, research on social interaction using IVET has established important results that were hard to achieve before its development. The here presented research is different from the one by Fox et al. (2009) in that we focus on social interactions with virtual humans in IVET whereas the Fox et al. (2009) paper is a broader review of the how IVET can and is used in the social sciences. Moreover, we are faced with a very fast developing research domain because of the frequent technical improvements and increased availability of relatively cheap virtual reality devices which makes an update since 2009 timely. In particular, in the last years more effort has been put into integrating IVET with other technologies, such as eye-tracking ( Wieser et al., 2010 ), movement extraction devices ( Zhang and Yap, 2012 ; Batrinca et al., 2013 ), and EEG ( Kober et al., 2012 ). Moreover, recent studies have started to address the issue of making the conversation between participants and virtual humans smoother ( Malatesta et al., 2009 ; Zhang and Yap, 2012 ). In addition, more studies investigated influences on participants’ behavior, physiological responses, and cognitions either by manipulating objects in the virtual world ( Latu et al., 2013 ; Qu et al., 2013 ), avatars’ behavior ( Llobera et al., 2010 ), or participants’ physical appearance in the virtual world ( Peck et al., 2013 ). Last but not least, new applications have been created for clinical use ( Park et al., 2011 ; Perez-Marcos et al., 2012 ) and for training participants, for instance when delivering a speech ( Batrinca et al., 2013 ).

Even though research using IVET in social interactions has achieved important results, we argue that researchers will need to face some challenges in the next years. There is evidence showing that participants’ psychological and physiological reactions in IVEs are similar to those in the real world ( Bailenson et al., 2003 ; Slater et al., 2006b ). However, people may still react somehow differently with virtual humans compared to real humans. To illustrate, while more simple or automatic behavior (e.g., avoiding a virtual human that is invading a participant’s personal space) might be comparable between real life and IVEs, more subtle or complex behavior (e.g., being kind or appreciative to an interaction partner) could differ. Different solutions might be adopted in order to address this issue. One possibility is to improve verbal and non-verbal behavioral realism of virtual humans. As discussed above, motion quality should be adapted and match pictorial quality of virtual humans in order to avoid participant’s perception of eeriness due to the uncanny valley effect ( Garau et al., 2003 ; Tinwell et al., 2011 ). Non-verbal behavior and motion of virtual humans could be rendered more realistically and more subtly by extracting it from real human motion. The latest blockbuster movies using computer-generated imagery (e.g., Avatar or The Lord of the Rings) might be taken as inspiration for this improvement. Computer-science advances are needed in order to implement very subtle non-verbal behavior (e.g., facial mimicry) on virtual humans and to improve the synchronization and the coordination between verbal and non-verbal behavior. For instance, lips movements should be adapted precisely to the phonic pattern of a verbal message.

In the same vein, while some effort has been made to improve communication between participants and virtual humans, it remains an important challenge for future research. Being able to have a free speech on any topic with a virtual human is the ultimate goal of this research area. Automatic language recognition, affect detection, social sensing, and speech production algorithms should be coordinated in order to achieve this goal.

Last but not least, perceived realism of virtual humans could be improved by implementing more high-level human qualities, such as personality traits, emotions, and theory of mind. Research shows that we form first impressions about strangers from verbal, non-verbal, and appearance cues ( Funder and Colvin, 1988 ). Thus, virtual humans’ verbal behavior, non-verbal behavior, and physical aspect could convey distinctive and congruent information about their personality. An example of this would be an extraverted virtual human with an open body posture who talks a lot and wears a casual dress. This would be an interesting feature not only in order to achieve interaction realism, but also because participant’s behavior in relation to different personality traits could be studied with high experimental control. Furthermore, simulating emotions in virtual humans would be important to make participants experience that their behavior or anything happening in the virtual world can have an impact, either positive or negative, on virtual humans. Finally, simulating in the virtual humans the ability to infer the internal states of others (the so-called Theory of Mind) would increase participants’ feeling that virtual humans can “understand” them. Taken together, the proposed features would improve perceived realism of the interaction and participants’ feeling of copresence.

Conflict of Interest Statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Acknowledgment

We thank Caroline Falconer for her useful comments and insights on an earlier version of the manuscript.

Argyle, M., and Dean, J. (1965). Eye-contact, distance and affiliation. Sociometry 28, 289–304. doi: 10.2307/2786027

PubMed Abstract | CrossRef Full Text | Google Scholar

Bailenson, J. N., Beall, A. C., Loomis, J. M., Blascovich, J., and Turk, M. (2005). Transformed social interaction, augmented gaze, and social influence in immersive virtual environments. Hum. Commun. Res. 31, 511–537. doi: 10.1111/j.1468-2958.2005.tb00881.x

CrossRef Full Text | Google Scholar

Bailenson, J. N., Blascovich, J., Beall, A. C., and Loomis, J. M. (2003). Interpersonal distance in immersive virtual environments. Pers. Soc. Psychol. Bull. 29, 819–833. doi: 10.1177/0146167203029007002

Bailenson, J. N., Patel, K., Nielsen, A., Bajscy, R., Jung, S.-H., and Kurillo, G. (2008). The effect of interactivity on learning physical actions in virtual reality. Media Psychol. 11, 354–376. doi: 10.1080/15213260802285214

Bailenson, J. N., and Yee, N. (2005). Digital chameleons: automatic assimilation of nonverbal gestures in immersive virtual environments. Psychol. Sci. 16, 814–819. doi: 10.1111/j.1467-9280.2005.01619.x

Batrinca, L., Stratou, G., Shapiro, A., Morency, L.-P., and Scherer, S. (2013). “Cicero - Towards a multimodal virtual audience platform for public speaking training,” in Intelligent Virtual Agents , Vol. 8108, eds R. Aylett, B. Krenn, C. Pelachaud, and H. Shimodaira (Heidelberg: Springer Berlin Heidelberg), 116–128.

Google Scholar

Baumeister, R. F., Vohs, K. D., and Funder, D. C. (2007). Psychology as the science of self-reports and finger movements: whatever happened to actual behavior? Perspect. Psychol. Sci. 2, 396–403. doi: 10.1111/j.1745-6916.2007.00051.x

Bente, G., Eschenburg, F., and Aelker, L. (2007). Effects of simulated gaze on social presence, person perception and personality attribution in avatar-mediated communication. Paper Presented at the PRESENCE 2007 , Barcelona.

Biocca, F., and Delaney, B. (1995). “Immersive virtual reality technology,” in Communication in the Age of Virtual Reality , eds F. Biocca and M. Levy (Hillsdale, NJ: Erlbaum), 57–124.

Blascovich, J. (2002). “Social influence within immersive virtual environments,” in The Social Life of Avatars , ed. R. Schroeder (London: Springer London).

Blascovich, J., Loomis, J., Beall, A. C., Swinth, K. R., Hoyt, C. L., and Bailenson, J. N. (2002). Immersive virtual environment technology as a methodological tool for social psychology. Psychol. Inq. 13, 103–124. doi: 10.1207/s15327965pli1302_01

Branigan, H. P., Pickering, M. J., McLean, J. F., and Cleland, A. A. (2007). Syntactic alignment and participant role in dialogue. Cognition 104, 163–197. doi: 10.1016/j.cognition.2006.05.006

Brown-Schmidt, S. (2012). Beyond common and privileged: gradient representations of common ground in real-time language use. Lang. Cogn. Process. 27, 62–89. doi: 10.1080/01690965.2010.543363

Buck, R., Losow, J. I., Murphy, M. M., and Costanzo, P. (1992). Social facilitation and inhibition of emotional expression and communication. J. Pers. Soc. Psychol . 63, 962–968. doi: 10.1037/0022-3514.63.6.962

Congdon, S. P., and Schober, M. F. (2002). How examiners’ discourse cues affect scores on intelligence test. Paper Presented at the 43th Annual Meeting of the Psychonomic Society , Kansas.

Dotsch, R., and Wigboldus, D. H. J. (2008). Virtual prejudice. J. Exp. Soc. Psychol. 44, 1194–1198. doi: 10.1016/j.jesp.2008.03.003

Dyck, M., Winbeck, M., Leiberg, S., Chen, Y., Gur, R. C., and Mathiak, K. (2008). Recognition profile of emotions in natural and virtual faces. PLoS ONE 3:e3628. doi: 10.1371/journal.pone.0003628

Ekman, P., and Friesen, W. (1978). Facial Action Coding System: A Technique for the Measurement of Facial Movement . Palo Alto: Consulting Psychologists Press.

Fox, J., Arena, D., and Bailenson, J. N. (2009). Virtual reality. J. Media Psychol. 21, 95–113. doi: 10.1027/1864-1105.21.3.95

Freeman, J., Avons, S. E., Pearson, D. E., and Ijsselsteijn, W. A. (1999). Effects of sensory information and prior experience on direct subjective ratings of presence. Presence Teleop. Virt. 8, 1–13. doi: 10.1162/105474699566017

Funder, D. C., and Colvin, C. R. (1988). Friends and strangers: acquaintanceship, agreement, and the accuracy of personality judgment. J. Pers. Soc. Psychol . 55, 149–158. doi: 10.1037/0022-3514.55.1.149

Garau, M., Slater, M., Vinayagamoorthy, V., Brogni, A., Steed, A., and Sasse, M. A. (2003). The impact of avatar realism and eye gaze control on perceived quality of communication in a shared immersive virtual environment. Paper Presented at the Proceedings of the SIGCHI Conference on Human Factors in Computing Systems , Ft. Lauderdale, FL. doi: 10.1145/642611.642703

Giannopoulos, E., Wang, Z., Peer, A., Buss, M., and Slater, M. (2010). Comparison of people’s responses to real and virtual handshakes within a virtual environment. Brain Res. Bull . 85, 276–282. doi: 10.1016/j.brainresbull.2010.11.012

Hall, J. A., Coats, E. J., and LeBeau, L. S. (2005). Nonverbal behavior and the vertical dimension of social relations: a meta-analysis. Psychol. Bull. 131, 898–924. doi: 10.1037/0033-2909.131.6.898

Harris, S. R., Kemmerling, R. L., and North, M. M. (2002). Brief virtual reality therapy for public speaking anxiety. Cyberpsychol. Behav. 5, 543–550. doi: 10.1089/109493102321018187

Hartanto, D., Kampmann, I. L., Morina, N., Emmelkamp, P. G. M., Neerincx, M. A., and Brinkman, W.-P. (2014). Controlling social stress in virtual reality environments. PLoS ONE 9:e92804. doi: 10.1371/journal.pone.0092804

Hatfield, E., Cacioppo, J. T., and Rapson, R. L. (1992). “Primitive emotional contagion,” in Emotion and Social Behavior. Review of Personality and Social Psychology , ed. M. S. Clark (Thousand Oaks, CA: Sage Publications), 151–177.

Heeter, C. (1992). Being there: the subjective experience of presence. Presence Teleop. Virt. 1, 262–271.

Hess, U., and Blairy, S. (2001). Facial mimicry and emotional contagion to dynamic emotional facial expressions and their influence on decoding accuracy. Int. J. Psychophysiol . 40, 129–141. doi: 10.1016/S0167-8760(00)00161-6

Hess, U., and Fischer, A. (2013). Emotional mimicry as social regulation. Pers. Soc. Psychol. Rev. 17, 142–157. doi: 10.1177/1088868312472607

Hoyt, C. L., Blascovich, J., and Swinth, K. R. (2003). Social inhibition in immersive virtual environments. Presence Teleop. Virt. 12, 183–195. doi: 10.1162/105474603321640932

Ijsselsteijn, W. A., Lombard, M., and Freeman, J. (2001). Toward a core bibliography of presence. Cyberpsychol. Behav. 4, 317–321. doi: 10.1089/109493101300117983

Jacobson, D. (2001). Presence revisited: Imagination, competence, and activity in text-based virtual worlds. Cyberpsychol. Behav. 4, 653–673. doi: 10.1089/109493101753376605

Kelly, O., Matheson, K., Martinez, A., Merali, Z., and Anisman, H. (2007). Psychosocial stress evoked by a virtual audience: relation to neuroendocrine activity. Cyberpsychol. Behav. 10, 655–662. doi: 10.1089/cpb.2007.9973

Kilteni, K., Bergstrom, I., and Slater, M. (2013). Drumming in immersive virtual reality: the body shapes the way we play. IEEE Trans. Vis. Comput. Graph. 19, 597–605. doi: 10.1109/TVCG.2013.29

Kirschbaum, C., Pirke, K. M., and Hellhammer, D. H. (1993). The ‘Trier Social Stress Test’–a tool for investigating psychobiological stress responses in a laboratory setting. Neuropsychobiology 28, 76–81. doi: 10.1159/000119004

Kober, S. E., Kurzmann, J., and Neuper, C. (2012). Cortical correlate of spatial presence in 2D and 3D interactive virtual reality: an EEG study. Int. J. Psychophysiol . 83, 365–374. doi: 10.1016/j.ijpsycho.2011.12.003

Kuhlen, A. K., and Brennan, S. E. (2013). Language in dialogue: when confederates might be hazardous to your data. Psychon. Bull. Rev. 20, 54–72. doi: 10.3758/s13423-012-0341-8

Landauer, T. K., and Dumais, S. T. (1997). A solution to Plato’s problem: the latent semantic analysis theory of acquisition, induction, and representation of knowledge. Psychol. Rev. 104, 211–240. doi: 10.1037/0033-295X.104.2.211

Latu, I. M., Schmid Mast, M., Lammers, J., and Bombari, D. (2013). Successful female leaders empower women’s behavior in leadership tasks. J. Exp. Soc. Psychol. 49, 444–448. doi: 10.1016/j.jesp.2013.01.003

Lee, K. M. (2004). Presence, explicated. Commun. Theor. 14, 27–50. doi: 10.1111/j.1468-2885.2004.tb00302.x

Llobera, J., Spanlang, B., Ruffini, G., and Slater, M. (2010). Proxemics with multiple dynamic characters in an immersive virtual environment. ACM Trans. Appl. Perce. 8, 1–12. doi: 10.1145/1857893.1857896

Maister, L., Slater, M., Sanchez-Vives, M. V., and Tsakiris, M. (2015). Changing bodies changes minds: owning another body affects social cognition. Trends Cogn. Sci. 19, 6–12. doi: 10.1016/j.tics.2014.11.001

Malatesta, L., Raouzaiou, A., Karpouzis, K., and Kollias, S. (2009). Towards modeling embodied conversational agent character profiles using appraisal theory predictions in expression synthesis. Appl. Intell. 30, 58–64. doi: 10.1007/s10489-007-0076-9

Mantovani, F., and Castelnuovo, G. (2003). “The sense of presence in virtual training: enhancing skills acquisition and transfer of knowledge through learning experience in virtual environments,” in Being There: Concepts, Effects and Measurement of User Presence in Synthetic Environments , eds G. Riva, F. Davide, and W. A. Ijsselsteijn (Amsterdam: Ios Press).

Mehl, M. R., and Pennebaker, J. W. (2003). The sounds of social life: a psychometric analysis of students’ daily social environments and natural conversations. J. Pers. Soc. Psychol. 84, 857–870. doi: 10.1037/0022-3514.84.4.857

Milgram, S. (1963). Behavioral study of obedience. J. Abnorm. Soc. Psychol. 67, 371–378. doi: 10.1037/h0040525

Mori, M. (1970). Bukimi no tani [The uncanny valley]. Energy 7, 33–35.

Palmer, M. T. (1995). “Interpersonal communication and virtual reality: mediating interpersonal relationship,” in Communication in the Age of Virtual Reality , eds F. Biocca and M. Levy (Hillsdale, NJ: Erlbaum), 277–302.

Park, K. M., Ku, J., Choi, S. H., Jang, H. J., Park, J. Y., Kim, S. I., et al. (2011). A virtual reality application in role-plays of social skills training for schizophrenia: a randomized, controlled trial. Psychiatry Res. 189, 166–172. doi: 10.1016/j.psychres.2011.04.003

Patterson, M. L. (1976). An arousal model of interpersonal intimacy. Psychol. Rev. 83, 235–245. doi: 10.1037/0033-295x.83.3.235

Patterson, M. L., Webb, A., and Schwartz, W. (2002). Passing encounters: patterns of recognition and avoidance in pedestrians. Basic Appl. Soc. Psychol. 24, 57–66. doi: 10.1207/s15324834basp2401_5

Peck, T. C., Seinfeld, S., Aglioti, S. M., and Slater, M. (2013). Putting yourself in the skin of a black avatar reduces implicit racial bias. Conscious. Cogn. 22, 779–787. doi: 10.1016/j.concog.2013.04.016

Pelachaud, C. (2009). Modelling multimodal expression of emotion in a virtual agent. Philos. Trans. R. Soc. Lond. B Biol. Sci. 364, 3539–3548. doi: 10.1098/rstb.2009.0186

Perez-Marcos, D., Solazzi, M., Steptoe, W., Oyekoya, O., Frisoli, A., Weyrich, T., et al. (2012). A fully immersive set-up for remote interaction and neurorehabilitation based on virtual body ownership. Front. Neurol. 3:110. doi: 10.3389/fneur.2012.00110

Pertaub, D.-P., Slater, M., and Barker, C. (2002). An experiment on public speaking anxiety in response to three different types of virtual audience. Presence Teleop. Virt. 11, 68–78. doi: 10.1162/105474602317343668

Poeschl, S., and Doering, N. (2012). Designing virtual audiences for fear of public speaking training - an observation study on realistic nonverbal behavior. Stud. Health Technol. Inform. 181, 218–222.

PubMed Abstract | Google Scholar

Qu, C., Brinkman, W.-P., Ling, Y., Wiggers, P., and Heynderickx, I. (2014). Conversations with a virtual human: synthetic emotions and human responses. Comput. Human Behav. 34, 58–68. doi: 10.1016/j.chb.2014.01.033

Qu, C., Brinkman, W.-P., Wiggers, P., and Heynderickx, I. (2013). The effect of priming pictures and videos on a question–answer dialog scenario in a virtual environment. Presence Teleop. Virt. 22, 91–109. doi: 10.1162/PRES_a_00143

Roter, D. L., Hall, J. A., and Aoki, Y. (2002). Physician gender effects in medical communication: a meta-analytic review. JAMA 288, 756–764. doi: 10.1001/jama.288.6.756

Santos-Ruiz, A., Peralta-Ramirez, M. I., Garcia-Rios, M. C., Muñoz, M. A., Navarrete-Navarrete, N., and Blazquez-Ortiz, A. (2010). Adaptation of the trier social stress test to virtual reality: psycho-physiological and neuroendocrine modulation. J. Cyber. Ther. Rehabil. 3, 405–415.

Scherer, K. R. (2001). “Appraisal considered as a process of multi-level sequential checking,” in Appraisal Processes in Emotion: Theory, Methods, Research , eds K. R. Scherer, A. Schorr, and T. Johnstone (New York and Oxford: Oxford University Press), 92–120.

Schmid Mast, M., Gatica-Perez, D., Frauendorfer, D., Nguyen, L., and Choudhury, T. (2015). Social sensing for psychology: automated interpersonal behavior assessment. Curr. Dir. Psychol. Sci. 24, 154–160. doi: 10.1177/0963721414560811

Schmid Mast, M., Hall, J. A., and Roter, D. L. (2008). Caring and dominance affect participants’ perceptions and behaviors during a virtual medical visit. J. Gen. Intern. Med . 23, 523–527. doi: 10.1007/s11606-008-0512-5

Schuemie, M. J., van der Straaten, P., Krijn, M., and van der Mast, C. A. (2001). Research on presence in virtual reality: a survey. Cyberpsychol. Behav. 4, 183–201. doi: 10.1089/109493101300117884

Seyama, J. I., and Nagayama, R. S. (2007). The uncanny valley: effect of realism on the impression of artificial human faces. Presence Teleop. Virt. 16, 337–351. doi: 10.1162/pres.16.4.337

Slater, M., Antley, A., Davison, A., Swapp, D., Guger, C., Barker, C., et al. (2006a). A virtual reprise of the Stanley Milgram obedience experiments. PLoS ONE 1:e39. doi: 10.1371/journal.pone.0000039

Slater, M., Guger, C., Edlinger, G., Leeb, R., Pfurtscheller, G., Antley, A., et al. (2006b). Analysis of physiological responses to a social situation in an immersive virtual environment. Presence Teleop. Virt. 15, 553–569. doi: 10.1162/pres.15.5.553

Slater, M., and Steed, A. (2002). “Meeting people virtually: experiments in shared virtual environments,” in The Social Life of Avatars , ed. R. Schroeder (London: Springer London), 146–171.

Slater, M., and Wilbur, S. (1997). A framework for immersive virtual environments (FIVE): speculations on the role of presence in virtual environments. Presence Teleop. Virt. 6, 603–616.

Thalmann, D. (2006). Populating virtual environments with crowds. Paper Presented at the Proceedings of the 2006 ACM International Conference on Virtual Reality Continuum and its Applications , Hong Kong. doi: 10.1145/1128923.1128925

Tinwell, A., Grimshaw, M., Nabi, D. A., and Williams, A. (2011). Facial expression of emotion and perception of the Uncanny Valley in virtual characters. Comput. Human Behav. 27, 741–749. doi: 10.1016/j.chb.2010.10.018

Topal, J., Gergely, G., Miklosi, A., Erdohegyi, A., and Csibra, G. (2008). Infants’ perseverative search errors are induced by pragmatic misinterpretation. Science 321, 1831–1834. doi: 10.1126/science.1161437

Vinayagamoorthy, V., Steed, A., and Slater, M. (2008). The impact of a character posture model on the communication of affect in an immersive virtual environment. IEEE Trans. Vis. Comput. Graph. 14, 965–982. doi: 10.1109/tvcg.2008.62

Wieser, M. J., Pauli, P., Grosseibl, M., Molzow, I., and Mühlberger, A. (2010). Virtual social interactions in social anxiety–the impact of sex, gaze, and interpersonal distance. Cyberpsychol. Behav. Soc. Netw. 13, 547–554. doi: 10.1089/cyber.2009.0432

Yee, N., and Bailenson, J. (2007). The proteus effect: the effect of transformed self-representation on behavior. Hum. Commun. Res. 33, 271–290. doi: 10.1111/j.1468-2958.2007.00299.x

Zhang, L., and Yap, B. (2012). Affect detection from text-based virtual improvisation and emotional gesture recognition. Adv. Human Comput. Interact . 2012:12. doi: 10.1155/2012/461247

Keywords : social interaction, immersive virtual environment, virtual humans, avatars, copresence

Citation: Bombari D, Schmid Mast M, Canadas E and Bachmann M (2015) Studying social interactions through immersive virtual environment technology: virtues, pitfalls, and future challenges. Front. Psychol. 6:869. doi: 10.3389/fpsyg.2015.00869

Received: 28 January 2015; Accepted: 12 June 2015; Published: 24 June 2015.

Reviewed by:

Copyright © 2015 Bombari, Schmid Mast, Canadas and Bachmann. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY) . The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Dario Bombari, Department of Organizational Behavior, Faculty of Business and Economics, University of Lausanne, Quartier UNIL-Dorigny, Internef Building, CH-1015 Lausanne, Switzerland, [email protected]

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

Social Constructivism—Jerome Bruner

  • First Online: 09 September 2020

Cite this chapter

social interaction theory research paper

  • Miia Rannikmäe 3 ,
  • Jack Holbrook 3 &
  • Regina Soobard 3  

Part of the book series: Springer Texts in Education ((SPTE))

22k Accesses

14 Citations

2 Altmetric

This chapter considers the similarities between Constructivism and Social Constructivism, seen as two learning theories sharing a multitude of underlying assumptions. Major aspects, more specific to Social Constructivism, such as an emphasis on the collaborative nature of learning and the importance of a cultural and social context, are elaborated within the frame of an ‘education through science’ paradigm. Bruner’s ideas are introduced, especially emphasising the role of the teacher and instruction, plus different processes used by learners in undertaking problem-solving and socio-scientific decision-making. The need and constraints for curriculum change, initiated by Bruner, based on the notion that learning is a social process in which students construct new ideas, are discussed from different stakeholder viewpoints, building on students’ current knowledge and experiences. A theoretically justified case study, carried out under the framework of a research project, funded by the European Commission, is introduced. This focuses on a student relevant and motivational science teaching module, using the concept of contextualisation, re-contextualisation and de-contextualisation stages of the learning frame and encompassing a science for all philosophy associated with ‘education through science’. The uniqueness of the case study is the student’s socio-constructivist input into creating a scenario, which becomes the frame for initiating the overall teaching module, created by science educators. Also discussed is how a social constructivist approach covers different levels: learning and teaching. The overall conclusion leads to the recommendation that social constructivism, as an educational theory, needs greater acknowledgement in science education circles.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
  • Available as EPUB and PDF
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Bruner, J. S. (1960). The process of education. Harvard University Press.

Google Scholar  

Bruner, J. S. (1966). The process of education . New York: Vintage.

Bruner, J. S. (1997). A narrative model of self-construction. Annals of the New York Academy of Sciences, 818 (1), 145–161.

Article   Google Scholar  

Bruner, J. S. (2009). The process of education, Revised Edition . Harvard University Press.

Capel, S., Leask, M., & Turner, T. (2000). Learning to teach in the secondary school. A companion to school experience (2nd ed.). Great Britain: TJ International Ltd.

Cooper, P. A. (1993). Paradigm shifts in designed instruction: From behaviorism to cognitivism to constructivism. Educational technology, 33 (5), 12–19.

Estonian Government. (2011). Gümnaasiumi riiklik õppekava (National curriculum for gymnasium). Regulation of the Government of the Republic of Estonia, No. 2 . Estonia: Tallinn.

Gredler, M. E. (1997). Learning and instruction: Theory into practice (3rd ed.). Upper Saddle River, NJ: Prentice-Hall.

Holbrook, J., & Rannikmae, M. (2007). Nature of science education for enhancing scientific literacy. International Journal of Science Education, 29 (11), 1347–1362.

Holbrook, J., & Rannikmäe, M. (2009). The meaning of scientific literacy. International Journal of Environmental & Science Education, 4 (3), 275–288.

Holbrook, J. & Rannikmäe, M. (2010). Contextualisation, de-contextualisation, re-contextualisation—A science teaching approach to enhance meaningful learning for scientific literacy. In: I. Eilks, & B. Ralle (Eds.), Contemporary science education (pp. 69–82). Shaker Verlag.

Jordan, A., Carlile, O., & Stack, A. (2008). Approaches to learning: A guide for teachers . Berkshire: McGraw-Hill, Open University Press.

Kukla, A. (2000). Social constructivism and the philosophy of science . New York: Routledge.

MultiCo. (2015 ). Promoting youth scientific career awareness and its attractiveness through multi-stakeholder cooperation. Retrieved from http://www.multico-project.eu/ .

OECD (2016). PISA 2015 results (Vol. 1): Excellence and equity in education . Paris: OECD Publishing.

Roberts, D. A. (2007). Scientific literacy/science literacy. In S. K. Abell & N. G. Lederman (Eds.), Handbook of research on science education (pp. 729–780). Mahwah: Lawrence Erlbaum Associates.

Ryan, R. M., & Deci, E. L. (2002). An overview of self-determination theory. In E. L. Deci & R. M. Ryan (Eds.), Handbook of self-determination research (pp. 3–33). Rochester, NY: University of Rochester Press.

Schunk, D. (2012). Learning theories: An educational Perspective (6th ed.). Boston, MA: Pearson Education.

Scott, P., Asoko, H., & Leach, J. (2007). Student conceptions and conceptual learning in science. In S. K. Abell & N. G. Lederman (Eds.), Handbook of research on science education (pp. 31–56). United States of America: Lawrence Erlbaum Associates.

Smith, M. K. (2002). Jerome S. Bruner and the process of education. The encyclopedia of informal education. Retrieved from http://infed.org/mobi/jerome-bruner-and-the-process-of-education/ .

Taber, K. S. (2011). Constructivism as educational theory: Contingency in learning, and optimally guided instruction. In J. Hassaskhah (Ed.), educational theory (pp. 39–61). New York: Nova.

Vygotsky, L. S. (1962). Thought and language . Cambridge, MA: MIT Press.

Book   Google Scholar  

Vygotsky, L. S. (1978). Problems of method. Mind in society (M. Cole, Trans.). Cambridge, MA: Harvard University Press.

Wertsch, J. V. (1997). Vygotsky and the formation of the mind . MA: Cambridge Press.

Download references

Author information

Authors and affiliations.

Centre for Science Education, University of Tartu, Tartu, Estonia

Miia Rannikmäe, Jack Holbrook & Regina Soobard

You can also search for this author in PubMed   Google Scholar

Corresponding author

Correspondence to Jack Holbrook .

Editor information

Editors and affiliations.

Science Teachers Association of Nigeria, Abuja, Nigeria

University of Texas at Tyler, Tyler, TX, USA

Teresa J. Kennedy

Rights and permissions

Reprints and permissions

Copyright information

© 2020 Springer Nature Switzerland AG

About this chapter

Rannikmäe, M., Holbrook, J., Soobard, R. (2020). Social Constructivism—Jerome Bruner. In: Akpan, B., Kennedy, T.J. (eds) Science Education in Theory and Practice. Springer Texts in Education. Springer, Cham. https://doi.org/10.1007/978-3-030-43620-9_18

Download citation

DOI : https://doi.org/10.1007/978-3-030-43620-9_18

Published : 09 September 2020

Publisher Name : Springer, Cham

Print ISBN : 978-3-030-43619-3

Online ISBN : 978-3-030-43620-9

eBook Packages : Education Education (R0)

Share this chapter

Anyone you share the following link with will be able to read this content:

Sorry, a shareable link is not currently available for this article.

Provided by the Springer Nature SharedIt content-sharing initiative

  • Publish with us

Policies and ethics

  • Find a journal
  • Track your research
  • Vygotsky’s Theory of Cognitive Development

Saul Mcleod, PhD

Editor-in-Chief for Simply Psychology

BSc (Hons) Psychology, MRes, PhD, University of Manchester

Saul Mcleod, PhD., is a qualified psychology teacher with over 18 years of experience in further and higher education. He has been published in peer-reviewed journals, including the Journal of Clinical Psychology.

Learn about our Editorial Process

Olivia Guy-Evans, MSc

Associate Editor for Simply Psychology

BSc (Hons) Psychology, MSc Psychology of Education

Olivia Guy-Evans is a writer and associate editor for Simply Psychology. She has previously worked in healthcare and educational sectors.

On This Page:

Sociocultural Theory 

The work of Lev Vygotsky (1934, 1978) has become the foundation of much research and theory in cognitive development over the past several decades, particularly what has become known as sociocultural theory.

Vygotsky’s theory comprises concepts such as culture-specific tools, private speech, and the zone of proximal development.

Vygotsky believed cognitive development is influenced by cultural and social factors. He emphasized the role of social interaction in the development of mental abilities e.g., speech and reasoning in children.

Vygotsky strongly believed that community plays a central role in the process of “making meaning.”

Cognitive development is a socially mediated process in which children acquire cultural values, beliefs, and problem-solving strategies through collaborative dialogues with more knowledgeable members of society.

The more knowledgeable other (MKO) is someone who has a higher level of ability or greater understanding than the learner regarding a particular task, process, or concept.

The MKO can be a teacher, parent, coach, or even a peer who provides guidance and modeling to enable the child to learn skills within their zone of proximal development (the gap between what a child can do independently and what they can achieve with guidance).

The interactions with more knowledgeable others significantly increase not only the quantity of information and the number of skills a child develops, but also affects the development of higher-order mental functions such as formal reasoning. Vygotsky argued that higher mental abilities could only develop through interaction with more advanced others.

According to Vygotsky, adults in society foster children’s cognitive development by engaging them in challenging and meaningful activities. Adults convey to children how their culture interprets and responds to the world.

They show the meaning they attach to objects, events, and experiences. They provide the child with what to think (the knowledge) and how to think (the processes, the tools to think with).

Vygotsky’s theory encourages collaborative and cooperative learning between children and teachers or peers. Scaffolding and reciprocal teaching are effective educational strategies based on Vygotsky’s ideas.

Scaffolding involves the teacher providing support structures to help students master skills just beyond their current level. In reciprocal teaching, teachers and students take turns leading discussions using strategies like summarizing and clarifying. Both scaffolding and reciprocal teaching emphasize the shared construction of knowledge, in line with Vygotsky’s views.

Vygotsky highlighted the importance of language in cognitive development. Inner speech is used for mental reasoning, and external speech is used to converse with others.

These operations occur separately. Indeed, before age two, a child employs words socially; they possess no internal language.

Once thought and language merge, however, the social language is internalized and assists the child with their reasoning. Thus, the social environment is ingrained within the child’s learning.

Effects of Culture

Vygotsky emphasized the role of the social environment in the child’s cognitive development.

Vygotsky claimed that infants are born with the basic abilities for intellectual development called “elementary mental functions” (Piaget focuses on motor reflexes and sensory abilities). These develop throughout the first two years of life due to direct environmental contact.

Elementary mental functions include –

o Attention o Sensation o Perception o Memory

Eventually, through interaction within the sociocultural environment, these are developed into more sophisticated and effective mental processes, which Vygotsky refers to as “higher mental functions.”

Tools of intellectual adaptation

Each culture provides its children with tools of intellectual adaptation that allow them to use basic mental functions more effectively/adaptively.

Tools of intellectual adaptation is Vygotsky’s term for methods of thinking and problem-solving strategies that children internalize through social interactions with the more knowledgeable members of society.

For example, memory in young children is limited by biological factors. However, culture determines the type of memory strategy we develop.

For example, in Western culture, children learn note-taking to aid memory, but in pre-literate societies, other strategies must be developed, such as tying knots in a string to remember, carrying pebbles, or repeating the names of ancestors until large numbers can be repeated.

Vygotsky, therefore, sees cognitive functions, even those carried out alone, as affected by the beliefs, values, and tools of intellectual adaptation of the culture in which a person develops and, therefore, socio-culturally determined.

Therefore, intellectual adaptation tools vary from culture to culture – as in the memory example.

Social Influences on Cognitive Development

Like Piaget, Vygotsky believes that young children are curious and actively involved in their own learning and discovering and developing new understandings/schema .

However, Vygotsky emphasized social contributions to the development process, whereas Piaget emphasized self-initiated discovery.

According to Vygotsky (1978), much important learning by the child occurs through social interaction with a skillful tutor. The tutor may model behaviors and/or provide verbal instructions for the child.

Vygotsky refers to this as cooperative or collaborative dialogue. The child seeks to understand the actions or instructions provided by the tutor (often the parent or teacher) and then internalizes the information, using it to guide or regulate their performance.

Shaffer (1996) gives the example of a young girl given her first jigsaw. Alone, she performs poorly in attempting to solve the puzzle. The father then sits with her and describes or demonstrates some basic strategies, such as finding all the corner/edge pieces, and provides a couple of pieces for the child to put together herself, and offers encouragement when she does so.

As the child becomes more competent, the father allows the child to work more independently. According to Vygotsky, this social interaction involving cooperative or collaborative dialogue promotes cognitive development.

To understand Vygotsky’s theories on cognitive development, one must understand two of the main principles of Vygotsky’s work: the More Knowledgeable Other (MKO) and the Zone of Proximal Development (ZPD).

More Knowledgeable Other

The more knowledgeable other (MKO) is somewhat self-explanatory; it refers to someone who has a better understanding or a higher ability level than the learner, concerning a particular task, process, or concept.

Although the implication is that the MKO is a teacher or an older adult, this is not necessarily the case. Often, a child’s peers or an adult’s children may be the individuals with more knowledge or experience.

For example, who is more likely to know more about the newest teenage music groups, how to win at the most recent PlayStation game, or how to correctly perform the newest dance craze – a child or their parents?

In fact, the MKO need not be a person at all. To support employees in their learning process, some companies are now using electronic performance support systems.

Electronic tutors have also been used in educational settings to facilitate and guide students through learning. The key to MKOs is that they must have (or be programmed with) more knowledge about the topic being learned than the learner does.

Zone of Proximal Development

The concept of the more knowledgeable other relates to the second important principle of Vygotsky’s work, the zone of proximal development .

This important concept relates to the difference between what a child can achieve independently and what a child can achieve with guidance and encouragement from a skilled partner.

Vygotsky consequently focuses much more closely on social interaction as an aid to learning, arguing that, left alone, children will develop – but not to their full potential.

He refers to the gap between actual and potential learning as the zone of proximal development (ZPD) – and argues that it is only through collaboration with adults and other learners that this gap can be bridged.

Vygotsky

The zone of proximal development is the gap between the level of actual development, what the child can do on his own, and the level of potential development, what a child can do with the assistance of more advanced and competent individuals.

Social interaction, therefore, supports the child’s cognitive development in the ZPD, leading to a higher level of reasoning. It is generally believed that social dialogues have two important features.

The first is intersubjectivity, where two individuals who might have different understandings of a task, arrive at a shared understanding by adjusting to the perspective of the other.

The second feature is referred to as scaffolding. Adults may begin with direct instruction, but as children’s mastery of a task increases, so the adult tends to withdraw their own contributions in recognition of the child’s increasing success.

For example, the child could not solve the jigsaw puzzle (in the example above) by itself and would have taken a long time to do so (if at all), but was able to solve it following interaction with the father, and has developed competence at this skill that will be applied to future jigsaws.

ZPD is the zone where instruction is the most beneficial, as it is when the task is just beyond the individual’s capabilities. To learn, we must be presented with tasks just out of our ability range. Challenging tasks promote maximum cognitive growth.

As a result of shared dialogues with more knowledgeable others, who provide hints, instructions, and encouragement, the child can internalize the ‘how to do it’ part of the task as part of their inner or private speech. The child can then use this on later occasions when they tackle a similar task on their own.

Vygotsky (1978) sees the Zone of Proximal Development as the area where the most sensitive instruction or guidance should be given – allowing the child to develop skills they will then use on their own – developing higher mental functions.

Vygotsky also views peer interaction as an effective way of developing skills and strategies.  He suggests that teachers use cooperative learning exercises where less competent children develop with help from more skillful peers – within the zone of proximal development.

Evidence for Vygotsky and the ZPD

Freund (1990) conducted a study in which children had to decide which items of furniture should be placed in particular areas of a doll’s house.

Some children were allowed to play with their mother in a similar situation before they attempted it alone (zone of proximal development) while others were allowed to work on this by themselves (Piaget’s discovery learning).

Freund found that those who had previously worked with their mother (ZPD) showed the greatest improvement compared with their first attempt at the task.

The conclusion is that guided learning within the ZPD led to greater understanding/performance than working alone (discovery learning).

Vygotsky and Language

Vygotsky believed that language develops from social interactions for communication purposes. Vygotsky viewed language as man’s greatest tool for communicating with the outside world.

According to Vygotsky (1962), language plays two critical roles in cognitive development:
  • It is the main means by which adults transmit information to children.
  • Language itself becomes a very powerful tool for intellectual adaptation.
Vygotsky (1987) differentiates between three forms of language:
  • Social speech, which is external communication used to talk to others (typical from the age of two);
  • Private speech (typical from the age of three) which is directed to the self and serves an intellectual function;
  • Private speech goes underground , diminishing in audibility as it takes on a self-regulating function and is transformed into silent inner speech (typical from the age of seven).

For Vygotsky, thought and language are initially separate systems from the beginning of life, merging at around three years of age.

At this point, speech and thought become interdependent: thought becomes verbal, and speech becomes representational.

As children develop mental representation, particularly the skill of language, they start to communicate with themselves in much the same way as they would communicate with others.

When this happens, children’s monologues are internalized to become inner speech. The internalization of language is important as it drives cognitive development.

“Inner speech is not the interiour aspect of external speech – it is a function in itself. It still remains speech, i.e., thought connected with words. But while in external speech thought is embodied in words, in inner speech words dies as they bring forth thought. Inner speech is to a large extent thinking in pure meanings.” (Vygotsky, 1962: p. 149)

Private Speech

Vygotsky (1987) was the first psychologist to document the importance of private speech.

He considered private speech as the transition point between social and inner speech, the moment in development where language and thought unite to constitute verbal thinking.

Thus, in Vygotsky’s view, private speech was the earliest manifestation of inner speech. Indeed, private speech is more similar (in form and function) to inner speech than social speech.

Private speech is “typically defined, in contrast to social speech, as speech addressed to the self (not to others) for the purpose of self-regulation (rather than communication).” (Diaz, 1992, p.62)

Private speech is overt, audible, and observable, often seen in children who talk to themselves while problem-solving.

Conversely, inner speech is covert or hidden because it happens internally. It is the silent, internal dialogue that adults often engage in while thinking or problem-solving.

In contrast to Piaget’s (1959) notion of private speech representing a developmental dead-end, Vygotsky (1934, 1987) viewed private speech as:

“A revolution in development which is triggered when preverbal thought and preintellectual language come together to create fundamentally new forms of mental functioning.” (Fernyhough & Fradley, 2005: p. 1)

In addition to disagreeing on the functional significance of private speech, Vygotsky and Piaget also offered opposing views on the developmental course of private speech and the environmental circumstances in which it occurs most often (Berk & Garvin, 1984).

Piaget

Through private speech, children collaborate with themselves, in the same way a more knowledgeable other (e.g., adults) collaborate with them to achieve a given function.

Vygotsky sees “private speech” as a means for children to plan activities and strategies, aiding their development. Private speech is the use of language for self-regulation of behavior.

Therefore, language accelerates thinking/understanding ( Jerome Bruner also views language in this way). Vygotsky believed that children who engage in large amounts of private speech are more socially competent than children who do not use it extensively.

Vygotsky (1987) notes that private speech does not merely accompany a child’s activity but acts as a tool the developing child uses to facilitate cognitive processes, such as overcoming task obstacles, and enhancing imagination, thinking, and conscious awareness.

Children use private speech most often during intermediate difficulty tasks because they attempt to self-regulate by verbally planning and organizing their thoughts (Winsler et al., 2007).

The frequency and content of private speech correlate with behavior or performance. For example, private speech appears functionally related to cognitive performance: It appears at times of difficulty with a task.

For example, tasks related to executive function (Fernyhough & Fradley, 2005), problem-solving tasks (Behrend et al., 1992), and schoolwork in both language (Berk & Landau, 1993), and mathematics (Ostad & Sorensen, 2007).

Berk (1986) provided empirical support for the notion of private speech. She found that most private speech exhibited by children serves to describe or guide the child’s actions.

Berk also discovered that children engaged in private speech more often when working alone on challenging tasks and when their teacher was not immediately available to help them.

Furthermore, Berk also found that private speech develops similarly in all children regardless of cultural background.

There is also evidence (Behrend et al., 1992) that those children who displayed the characteristic whispering and lip movements associated with private speech when faced with a difficult task were generally more attentive and successful than their ‘quieter’ classmates.

Vygotsky (1987) proposed that private speech is a product of an individual’s social environment. This hypothesis is supported by the fact that there exist high positive correlations between rates of social interaction and private speech in children.

Children raised in cognitively and linguistically stimulating environments (situations more frequently observed in higher socioeconomic status families) start using and internalizing private speech faster than children from less privileged backgrounds.

Indeed, children raised in environments characterized by low verbal and social exchanges exhibit delays in private speech development.

Children’s use of private speech diminishes as they grow older and follows a curvilinear trend. This is due to changes in ontogenetic development whereby children can internalize language (through inner speech) to self-regulate their behavior (Vygotsky, 1987).

For example, research has shown that children’s private speech usually peaks at 3–4 years of age, decreases at 6–7, and gradually fades out to be mostly internalized by age 10 (Diaz, 1992).

Vygotsky proposed that private speech diminishes and disappears with age not because it becomes socialized, as Piaget suggested, but because it goes underground to constitute inner speech or verbal thought” (Frauenglass & Diaz, 1985).

Educational Implications

Vygotsky’s approach to child development is a form of social constructivism , based on the idea that cognitive functions are the products of social interactions.

Social constructivism posits that knowledge is constructed and learning occurs through social interactions within a cultural and historical context.

Vygotsky emphasized the collaborative nature of learning by constructing knowledge through social negotiation. He rejected the assumption made by Piaget that it was possible to separate learning from its social context.

Vygotsky believed everything is learned on two levels. First, through interaction with others, then integrated into the individual’s mental structure.

Every function in the child’s cultural development appears twice: first, on the social level, and later, on the individual level; first, between people (interpsychological) and then inside the child (intrapsychological). This applies equally to voluntary attention, to logical memory, and to the formation of concepts. All the higher functions originate as actual relationships between individuals. (Vygotsky, 1978, p.57)

Teaching styles grounded in constructivism represent a deliberate shift from traditional, didactic, memory-oriented transmission models (Cannella & Reiff, 1994) to a more student-centered approach.

Traditionally, schools have failed to foster environments where students actively participate in their own and their peers’ education. Vygotsky’s theory, however, calls for both the teacher and students to assume non-traditional roles as they engage in collaborative learning.

Rather than having a teacher impose their understanding onto students for future recitation, the teacher should co-create meaning with students in a manner that allows learners to take ownership (Hausfather, 1996).

For instance, a student and teacher might start a task with varying levels of expertise and understanding. As they adapt to each other’s perspective, the teacher must articulate their insights in a way that the student can comprehend, leading the student to a fuller understanding of the task or concept.

The student can then internalize the task’s operational aspect (“how to do it”) into their inner speech or private dialogue. Vygotsky referred to this reciprocal understanding and adjustment process as intersubjectivity.”

Because Vygotsky asserts that cognitive change occurs within the zone of proximal development, instruction would be designed to reach a developmental level just above the student’s current developmental level.

Vygotsky proclaims, “learning which is oriented toward developmental levels that have already been reached is ineffective from the viewpoint of the child’s overall development. It does not aim for a new stage of the developmental process but rather lags behind this process” (Vygotsky, 1978).

Appropriation is necessary for cognitive development within the zone of proximal development. Individuals participating in peer collaboration or guided teacher instruction must share the same focus to access the zone of proximal development.

“Joint attention and shared problem solving is needed to create a process of cognitive, social, and emotional interchange” (Hausfather,1996).

Furthermore, it is essential that the partners be on different developmental levels and the higher-level partner be aware of the lower’s level. If this does not occur or one partner dominates, the interaction is less successful (Driscoll, 1994; Hausfather, 1996).

Vygotsky’s theories also feed into the current interest in collaborative learning, suggesting that group members should have different levels of ability so more advanced peers can help less advanced members operate within their ZPD.

Scaffolding and reciprocal teaching are effective strategies to access the zone of proximal development.

Reciprocal Teaching

A contemporary educational application of Vygotsky’s theory is “reciprocal teaching,” used to improve students” ability to learn from text.

In this method, teachers and students collaborate in learning and practicing four key skills: summarizing, questioning, clarifying, and predicting. The teacher’s role in the process is reduced over time.

Reciprocal teaching allows for the creation of a dialogue between students and teachers. This two-way communication becomes an instructional strategy by encouraging students to go beyond answering questions and engage in the discourse (Driscoll, 1994; Hausfather, 1996).

A study conducted by Brown and Palincsar (1989) demonstrated the Vygotskian approach with reciprocal teaching methods in their successful program to teach reading strategies.

The teacher and students alternated turns leading small group discussions on a reading. After modeling four reading strategies, students began to assume the teaching role.

The results showed significant gains over other instructional strategies (Driscoll, 1994; Hausfather,1996).

Cognitively Guided Instruction is another strategy to implement Vygotsky’s theory. This strategy involves the teacher and students exploring math problems and then sharing their problem-solving strategies in an open dialogue (Hausfather,1996).

Based on Vygotsky’s theory, the physical classroom would provide clustered desks or tables and workspace for peer instruction, collaboration, and small-group instruction. Learning becomes a reciprocal experience for the students and teacher.

Like the environment, the instructional design of the material to be learned would be structured to promote and encourage student interaction and collaboration. Thus the classroom becomes a community of learning.

Scaffolding

Also, Vygotsky’s theory of cognitive development on learners is relevant to instructional concepts such as “scaffolding” and “apprenticeship,” in which a teacher or more advanced peer helps to structure or arrange a task so that a novice can work on it successfully.

A teacher’s role is to identify each individual’s current level of development and provide them with opportunities to cross their ZPD.

A crucial element in this process is the use of what later became known as scaffolding; the way in which the teacher provides students with frameworks and experiences which encourage them to extend their existing schemata and incorporate new skills, competencies, and understandings.

Scaffolding describes the conditions that support the child’s learning, to move from what they already know to new knowledge and abilities.

Scaffolding requires the teacher to allow students to extend their current skills and knowledge.

During scaffolding, the support offered by an adult (or more knowledgeable other) gradually decreases as the child becomes more skilled in the task.

As the adult withdraws their help, the child assumes more of the strategic planning and eventually gains competence to master similar problems without a teacher’s aid or a more knowledgeable peer.

It is important to note that this is more than simply instruction; learning experiences must be presented in such a way as to actively challenge existing mental structures and provide frameworks for learning.

Five ways in which an adult can “scaffold” a child’s learning:

  • Engaging the child’s interest
  • Maintaining the child’s interest in the task e.g., avoiding distraction and providing clear instructions on how to start the task.
  • Keeping the child’s frustration under control e.g., by supportive interactions, adapting instructions according to where the child is struggling.
  • Emphasizing the important features of the task
  • Demonstrating the task: showing the child how to do the task in simple, clear steps.

As the child progresses through the ZPD, the necessary scaffolding level declines from 5 to 1.

The teacher must engage students’ interests, simplify tasks to be manageable, and motivate students to pursue the instructional goal.

In addition, the teacher must look for discrepancies between students” efforts and the solution, control for frustration and risk, and model an idealized version of the act (Hausfather, 1996).

Challenges to Traditional Teaching Methods

Vygotsky’s social development theory challenges traditional teaching methods. Historically, schools have been organized around recitation teaching.

The teacher disseminates knowledge to be memorized by the students, who in turn recite the information to the teacher (Hausfather,1996).

However, the studies described above offer empirical evidence that learning based on the social development theory facilitates cognitive development over other instructional strategies.

The structure of our schools does not reflect the rapid changes our society is experiencing. The introduction and integration of computer technology in society has tremendously increased the opportunities for social interaction.

Therefore, the social context for learning is transforming as well. Whereas collaboration and peer instruction were once only possible in shared physical space, learning relationships can now be formed from distances through cyberspace.

Computer technology is a cultural tool that students can use to meditate and internalize their learning. Recent research suggests changing the learning contexts with technology is a powerful learning activity (Crawford, 1996).

If schools continue to resist structural change, students will be ill-prepared for the world they will live.

Critical Evaluation

Vygotsky’s work has not received the same level of intense scrutiny that Piaget’s has, partly due to the time-consuming process of translating Vygotsky’s work from Russian.

Also, Vygotsky’s sociocultural perspective does not provide as many specific hypotheses to test as Piaget’s theory, making refutation difficult, if not impossible.

Perhaps the main criticism of Vygotsky’s work concerns the assumption that it is relevant to all cultures. Rogoff (1990) dismisses the idea that Vygotsky’s ideas are culturally universal and instead states that scaffolding- heavily dependent on verbal instruction – may not be equally useful in all cultures for all types of learning.

Indeed, in some instances, observation and practice may be more effective ways of learning certain skills.

There is much emphasis on social interaction and culture, but many other aspects of development are neglected, such as the importance of emotional factors, e.g., the joys of success and the disappointments and frustration of failure act as motivation for learning.

Vygotsky overemphasized socio-cultural factors at the expense of biological influences on cognitive development. This theory cannot explain why cross-cultural studies show that the stages of development (except the formal operational stage ) occur in the same order in all cultures suggesting that cognitive development is a product of a biological process of maturation.

Vygotky’s theory has been applied successfully to education. Scaffolding has been shown to be an effective way of teaching (Freund, 1990), and based on this theory, teachers are trained to guide children from what they can do to the next step in their learning through careful scaffolding.

Collaborative work is also used in the classroom, mixing children of different levels of ability to make use of reciprocal / peer teaching.

Vygotsky vs. Piaget

Unlike Piaget’s notion that children’s cognitive development must necessarily precede their learning, Vygotsky argued, “learning is a necessary and universal aspect of the process of developing culturally organized, specifically human psychological function” (1978, p. 90).  In other words, social learning precedes (i.e., come before) development.

Differences betwee Vygotsky and Piaget In Psychology

Vygotsky’s theory differs from that of Piaget in several important ways:

Vygotsky places more emphasis on culture affecting cognitive development.

Unlike Piaget, who emphasized universal cognitive change (i.e., all children would go through the same sequence of cognitive development regardless of their cultural experiences), Vygotsky leads us to expect variable development depending on cultural diversity. 

This contradicts Piaget’s view of universal stages of development (Vygotsky does not refer to stages like Piaget does).

Hence, Vygotsky assumes cognitive development varies across cultures, whereas Piaget states cognitive development is mostly universal across cultures.

Vygotsky places considerably more emphasis on social factors contributing to cognitive development.

  • Vygotsky states the importance of cultural and social context for learning. Cognitive development stems from social interactions from guided learning within the zone of proximal development as children and their partners co-construct knowledge. In contrast, Piaget maintains that cognitive development stems largely from independent explorations in which children construct knowledge.
  • For Vygotsky, the environment in which children grow up will influence how they think and what they think about. The importance of scaffolding and language may differ for all cultures. Rogoff (1990) emphasizes the importance of observation and practice in pre-industrial societies (e.g., learning to use a canoe among Micronesian Islanders).

Vygotsky places more (and different) emphasis on the role of language in cognitive development.

According to Piaget , language depends on thought for its development (i.e., thought comes before language). For Vygotsky, thought and language are initially separate systems from the beginning of life, merging at around three years of age, producing verbal thought (inner speech).

In Piaget’s theory, egocentric (or private) speech gradually disappears as children develop truly social speech, in which they monitor and adapt what they say to others.

Vygotsky disagreed with this view, arguing that as language helps children to think about and control their behavior, it is an important foundation for complex cognitive skills.

As children age, this self-directed speech becomes silent (or private) speech, referring to the inner dialogues we have with ourselves as we plan and carry out activities.

For Vygotsky, cognitive development results from an internalization of language.

According to Vygotsky, adults are an important source of cognitive development.

Adults transmit their culture’s tools of intellectual adaptation that children internalize.

In contrast, Piaget emphasizes the importance of peers, as peer interaction promotes social perspective-taking.

Behrend, D.A., Rosengren, K.S., & Perlmutter, M. (1992). The relation between private speech and parental interactive style. In R.M. Diaz & L.E. Berk (Eds.), Private speech: From social interaction to self-regulation (pp. 85–100) . Hillsdale, NJ: Erlbaum.

Berk, L. E. (1986). Relationship of elementary school children’s private speech to behavioral accompaniment to task, attention, and task performance. Developmental Psychology, 22(5) , 671.

Berk, L. & Garvin, R. (1984). Development of private speech among low-income Appalachian children. Developmental Psychology, 20(2) , 271-286.

Berk, L. E., & Landau, S. (1993). Private speech of learning-disabled and normally achieving children in classroom academic and laboratory contexts. Child Development, 64 , 556–571.

Cannella, G. S., & Reiff, J. C. (1994). Individual constructivist teacher education: Teachers as empowered learners . Teacher education quarterly , 27-38.

Crawford, K. (1996) Vygotskian approaches to human development in the information era. Educational Studies in Mathematics, (31) ,43-62.

Diaz, R. M., & Berk, L. E. (1992). Private speech: From social interaction to self-regulation. Lawrence Erlbaum.

Driscoll, M. P. (1994). Psychology of Learning for Instruction . Needham, Ma: Allyn && Bacon.

Frauenglass, M. & Diaz, R. (1985). Self-regulatory functions of children’s private speech: A critical analysis of recent challenges to Vygotsky’s theory. Developmental Psychology, 21(2) , 357-364.

Fernyhough, C., & Fradley, E. (2005). Private speech on an executive task: Relations with task difficulty and task performance . Cognitive Development, 20 , 103–120.

Freund, L. S. (1990). Maternal regulation of children’s problem-solving behavior and its impact on children’s performance . Child Development, 61 , 113-126.

Hausfather, S. J. (1996). Vygotsky and Schooling: Creating a Social Contest for learning. Action in Teacher Education, (18) ,1-10.

Ostad, S. A., & Sorensen, P. M. (2007). Private speech and strategy-use patterns: Bidirectional comparisons of children with and without mathematical difficulties in a developmental perspective. Journal of Learning Disabilities, 40 , 2–14.

Piaget, J. (1959). The language and thought of the child (Vol. 5) . Psychology Press.

Rogoff, B. (1990).  Apprenticeship in thinking: Cognitive development in social context . Oxford university press.

Saettler, P. (1990). The Evolution of American Educational Technology . Egnlewood, Co: Libraries Unlimited.

Schaffer, R. (1996) . Social development. Oxford: Blackwell.

Vygotsky, L. S. (1962). Thought and language. Cambridge MA: MIT Press.

Vygotsky, L. S. (1978). Mind in society: The development of higher psychological processes . Cambridge, MA: Harvard University Press.

Vygotsky, L. S. (1987). Thinking and speech. In R.W. Rieber & A.S. Carton (Eds.), The collected works of L.S. Vygotsky, Volume 1: Problems of general psychology (pp. 39–285) . New York: Plenum Press. (Original work published 1934.)

Winsler, A., Abar, B., Feder, M. A., Schunn, C. D., & Rubio, D. A. (2007). Private speech and executive functioning among high-functioning children with autistic spectrum disorders. Journal of Autism and Developmental Disorders, 37 , 1617-1635.

Wertsch, J. V., Sohmer, R. (1995). Vygotsky on learning and development. Human Development, (38), 332-37.

Further Reading

What is vygotsky’s theory.

Vygotsky believed that cognitive development was founded on social interaction. According to Vygotsky, much of what children acquire in their understanding of the world is the product of collaboration.

How is Vygotsky’s theory applied in teaching and learning?

Vygotsky’s theory has profound implications for classroom learning. Teachers guide, support, and encourage children, yet also help them to develop problem-solving strategies that can be generalized to other situations.

Children learn best not when they are isolated, but when they interact with others, particularly more knowledgeable others who can provide the guidance and encouragement to master new skills.

What was Vygotsky’s best know concept?

Lev Vygotsky was a seminal Russian psychologist best known for his sociocultural theory. He constructed the idea of a zone of proximal development ,  which are those tasks which are too difficult for a child to solve alone but s/he can accomplish with the help of adults or more skilled peers.

Vygotsky has developed a sociocultural approach to cognitive development. He developed his theories at around the same time as  Jean Piaget  was starting to develop his ideas (1920’s and 30″s), but he died at the age of 38, and so his theories are incomplete – although some of his writings are still being translated from Russian.

Like Piaget, Vygotsky could be described as a  constructivist , in that he was interested in knowledge acquisition as a cumulative event – with new experiences and understandings incorporated into existing cognitive frameworks.

However, while Piaget’s theory is structural (arguing that physiological stages govern development), Vygotsky denies the existence of any guiding framework independent of culture and context.

No single principle (such as Piaget’s equilibration) can account for development. Individual development cannot be understood without reference to the social and cultural context within which it is embedded. Higher mental processes in the individual have their origin in social processes.

What is Vygotsky’s Social Development Theory?

Vygotsky’s Social Development Theory is often referred to as the Sociocultural Theory.

Vygotsky’s Social Development Theory posits that social interaction is fundamental to cognitive development. Vygotsky emphasized the influence of cultural and social contexts on learning, claiming that knowledge is constructed through social collaboration.

His most known concept, the Zone of Proximal Development, refers to the difference between what a learner can do independently and what they can achieve with guidance.

Print Friendly, PDF & Email

COMMENTS

  1. (PDF) Social Interaction

    studies, theory papers, and applied work, many of which have a focus on social interaction. Journal of Intercultural Communication This journal grew out of the work by the Nordic Network for ...

  2. A review of theories and methods in the science of face-to-face social

    Face-to-face social interaction depends on dynamic integration and coordination of verbal and non-verbal information. In this Review, Hadley et al. describe the ways that social interaction ...

  3. (Re)Introducing Vygotsky's Thought: From Historical Overview to

    The first core principle of Vygotsky's approach to the research of psychological functioning is the application of a systems perspective. Vygotsky continuously stressed that it is a mistake to study psychological functions individually, e.g., to study development of memory or development of perception.

  4. Learning from others is good, with others is better: the role of social

    Overall, these papers demonstrate the importance of social interaction in young children's word-learning. The critical role of social interaction for optimal language development is relatively unsurprising, considering how heavily human language relies on the 'social brain' and vice versa [41,55,56].

  5. Face2face: advancing the science of social interaction

    Such systems can provide an interesting test of theories of social interaction, but the paper warns that they must be built carefully and with due consideration of the types of mentalizing required for the interaction, if they are to provide a valid test of our theories. ... 2022 Face-to-face dialogue: theory, research, and applications.

  6. Frontiers

    The Relation component describes the bond between the actor and the partner(s), which can affect or be affected by the perception of the social interaction (Horwitz et al., 1997; Stafford et al., 2011). During a social interaction the behaviors of the interaction partners refer to each other (Reis et al., 1980). These interdependent behaviors ...

  7. The effect of social interaction quantity and quality on depressed mood

    The effect of trait social interaction quantity was small and non-significant (b = −0.07, 95% CI = −0.14 to 0.01); however, state social interaction quantity was negatively associated with depressed mood (b = −0.14, 95% CI = −0.15 to −0.12), an effect that was slightly attenuated on days where individuals perceived higher ...

  8. Symbols, meaning, and action: The past, present, and future of symbolic

    Symbolic interactionism is a theoretical perspective in sociology that addresses the manner in which society is created and maintained through face-to-face, repeated, meaningful interactions among individuals. This article surveys past theory and research in the interactionist tradition.

  9. Frontiers

    The here presented research is different from the one by Fox et al. (2009) in that we focus on social interactions with virtual humans in IVET whereas the Fox et al. (2009) paper is a broader review of the how IVET can and is used in the social sciences. Moreover, we are faced with a very fast developing research domain because of the frequent ...

  10. Applications of Vygotsky's sociocultural approach for teachers

    Abstract. This paper outlines an approach to teachers' professional development (PD) that originates in Vygotsky's sociocultural theory (SCT), arguing that what Vygotsky claimed about students' learning in the school setting is applicable to the teachers and that the developmental theories of Vygotsky resting on the notions of social origin of mental functions, unity of behavior and ...

  11. Social Constructivism—Jerome Bruner

    Jerome Bruner has, arguably, given the latest and most updated influence into widening social constructivism and in highlighting its value in modern societies. This chapter starts with brief introduction of Bruner's work, followed by a comparison of constructivism, Bruner's work and a comparison of constructivism and social constructivism.

  12. The Connection Prescription: Using the Power of Social Interactions and

    Social connection is a pillar of lifestyle medicine. Humans are wired to connect, and this connection affects our health. From psychological theories to recent research, there is significant evidence that social support and feeling connected can help people maintain a healthy body mass index, control blood sugars, improve cancer survival, decrease cardiovascular mortality, decrease depressive ...

  13. Symbolic Interaction and Applied Social Research

    Abstract. In symbolic interaction, a traditional yet unfortunate and unnecessary distinction has been made between basic and applied research. The argument has been made that basic research is intended to generate new knowledge, whereas applied research is intended to apply knowledge to the solution of practical (social and organizational ...

  14. [PDF] Social interdependence: interrelationships among theory, research

    Social interdependence theory is a classic example of the interaction among theory, research, and practice that the way in which goals are structured determines how individuals interact, which creates outcomes. Social interdependence theory is a classic example of the interaction among theory, research, and practice. The premise of the theory is that the way in which goals are structured ...

  15. (PDF) Application of Lev Vygotsky's Sociocultural ...

    p> The current study endeavours to explore the application of the Vygotskian sociocultural approach to students' cognitive development, particularly as related to the employment of experiential ...

  16. PDF A Theory of Social Interactions

    A Theory of Social Interactions Gary S. Becker University of Chicago and National Bureau of Economic Research This essay uses simple tools of economic theory to analyze interactions between the behavior of some persons and different characteristics of other persons. Although these interactions are emphasized in the

  17. Vygotsky's Theory of Cognitive Development

    Lev Vygotsky's theory of child development, known as the sociocultural theory, emphasizes the importance of social interaction and cultural context in learning and cognitive development. Vygotsky proposed the concept of the Zone of Proximal Development (ZPD), which is the gap between what a child can do independently and what they can achieve with guidance from a more knowledgeable person.

  18. Social influence research in consumer behavior: What we learned and

    These theories include; social influence theory, social norms theory, social support theory, social comparison theory, regulatory focus theory, and an elaboration likelihood model. Researchers frequently combine any two of these theories to precisely investigate and better understand the relationship between social influence and consumer behavior.

  19. Social relationships, interactions and learning in early childhood

    Social relationships and interactions are crucial for social, emotional, and cognitive learning processes. Extensive research has demonstrated that warm and supportive interactions and relationships significantly contribute to successful learning in early childhood - both in families and in ECCE institutions such as preschools and daycare centres (Bradley, Citation 2019; Burchinal, Peisner ...

  20. [PDF] A Theory of Social Interactions

    Published in Journal of Political Economy 1 June 1974. Economics, Sociology. This essay incorporates a general treatment of social interactions into the modern theory of consumer demand. Section 1 introduces the topic and explores some of the existing perspectives on social interactions and their importance in the basic structure of wants.