U.S. flag

An official website of the United States government

The .gov means it’s official. Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

The site is secure. The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

  • Publications
  • Account settings
  • My Bibliography
  • Collections
  • Citation manager

Save citation to file

Email citation, add to collections.

  • Create a new collection
  • Add to an existing collection

Add to My Bibliography

Your saved search, create a file for external citation management software, your rss feed.

  • Search in PubMed
  • Search in NLM Catalog
  • Add to Search

Social media platforms: a primer for researchers

Affiliations.

  • 1 Department of Internal Medicine No. 2, Danylo Halytsky Lviv National Medical University, Lviv, Ukraine.
  • 2 Departments of Rheumatology and Research and Development, Dudley Group NHS Foundation Trust (Teaching Trust of the University of Birmingham, UK), Russells Hall Hospital, Dudley, West Midlands, UK.
  • PMID: 33976459
  • PMCID: PMC8103414
  • DOI: 10.5114/reum.2021.102707

Social media platforms play an increasingly important role in research, education, and clinical practice. As an inseparable part of open science, these platforms may increase the visibility of research outputs and facilitate scholarly networking. The editors who ethically moderate Twitter, Facebook, and other popular social media accounts for their journals may engage influential authors in the post-publication communication and expand societal implications of their publications. Several social media aggregators track and generate alternative metrics which can be used by researchers for visualizing trending articles in their fields. More and more publishers showcase their achievements by displaying such metrics along with traditional citations. The Scopus database also tracks both metrics to offer a comprehensive coverage of the indexed articles' impact. Understanding the advantages and limitations of various social media channels is essential for actively contributing to the post-publication communication, particularly in research-intensive fields such as rheumatology.

Keywords: periodicals as topic; publication ethics; rheumatology; social media.

Copyright: © 2021 Narodowy Instytut Geriatrii, Reumatologii i Rehabilitacji w Warszawie.

PubMed Disclaimer

Conflict of interest statement

The authors declare no conflict of interest.

Similar articles

  • Presence of social media mentions for vascular surgery publications is associated with an increased number of literature citations. Chau M, Ramedani S, King T, Aziz F. Chau M, et al. J Vasc Surg. 2021 Mar;73(3):1096-1103. doi: 10.1016/j.jvs.2020.09.029. Epub 2020 Oct 17. J Vasc Surg. 2021. PMID: 33080327 Review.
  • Social media for research, education and practice in rheumatology. Zimba O, Radchenko O, Strilchuk L. Zimba O, et al. Rheumatol Int. 2020 Feb;40(2):183-190. doi: 10.1007/s00296-019-04493-4. Epub 2019 Dec 20. Rheumatol Int. 2020. PMID: 31863133 Review.
  • Social Media as a Means to Disseminate and Advocate Cardiovascular Research: Why, How, and Best Practices. Lee G, Choi AD, Michos ED. Lee G, et al. Curr Cardiol Rev. 2021;17(2):122-128. doi: 10.2174/1573403X15666191113151325. Curr Cardiol Rev. 2021. PMID: 31729303 Free PMC article. Review.
  • Measuring the social impact of nursing research: An insight into altmetrics. Dardas LA, Woodward A, Scott J, Xu H, Sawair FA. Dardas LA, et al. J Adv Nurs. 2019 Jul;75(7):1394-1405. doi: 10.1111/jan.13921. Epub 2019 Jan 24. J Adv Nurs. 2019. PMID: 30507052 Review.
  • Alternative Metrics ("Altmetrics") for Assessing Article Impact in Popular General Radiology Journals. Rosenkrantz AB, Ayoola A, Singh K, Duszak R Jr. Rosenkrantz AB, et al. Acad Radiol. 2017 Jul;24(7):891-897. doi: 10.1016/j.acra.2016.11.019. Epub 2017 Feb 27. Acad Radiol. 2017. PMID: 28256440
  • Ethics for Disseminating Health-Related Information on YouTube. Zimba O, Gasparyan AY, Qumar AB. Zimba O, et al. J Korean Med Sci. 2024 Feb 26;39(7):e93. doi: 10.3346/jkms.2024.39.e93. J Korean Med Sci. 2024. PMID: 38412615 Free PMC article. No abstract available.
  • The Ebb and Flow of Social Media for Researchers. Saygılı S, Yıldız M. Saygılı S, et al. Turk Arch Pediatr. 2023 Sep;58(5):456-457. doi: 10.5152/TurkArchPediatr.2023.232208. Turk Arch Pediatr. 2023. PMID: 37670548 Free PMC article. No abstract available.
  • Bibliometric analysis of global literature productivity in systemic lupus erythematosus from 2013 to 2022. Xie X, Yu H, He Y, Li M, Yin F, Zhang X, Yang Q, Wei G, Chen H, He C, He Y, Chen J. Xie X, et al. Clin Rheumatol. 2024 Jan;43(1):175-187. doi: 10.1007/s10067-023-06728-z. Epub 2023 Sep 5. Clin Rheumatol. 2024. PMID: 37668951
  • Evaluation of YouTube Videos on Hepatocellular Carcinoma. Kaplan K, Solak Y. Kaplan K, et al. J Korean Med Sci. 2023 Feb 20;38(7):e50. doi: 10.3346/jkms.2023.38.e50. J Korean Med Sci. 2023. PMID: 36808545 Free PMC article.
  • Bibliometric and altmetric analysis of research relating to antiphospholipid syndrome based on VOS viewer (2011-2021). He Y, Li M, Yu H, Yin F, Zhang X, Yang Q, Xie X, Wei G, Chen H, He C, He Y, Chen J. He Y, et al. Clin Rheumatol. 2023 May;42(5):1285-1295. doi: 10.1007/s10067-022-06485-5. Epub 2023 Jan 5. Clin Rheumatol. 2023. PMID: 36604357
  • Syrkiewicz-Świtała M, Romaniuk P, Strzelecka A, et al. Prospects for the Use of Social Media Marketing Instruments in Health Promotion by Polish Marshal Offices. Front Public Health. 2018;6:65. doi: 10.3389/fpubh.2018.00065. - DOI - PMC - PubMed
  • Zeng R, Li M. Social Media Use for Health Communication by the CDC in Mainland China: National Survey Study 2009–2020. J Med Internet Res. 2020;22:e19470. doi: 10.2196/19470. - DOI - PMC - PubMed
  • Lien CH, Cao Y. Examining WeChat users’ motivations, trust, attitudes, and positive word-of-mouth: Evidence from China. Comput Hum Behav. 2014;41:104–111. doi: 10.1016/j.chb.2014.08.013. - DOI
  • Wang J, Zhou Y, Zhang W, et al. Concerns Expressed by Chinese Social Media Users During the COVID-19 Pandemic: Content Analysis of Sina Weibo Microblogging Data. J Med Internet Res. 2020;22:e22152. doi: 10.2196/22152. - DOI - PMC - PubMed
  • Gupta L, Gasparyan AY, Zimba O, Misra DP. Scholarly publishing and journal targeting in the time of the Coronavirus Disease 2019 (COVID-19) pandemic: a cross-sectional survey of rheumatologists and other specialists. Rheumatol Int. 2020;40:2023–2030. doi: 10.1007/s00296-020-04718-x. - DOI - PMC - PubMed

LinkOut - more resources

Full text sources.

  • Europe PubMed Central
  • PubMed Central
  • Termedia Publishing House Ltd.

Other Literature Sources

  • scite Smart Citations

Research Materials

  • NCI CPTC Antibody Characterization Program

full text provider logo

  • Citation Manager

NCBI Literature Resources

MeSH PMC Bookshelf Disclaimer

The PubMed wordmark and PubMed logo are registered trademarks of the U.S. Department of Health and Human Services (HHS). Unauthorized use of these marks is strictly prohibited.

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here .

Loading metrics

Open Access

Peer-reviewed

Research Article

Social media usage to share information in communication journals: An analysis of social media activity and article citations

Roles Conceptualization, Data curation, Formal analysis, Funding acquisition, Investigation, Methodology, Project administration, Resources, Software, Supervision, Validation, Visualization, Writing – original draft, Writing – review & editing

* E-mail: [email protected]

Affiliation Department of Radio Television and Cinema, Selcuk University, Konya, Turkey

ORCID logo

  • Yasemin Özkent

PLOS

  • Published: February 9, 2022
  • https://doi.org/10.1371/journal.pone.0263725
  • Reader Comments

Table 1

Social media has surrounded every area of life, and social media platforms have become indispensable for today’s communication. Many journals use social media actively to promote and disseminate new articles. Its use to share the articles contributes many benefits, such as reaching more people and spreading information faster. However, there is no consensus in the studies that to evaluate between tweeted and non-tweeted papers regarding their citation numbers. Therefore, it was aimed to show the effect of social media on the citations of articles in the top ten communication-based journals. For this purpose, this work evaluated original articles published in the top 10 communication journals in 2018. The top 10 communication-based journals were chosen based on SCImago Journal & Country Rank (cited in 2019). Afterward, it was recorded the traditional citation numbers (Google Scholar and Thompson-Reuters Web of Science) and social media exposure of the articles in January 2021 (nearly three years after the articles’ publication date). It was assumed that this period would allow the impact of the published articles (the citations and Twitter mentions) to be fully observed. Based on this assessment, a positive correlation between exposure to social media and article citations was observed in this study.

Citation: Özkent Y (2022) Social media usage to share information in communication journals: An analysis of social media activity and article citations. PLoS ONE 17(2): e0263725. https://doi.org/10.1371/journal.pone.0263725

Editor: Marcelo Hermes-Lima, Universidade de Brasilia, BRAZIL

Received: May 3, 2021; Accepted: January 25, 2022; Published: February 9, 2022

Copyright: © 2022 Yasemin Özkent. This is an open access article distributed under the terms of the Creative Commons Attribution License , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Data Availability: The data underlying the results presented in the study are available from: https://github.com/yaseminozkent/minimal-data-set.git .

Funding: The authors received no specific funding for this work.

Competing interests: The authors have declared that no competing interests exist.

Introduction

The social network has become a tool for bringing people together, allowing individuals to list the users they are connected to, and to see other users’ connections [ 1 ]. Social media platforms (blogs, social networking sites, microblogging, etc.) contain all Web 2.0-based services. Social media has surrounded every area of life, and social media platforms are indispensable for today’s communication [ 2 ]. Scientists from various fields frequently use SoMe, especially Twitter, in most of their professional activities [ 3 ].

Scientists often use social media platforms to produce and debate ideas, share real-time information, spread their research, and find collaborators [ 2 , 4 ]. The way information is collected, disseminated and consumed has been significantly changed because social media by it is encompassing and easily accessible. There has been a significant increase in the number of studies related to social media with an increase in the use of the Internet [ 5 ]. The increased use of social media has also significantly affected how research is spread. Circulated articles through social media are more visible than not circulated articles [ 6 ]. Various scientific studies have examined this relationship, and most have found a positive correlation between article citations and Twitter exposure [ 7 – 9 ].

Studies focusing on new media technologies in connection with the digital age since the 2000s have taken an important place in communication studies [ 10 ]. In their extensive studies on research topics in communication journals, Elisabeth Günther and Emese Domahidi (2017) observed that the Internet and social media have become the most important focus for communication research, in parallel with classical media, such as TV or newspaper [ 10 ]. Social media is an important field of study and practice in both interdisciplinary and communication fields [ 11 – 14 ]. Social media research is encouraged in the field of communication, as people today present themselves through digitally networked platforms. Therefore, this study aimed to demonstrate the effect of social media on citation numbers of articles in communication-based journals. The relationship between the traditional citation numbers of articles and social media posts was analyzed in present study. Thus, it was aimed to shed light on the relationship between social media usage and the number of article citations in the field of communication.

Literature review

The emergence of social media took place simultaneously with Web 2.0. With the introduction of Web 2.0 into our lives, the Internet has become individualized, and use of social networks increased gradually. The Internet has become an interactive virtual world from a read-only state, and it has brought a different dimension to communication [ 12 ]. Today, social media is a wide network of interactions where people from many areas [ 15 ]. In particular, Internet has become a part of life due to the widespread use of smartphones. The majority of people actively use social media in daily life [ 16 ]. A recent study has stated that 70% of peoples in the USA have at least one social media account nowadays. The peoples over 65 years old of 62% have a social media account and they are regularly on social media. This observation is also similar for adolescents. The using of social networks has been reported as 77% for teenagers aged 13–16 years in 25 European countries [ 16 , 17 ]. These social network users interact for an average of more than 2.4 hours a day on social media [ 18 ]. This increased instant interaction has further increased the use of social media. Beside, social media platforms allows independent sharing, regardless of age, venue, and gender. Thus, information spreads rapidly across a wide area [ 15 , 18 ]. Additionally, social media platforms ensure simple interaction pathways between people, companies, and scientists without leaving the desk. Therefore, many scientists use social media in their personal or professional lives [ 19 ].

The most commonly used platform for the purpose of spreading science is Twitter [ 20 ]. Twitter is the most popular microblogging platform nowadays. This platform allows the publication of short messages by its users and enables them to communicate with each other. Evan Williams, Biz Stone, and Jack Dorsey created Twitter in March 2006 and brought into use in July 2006. Twitter has become one of the 10 most visited websites in 2013 and was defined as “the SMS of the Internet” [ 21 , 22 ]. In 2019, it was reported that there were 330 million monthly and 145 million daily active Twitter users. Nowadays, it was reached 339 million users in 2020 [ 23 ]. The number of Twitter users has been increasing daily. Today, Twitter ranks as the world’s second most widely used social network. Twitter users can follow a conversation and discuss a topic using messages named “tweets.” Tweets are constrained to 140 characters of text. Later, this limit was increased to 280 characters. Twitter allows the sharing of photos or videos [ 24 ]. Twitter was initially used to share news about the lives of celebrities. Afterward, it reached a broader audience quickly, especially with the participation of famous names and the involvement of political campaigns. On average, approximately 98 thousand tweets are sent every minute on the platform, allowing an excellent interaction. Given the rise in popularity of Twitter, its use is increasing in all parts of society [ 25 ]. This widespread use has also caught the attention of the scientific community. The use of Twitter as a tool for the dissemination of academic articles has soon become the focus of attention in the scientific world.

Academic output has increased gradually worldwide. Therefore, eliminating the relevant from the irrelevant has become essential for the scientific world. Thus, an impact scale has been necessitated the published articles [ 26 ]. The primary impact of the published article has been measured by its citations [ 26 ]. However, article citations have recently become questionable due to negative factors, such as the slow process of identifying truly impactful articles. The long wait time required the emergence of the articles’ importance has led to need for an alternative metric scale [ 27 ].

Today, non-traditional metrics, altmetrics, are increasingly used to measure the real-time reach and influence of a scientific article [ 26 , 28 ]. The term altmetrics was first proposed by Jason Priem in 2010 [ 29 ]. Thereafter, it has gained wide use in highlighting previously unknown and unrecognized scholarly impact metrics of studies [ 26 , 30 ]. The alternative metric scores play a role in complementing traditional metrics or indicators [ 31 ]. Many publishers, such as SAGE, Taylor-Francis Group, Elsevier, Nature Publishing Group, and Public Library of Science provide much information to their readers by their altmetrics evaluating system [ 32 ]. These altmetrics are calculated by various methods, including “Altmetric” and “Plum Analytics” [ 33 ]. All altmetrics that are an alternative to the traditional citation system provide a score for the research output.

These scores have become attractive for researchers [ 34 ]. Many scientists believe that these alternative metric scores show the real impact of published articles [ 35 – 37 ]. Many web-based platforms play a significant role in obtaining an altmetric score [ 11 , 31 ]. With the data obtained from these platforms, a digital score was acquired for academic output [ 11 ]. All altmetrics are based on the using social media and other online tools for disseminating scholarly information [ 38 ]. The use of social media platforms contributes significantly the spread of shared information in a wider environment. Thus, the sharing of academic output on social media accounts can reach more people faster by eliminating the waiting period in the traditional citation system [ 39 , 40 ]. They have also allowed the impact of articles to be more immediately determined, contrary to traditional citation metrics [ 41 ].

However, some concerns remain that the altmetric score can be manipulated [ 42 , 43 ]. In particular, the use of automatic bot can affect the altmetric scores of articles [ 44 ]. Further, Twitter accounts can affect the results of shared articles. These account holders may be social workers, companies, or politicians, and would have more followers than others [ 45 , 46 ]. Thus, some researchers suggest the use of “alt‑index” to measure the social visibility of scientific research [ 6 ]. Similarly, Haustein et al. argued that social media metrics could not actually be regarded as alternatives to traditional citations; hence, they proposed these metrics as promoters of traditional citations [ 11 ]. The authors suggested the use of “Twitter Coupling” to deal with these concerns [ 38 ]. Although this may be a solution there is a consensus in many studies that social media usage will increase the impact of academic papers, thus ignoring these concerns [ 47 , 48 ].

The potential of social media platforms to connect with other fields raises various scientific questions [ 49 ]. Therefore, the papers in the social sciences and humanities are more often found on social media platforms [ 11 ]. Today, social interaction is so intertwined with media that it is not possible to separate social media from the media sector. Thus, most studies related to social media have been published in the communication sciences. This increased usage of these social networks has led to the research question, “How are altmetrics and citation measures related in communication journals?” Tonia et al. (2016) stated that there were no statistical differences between tweeted and non-tweeted papers regarding their citation numbers [ 50 ]. Costas et al. (2015) found only weak correlations in citations suggested by altmetrics and traditional citation analysis [ 35 ]. Further, some articles can be more attractive than other on social media platforms. Hence, some researchers have argued that there is a difference between social impact and real impact [ 11 ]. However, Thelwall et al. (2013) found that altmetrics was associated with citation counts [ 51 ]. Another study stated that there is a positive relationship between social media posts and academic citations [ 52 ]. Similarly, Shuai et al. (2012) detected significant correlations between tweets and early citations on 4,606 pre-prints articles [ 8 ]. Nevertheless, there is no consensus in the studies that to evaluate between tweeted and non-tweeted papers regarding their citation numbers in communication science, which is the scientific field mostly associated with social media and Twitter. The studies related to social media are encouraged in the field of communication as people present themselves through digitally networked platforms today. Therefore, it was hypothesized that there might be a correlation between Twitter posts and traditional citations the articles in the top ten communication-based journals.

Materials and methods

Study design.

The present study was designed as a retrospective cross-sectional study. The aim was to examine the effect of Twitter and other social media platforms on academic citations. Therefore, the top ten communication-based journals [ 9 ] were evaluated based on the SCImago Journal & Country Rank (cited in 2019). It was used the SCImago Journal & Country Rank search field to select journals and filtered only “communication” journals. The selection criteria were the top ten communication-based journals according to their SCImago Journal Rank indicator. This indicator is a measure of journals’ academic impact and accounts for both the number of citations received by a journal and the importance or prestige of the journals in which the citations [ 53 ]. The impact factor of these journals was ≥ 2, and the quartile (Q) index was Q1. All ten journals had similar indexes and similar impact factors. It was assumed that this would reduce the potential for unwanted variation differences in social media activity.

Impact Factor: The impact factor has been defined as an indicator of academic journal. It reflects the year’s average number of citations per paper published during the preceding two years. It is often used as a relative indicator of a journal’s importance in its field. The journals with high impact factors are thought to be more important than those with low impact factors [ 54 ].

Q index: The journals’ rank in each specific category is separated into quartiles by the Journal Citation Report and SCImago Journal and Country Rank: Q1, Q2, Q3 and Q4. Q1 comprises the top 25% of journals in the list; Q2, Q3, and Q4 comprise 25% to 50%, 50% to 75%, and 75% to 100% of journals in the list, respectively ( https://www.mondragon.edu/en/web/biblioteka/publications-impact-indexes ).

These Journals “Political Communication (Q1, IF: 4.339)”, “Journal of Advertising (Q1, IF: 6.302)”, “Journal of Communication (Q1, IF: 4.846)”, Big Data and Society (Q1, IF: 4.577), Applied Linguistics (Q1, IF: 4.286), Communication Methods and Measures (Q1, IF: 5.281), New Media and Society (Q1, IF: 4.577), Human Communication Research (Q1, IF: 3.540), Public Opinion Quarterly (Q1, IF: 2.494), and Digital Journalism (Q1, IF: 4.476) were included in this present study ( Table 1 ).

thumbnail

  • PPT PowerPoint slide
  • PNG larger image
  • TIFF original image

https://doi.org/10.1371/journal.pone.0263725.t001

Data collection

All issues of these journals in 2018 were reviewed through the journals’ web pages. All published articles in this year were evaluated on the web pages’ archive. The date range was considered based on published issues in the journals. Only original articles (meta-analyses, systematic reviews, original research articles, and research notes) in 2018 were included in this study. Articles such as editorials, book review articles, case reports, letters to the editor, and other non-research correspondences were excluded.

The findings of the articles (title, doi number, article type) were recorded. In January 2021, the title of the article or doi number was searched one by one in Google Scholar (GS) and Web of Science (WoS) Clarivate. The traditional citation numbers (GS and WoS) of these articles were recorded. The tweet number and social media posts of these articles were searched by their metric evaluating system ( https://www.altmetric.com/ ) and recorded. The data were appraised nearly three years after the articles’ publication date. It was thought that this period would allow the impact of published articles (the citations and Twitter mentions) to be fully observed.

Seven hundred and eleven articles were published in the top ten communication-based journals in 2018. After the exclusion criteria were applied, 572 articles were included for analysis. A total of 570 articles (99.7%) were cited at least once on GS, and 518 articles (90.6%) were cited at least once on WoS. The total cumulative number of citations for all the articles was 21,242 for GS and 5,874 for WoS. The number value of citations ranged from 0 to 868 on GS, and 0–235 on WoS. The median number of citations was 19 citations (interquartile range (IQR): 0–868 citations) for GS, and five citations (IQR: 0–235 citations) for WoS ( Table 2 ).

thumbnail

https://doi.org/10.1371/journal.pone.0263725.t002

In all, 522 articles (91.3%) were posted at least once on Twitter or other platforms. The total number of mentions on all social media platforms was 50,624 items. Overall, the most-used social media platform was Twitter. The majority of articles (n: 500; 87.4%) were mentioned at least once on Twitter, and these articles had cumulatively tweets 13,438 tweet. The median value of Twitter posts was nine tweets (IQR: 0–502 tweet).

The median citation value of articles on GS was 21 citations (IQR: 0–868) for the articles that had been tweeted at least one. However, it was nine (IQR: 0–72) citations for non-tweeted articles. Further, the median WoS citation number was five (IQR: 0–235) for tweeted articles and two (IQR: 0–19) for non-tweeted articles.

The tweeted articles were cited more often than those with no tweets on both platforms (for GS: Mann-Whitney U: 10107, Z: -6.022, p< 0.001; for WoS: Mann-Whitney U: 10547, Z: -5.699, p< 0.001, respectively). This observation was also similar for the other platforms (for GS: Mann-Whitney U: 6493, Z: -5.875, p< 0.001; for WoS: Mann-Whitney U: 6735.5, Z: -5.671, p< 0.001, respectively) ( Table 3 ).

thumbnail

https://doi.org/10.1371/journal.pone.0263725.t003

It was observed that a significant correlation between the number of Twitter posts and the number of citations in GS (r = 0.44, p<0.001) and in WoS (r = 0.50, P<0.001). Similarly, there was also a positive correlation between the number of mentions on all platforms and the number of citations in GS (r = 0.83, p<0.001) and WoS (r = 0.71, p<0.001) ( Fig 1 ).

thumbnail

A. The relationship between metric value of articles and Google scholar citations. B. The relationship between metric value of articles and Web of Science citations. C. The relationship between Twitter posts of articles and Web of Science citations. D. The relationship between Twitter posts of articles and Google Scholar citations.

https://doi.org/10.1371/journal.pone.0263725.g001

Social networks in academia are rapidly improving, and significantly increasing use by the scientific community [ 55 ]. Many journals frequently use these tools for advertising and sharing information [ 56 ]. Social media platforms have brought another dimension to access the information. With developments in social sharing platforms, there has been a transition to the digital age of accessing information [ 57 ]. Thus, these platforms have presented new opportunities for researchers to extend their publications the scientific society [ 36 , 58 ]. Similarly, this present study found that the articles exposed to social media were cited more than the articles not posted on social media. This study showed that the visibility of articles might be increased by sharing them on social media, which allows the real effects of the articles to emerge more rapidly.

Generally, the use of all networks has a similar effect, but the most used platform for this purpose is Twitter. Twitter allows for the rapid sharing of information within seconds of posting a tweet. Thus, the dissemination rate of tweets increases exponentially [ 59 ]. As in other studies, it was found that more use of Twitter than other platforms in this study. Moreover, a positive correlation was observed between the altmetric score, Twitter posts, and citation rate of articles. The articles that were tweeted at least once were cited more than those with no posts on both platforms (for GS, p <0.001; and for WoS, p <0.001). The findings support the conclusion that Twitter activity may reflect the quality of articles or increase their citations. Thus, the measure of social platforms based on tweets should be used to complement the traditional metrics of article citations.

Especially the science of communication is the social sciences field most related to social networks. The traditional publishing continues in the communication sciences, but its adoption of social media-related studies is increasing daily [ 10 , 14 ]. Therefore, sharing and disseminating the articles in communication journals may significantly increase their citation rates. This present study revealed this relationship and highlighted the impact of using social networks in the academic world. Thus, authors and journals should share all articles using social media tools to increase the impact of the article.

The use of social media platforms in the scientific world will make an important contribution to traditional metric systems. Published articles can be posted on social media to reach more people, disseminate information quickly and increase their impact faster [ 60 ]. Further, academic journals’ use of Twitter will promote the journals and increase the citation numbers of the articles [ 9 , 60 ]. Many journals share all of their articles published on their social media accounts. Some journals only share articles they deem important. However, increasing the visibility of articles on social media platforms could be a tool for reaching more people. Thus, this study shows that Twitter posts could mediate these articles to reach more people.

The results of this study show that articles with exposure to social media had higher citation rates. There was also a positive correlation between exposure to social media and article citations. Therefore, the scientist and journals should develop new projects to increase the usage of social media.

However, the present study has some limitations. First, it analyzed only the number of citations and the number of tweets and did not evaluate their content or the Twitter account holders. Second, this study evaluated only the top 10 journals in the communication science. Third, multiple factors (such as sending articles to the press at the same time, focusing on some specific communication topics, and evaluating popular topics) may play a role in the emergence of the articles’ impact. Posting articles on Twitter is only one contributor to this impact. Further, this study examined the relationship between the number of Twitter posts and the number of citations of articles. However, the cause of the relationship was not analyzed. Therefore, more work is needed to explain potential causes of the relationships between posts and citations of an article. Nevertheless, this study presented significant findings that highlight the importance of using social media in academia.

  • View Article
  • Google Scholar
  • PubMed/NCBI
  • 21. The top 500 sites on the web [Available from: https://www.alexa.com/topsites .
  • 23. Twitter: number of monthly active users 2010–2019: Statista Research Department; 2021 [Available from: https://www.statista.com/statistics/282087/number-of-monthly-active-twitter-users/ .
  • 39. Ananiadou S, Thompson P, Nawaz R, editors. Enhancing search: Events and their discourse context. International conference on intelligent text processing and computational linguistics; 2013: Springer.
  • 45. Quercia D, Kosinski M, Stillwell D, Crowcroft J, editors. Our twitter profiles, our selves: Predicting personality with twitter. 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing; 2011: IEEE.
  • 46. Tsou A, Bowman TD, Ghazinejad A, Sugimoto CR, editors. Who tweets about science? Issi; 2015: Istanbul.
  • 48. Darling ES, Shiffman D, Côté IM, Drew JA. The role of Twitter in the life cycle of a scientific publication. arXiv preprint arXiv:. 2013.
  • Open access
  • Published: 16 March 2020

Exploring the role of social media in collaborative learning the new domain of learning

  • Jamal Abdul Nasir Ansari 1 &
  • Nawab Ali Khan 1  

Smart Learning Environments volume  7 , Article number:  9 ( 2020 ) Cite this article

383k Accesses

183 Citations

19 Altmetric

Metrics details

This study is an attempt to examine the application and usefulness of social media and mobile devices in transferring the resources and interaction with academicians in higher education institutions across the boundary wall, a hitherto unexplained area of research. This empirical study is based on the survey of 360 students of a university in eastern India, cognising students’ perception on social media and mobile devices through collaborative learning, interactivity with peers, teachers and its significant impact on students’ academic performance. A latent variance-based structural equation model approach was followed for measurement and instrument validation. The study revealed that online social media used for collaborative learning had a significant impact on interactivity with peers, teachers and online knowledge sharing behaviour.

Additionally, interactivity with teachers, peers, and online knowledge sharing behaviour has seen a significant impact on students’ engagement which consequently has a significant impact on students’ academic performance. Grounded to this finding, it would be valuable to mention that use of online social media for collaborative learning facilitate students to be more creative, dynamic and research-oriented. It is purely a domain of knowledge.

Introduction

The explosion of Information and Communication Technology (ICT) has led to an increase in the volume and smoothness in transferring course contents, which further stimulates the appeasement of Digital Learning Communities (DLCs). The millennium and naughtiness age bracket were Information Technology (IT) centric on web space where individual and geopolitical disperse learners accomplished their e-learning goals. The Educause Center for Applied Research [ECAR] ( 2012 ) surveyed students in higher education mentioned that students are pouring the acceptance of mobile computing devices (cellphones, smartphones, and tablet) in Higher Education Institutions (HEIs), roughly 67% surveyed students accepted that mobile devices and social media play a vital role in their academic performance and career enhancement. Mobile devices and social media provide excellent educational e-learning opportunities to the students for academic collaboration, accessing in course contents, and tutors despite the physical boundary (Gikas & Grant, 2013 ). Electronic communication technologies accelerate the pace of their encroachment of every aspect of life, the educational institutions incessantly long decades to struggle in seeing the role of such devices in sharing the contents, usefulness and interactivity style. Adoption and application of mobile devices and social media can provide ample futuristic learning opportunities to the students in accessing course contents as well as interaction with peers and experts (Cavus & Ibrahim, 2008 , 2009 ; Kukulska-Hulme & Shield, 2008 ; Nihalani & Mayrath, 2010 ; Richardson & Lenarcic, 2008 , Shih, 2007 ). Recently Pew Research Center reported that 55% American teenage age bracket of 15–17 years using online social networking sites, i.e. Myspace and Facebook (Reuben, 2008 ). Social media, the fast triggering the mean of virtual communication, internet-based technologies changed the life pattern of young youth.

Use of social media and mobile devices presents both advantages as well as challenges, mostly its benefits seen in terms of accessing course contents, video clip, transfer of the instructional notes etc. Overall students feel that social media and mobile devices are the cheap and convenient tools of obtaining relevant information. Studies in western countries have confronted that online social media use for collaborative learning has a significant contribution to students’ academic performance and satisfaction (Zhu, 2012 ). The purpose of this research project was to explore how learning and teaching activities in higher education institutions were affected by the integration and application of mobile devices in sharing the resource materials, interaction with colleagues and students’ academic performance. The broad goal of this research was to contemporise the in-depth perspectives of students’ perception of mobile devices and social media in learning and teaching activities. However, this research paper paid attention to only students’ experiences, and their understanding of mobile devices and social media fetched changes and its competency in academic performance. The fundamental research question of this research was, what are the opinions of students on social media and mobile devices when it is integrating into higher education for accessing, interacting with peers.

A researcher of the University of Central Florida reported that electronic devices and social media create an opportunity to the students for collaborative learning and also allowed the students in sharing the resource materials to the colleagues (Gikas & Grant, 2013 ). The result of the eight Egyptian universities confirmed that social media have the significant impact on higher education institutions especially in term of learning tools and teaching aids, faculty members’ use of social media seen at a minimum level due to several barriers (internet accessibility, mobile devices etc.).

Social media and mobile devices allow the students to create, edit and share the course contents in textual, video or audio forms. These technological innovations give birth to a new kind of learning cultures, learning based on the principles of collective exploration and interaction (Selwyn, 2012 ). Social media the phenomena originated in 2005 after the Web2.0 existence into the reality, defined more clearly as “a group of Internet-based applications that build on the ideological and technological foundation of web 2.0 and allow creation and exchange of user-generated contents (Kaplan & Haenlein, 2010 ). Mobile devices and social media provide opportunities to the students for accessing resources, materials, course contents, interaction with mentor and colleagues (Cavus & Ibrahim, 2008 , 2009 ; Richardson & Lenarcic, 2008 ).

Social media platform in academic institutions allows students to interact with their mentors, access their course contents, customisation and build students communities (Greenhow, 2011a , 2011b ). 90% school going students currently utilise the internet consistently, with more than 75% teenagers using online networking sites for e-learning (DeBell & Chapman, 2006 ; Lenhart, Arafeh, & Smith, 2008 ; Lenhart, Madden, & Hitlin, 2005 ). The result of the focus group interview of the students in 3 different universities in the United States confirmed that use of social media created opportunities to the learners for collaborative learning, creating and engaging the students in various extra curriculum activities (Gikas & Grant, 2013 ).

Research background and hypotheses

The technological innovation and increased use of the internet for e-learning by the students in higher education institutions has brought revolutionary changes in communication pattern. A report on 3000 college students in the United States revealed that 90% using Facebook while 37% using Twitter to share the resource materials as cited in (Elkaseh, Wong, & Fung, 2016 ). A study highlighted that the usage of social networking sites in educational institutions has a practical outcome on students’ learning outcomes (Jackson, 2011 ). The empirical investigation over 252 undergraduate students of business and management showed that time spent on twitter and involvement in managing social lives and sharing information, course-related influences their performance (Evans, 2014 ).

Social media for collaborative learning, interactivity with teachers, interactivity with peers

Many kinds of research confronted on the applicability of social media and mobile devices in higher education for interaction with colleagues.90% of faculty members use some social media in courses they were usually teaching or professional purposes out of the campus life. Facebook and YouTube are the most visited sites for the professional outcomes, around 2/3rd of the all-faculty use some medium fora class session, and 30% posted contents for students engagement in reading, view materials (Moran, Seaman, & Tinti-Kane, 2011 ). Use of social media and mobile devices in higher education is relatively new phenomena, completely hitherto area of research. Research on the students of faculty of Economics at University of Mortar, Bosnia, and Herzegovina reported that social media is already used for the sharing the materials and exchanges of information and students are ready for active use of social networking site (slide share etc.) for educational purposes mainly e-learning and communication (Mirela Mabić, 2014 ).

The report published by the U.S. higher education department stated that the majority of the faculty members engaged in different form of the social media for professional purposes, use of social media for teaching international business, sharing contents with the far way students, the use of social media and mobile devices for sharing and the interactive nature of online and mobile technologies build a better learning environment at international level. Responses on 308 graduate and postgraduate students in Saudi Arabia University exhibited that positive correlation between chatting, online discussion and file sharing and knowledge sharing, and entertainment and enjoyment with students learning (Eid & Al-Jabri, 2016 ). The quantitative study on 168 faculty members using partial least square (PLS-SEM) at Carnegie classified Doctoral Research University in the USA confirmed that perceived usefulness, external pressure and compatibility of task-technology have positive effect on social media use, the higher the degree of the perceived risk of social media, the less likely to use the technological tools for classroom instruction, the study further revealed that use of social media for collaborative learning has a positive effect on students learning outcome and satisfaction (Cao, Ajjan, & Hong, 2013 ). Therefore, the authors have hypothesized:

H1: Use of social media for collaborative learning is positively associated with interactivity with teachers.

Additionally, Madden and Zickuhr ( 2011 ) concluded that 83% of internet user within the age bracket of 18–29 years adopting social media for interaction with colleagues. Kabilan, Ahmad, and Abidin ( 2010 ) made an empirical investigation on 300 students at University Sains Malaysia and concluded that 74% students found to be the same view that social media infuses constructive attitude towards learning English (Fig. 1 ).

figure 1

Research Model

Reuben ( 2008 ) concluded in his study on social media usage among professional institutions revealed that Facebook and YouTube used over half of 148 higher education institutions. Nevertheless, a recent survey of 456 accredited United States institutions highlighted 100% using some form of social media, notably Facebook 98% and Twitter 84% for e-learning purposes, interaction with mentors (Barnes & Lescault, 2011 ).

Information and communication technology (ICT), such as web-based application and social networking sites enhances the collaboration and construction of knowledge byway of instruction with outside experts (Zhu, 2012 ). A positive statistically significant relationship was found between student’s use of a variety of social media tools and the colleague’s fellow as well as the overall quality of experiences (Rutherford, 2010 ). The potential use of social media leads to collaborative learning environments which allow students to share education-related materials and contents (Fisher & Baird, 2006 ). The report of 233 students in the United States higher educations confirmed that more recluse students interact through social media, which assist them in collaborative learning and boosting their self-confidence (Voorn & Kommers, 2013 ). Thus hypotheses as

H2: Use of social media for collaborative learning is positively associated with interactivity with peers.

Social media for collaborative learning, interactivity with peers, online knowledge sharing behaviour and students’ engagement

Students’ engagement in social media and its types represent their physical and mental involvement and time spent boost to the enhancement of educational Excellency, time spent on interaction with peers, teachers for collaborative learning (Kuh, 2007 ). Students’ engagement enhanced when interacting with peers and teacher was in the same direction, shares of ideas (Chickering & Gamson, 1987 ). Engagement is an active state that is influenced by interaction or lack thereof (Leece, 2011 ). With the advancement in information technology, the virtual world becomes the storehouse of the information. Liccardi et al. ( 2007 ) concluded that 30% students were noted to be active on social media for interaction with their colleagues, tutors, and friends while more than 52% used some social media forms for video sharing, blogs, chatting, and wiki during their class time. E-learning becomes now sharp and powerful tools in information technology and makes a substantial impact on the student’s academic performance. Sharing your knowledge will make you better. Social network ties were shown to be the best predictors of online knowledge sharing intention, which in turn associated with knowledge sharing behaviour (Chen, Chen, & Kinshuk, 2009 ). Social media provides the robust personalised, interactive learning environment and enhances in self-motivation as cited in (Al-Mukhaini, Al-Qayoudhi, & Al-Badi, 2014 ). Therefore, it was hypothesised that:

H3: Use of social media for collaborative learning is positively associated with online knowledge sharing behaviour.

Broadly Speaking social media/sites allow the students to interact, share the contents with colleagues, also assisting in building connections with others (Cain, 2008 ). In the present era, the majority of the college-going students are seen to be frequent users of these sophisticated devices to keep them informed and updated about the external affair. Facebook reported per day 1,00,000 new members join; Facebook is the most preferred social networking sites among the students of the United States as cited in (Cain, 2008 ). The researcher of the school of engineering, Swiss Federal Institute of Technology Lausanne, Switzerland, designed and developed Grasp, a social media platform for their students’ collaborative learning, sharing contents (Bogdanov et al., 2012 ). The utility and its usefulness could be seen in the University of Geneva and Tongji University at both two educational places students were satisfied and accept ‘ Grasp’ to collect, organised and share the contents. Students use of social media will interact ubiquity, heterogeneous and engaged in large groups (Wankel, 2009 ). So we hypotheses

H4: More interaction with teachers leads to higher students’ engagement.

However, a similar report published on 233 students revealed that social media assisted in their collaborative learning and self-confidence as they prefer communication technology than face to face communication. Although, the students have the willingness to communicate via social media platform than face to face (Voorn & Kommers, 2013 ). The potential use of social media tools facilitates in achieving higher-level learning through collaboration with colleagues and other renewed experts in their field (Junco, Heiberger, & Loken, 2011 ; Meyer, 2010 ; Novak, Razzouk, & Johnson, 2012 ; Redecker, Ala-Mutka, & Punie, 2010 ). Academic self-efficacy and optimism were found to be strongly related to performance, adjustment and consequently both directly impacted on student’s academic performance (Chemers, Hu, & Garcia, 2001 ). Data of 723 Malaysian researchers confirmed that both male and female students were satisfied with the use of social media for collaborative learning and engagement was found positively affected with learning performance (Al-Rahmi, Alias, Othman, Marin, & Tur, 2018 ). Social media were seen as a powerful driver for learning activities in terms of frankness, interactivity, and friendliness.

Junco et al. ( 2011 ) conducted research on the specific purpose of the social media; how Twitter impacted students’ engagement, found that it was extent discussion out of class, their participation in panel group (Rodriguez, 2011 ). A comparative study conducted by (Roblyer, McDaniel, Webb, Herman, & Witty, 2010 ) revealed that students were more techno-oriented than faculty members and more likely using Facebook and such similar communication technology to support their class-related task. Additionally, faculty members were more likely to use traditional techniques, i.e. email. Thus hypotheses framed is that:

H5: More interaction with peers ultimately leads to better students’ engagement.

Social networking sites and social media are closely similar, which provide a platform where students can interact, communicate, and share emotional intelligence and looking for people with other attitudes (Gikas & Grant, 2013 ). Facebook and YouTube channel use also increased in the skills/ability and knowledge and outcomes (Daniel, Isaac, & Janet, 2017 ). It was highlighted that 90% of faculty members were using some sort of social media in their courses/ teaching. Facebook was the most visited social media sites as per study, 40% of faculty members requested students to read and views content posted on social media; majority reports that videos, wiki, etc. the primary source of acquiring knowledge, social networking sites valuable tool/source of collaborative learning (Moran et al., 2011 ). However, more interestingly, in a study which was carried out on 658 faculty members in the eight different state university of Turkey, concluded that nearly half of the faculty member has some social media accounts.

Further reported that adopting social media for educational purposes, the primary motivational factor which stimulates them to use was effective and quick means of communication technology (Akçayır, 2017 ). Thus hypotheses formulated is:

H6: Online knowledge sharing behaviour is positively associated with the students’ engagement.

Using multiple treatment research design, following act-react to increase students’ academic performance and productivity, it was observed when self–monitoring record sheet was placed before students and seen that students engagement and educational productivity was increased (Rock & Thead, 2007 ). Student engagement in extra curriculum activities promotes academic achievement (Skinner & Belmont, 1993 ), increases grade rate (Connell, Spencer, & Aber, 1994 ), triggering student performance and positive expectations about academic abilities (Skinner & Belmont, 1993 ). They are spending time on online social networking sites linked to students engagement, which works as the motivator of academic performance (Fan & Williams, 2010 ). Moreover, it was noted in a survey of over 236 Malaysian students that weak association found between the online game and student’s academic performance (Eow, Ali, Mahmud, & Baki, 2009 ). In a survey of 671 students in Jordan, it was revealed that student’s engagement directly influences academic performance, also seen the indirect effect of parental involvement over academic performance (Al-Alwan, 2014 ). Engaged students are perceptive and highly active in classroom activities, ready to participate in different classroom extra activities and expose motivation to learn, which finally leads in academic achievement (Reyes, Brackett, Rivers, White, & Salovey, 2012 ). A mediated role of students engagement seen in 1399 students’ classroom emotional climate and grades (Reyes et al., 2012 ). A statistically significant relation was noticed between online lecture and exam performance.

Nonetheless, intelligence quotient, personality factors, students must be engaged in learning activities as cited in (Bertheussen & Myrland, 2016 ). The report of the 1906 students at 7 universities in Colombia confirmed that the weak correlation between collaborative learning, students faculty interaction with academic performance (Pineda-Báez et al., 2014 ) Thus, the hypothesis

H7: Student's Engagement is positively associated with the student's academic performance.

Methodology

To check the students’ perception on social media for collaborative learning in higher education institutions, Data were gathered both offline and online survey administered to students from one public university in Eastern India (BBAU, Lucknow). For the sake of this study, indicators of interactivity with peers and teachers, the items of students engagement, the statement of social media for collaborative learning, and the elements of students’ academic performance were adopted from (AL-Rahmi & Othman, 2013 ). The statement of online knowledge sharing behaviour was taken from (Ma & Yuen, 2011 ).

The indicators of all variables which were mentioned above are measured on the standardised seven-point Likert scale with the anchor (1-Strongly Disagree, to 7-Strongly Agree). Interactivity with peers was measured using four indicators; the sample items using social media in class facilitates interaction with peers ; interactivity with teachers was measured using four symbols, the sample item is using social media in class allows me to discuss with the teacher. ; engagement was measured using three indicators by using social media I felt that my opinions had been taken into account in this class ; social media for collaborative learning was measured using four indicators collaborative learning experience in social media environment is better than in a face-to-face learning environment ; students’ academic performance was measured using five signs using social media to build a student-lecturer relationship with my lecturers, and this improves my academic performance ; online knowledge sharing behaviour was assessed using five symbols the counsel was received from other colleague using social media has increased our experience .

Procedure and measurement

A sample of 360 undergraduate students was collected by convenience sampling method of a public university in Eastern India. The proposed model of study was measured and evaluated using variance based structured equation model (SEM)-a latent multi variance technique which provides the concurrent estimation of structural and measurement model that does not meet parametric assumption (Coelho & Duarte, 2016 ; Haryono & Wardoyo, 2012 ; Lee, 2007 ; Moqbel, Nevo, & Kock, 2013 ; Raykov & Marcoulides, 2000 ; Williams, Rana, & Dwivedi, 2015 ). The confirmatory factor analysis (CFA) was conducted to ensure whether the widely accepted criterion of discriminate and convergent validity met or not. The loading of all the indicators should be 0.50 or more (Field, 2011 ; Hair, Anderson, Tatham, & Black, 1992 ). And it should be statistically significant at least at the 0.05.

Demographic analysis (Table 1 )

The majority of the students in this study were females (50.8%) while male students were only 49.2% with age 15–20 years (71.7%). It could be pointed out at this juncture that the majority of the students (53.9%) in BBAU were joined at least 1–5 academic pages for their getting information, awareness and knowledge. 46.1% of students spent 1–5 h per week on social networking sites for collaborative learning, interaction with teachers at an international level. The different academic pages followed for accessing material, communication with the faculty members stood at 44.4%, there would be various forms of the social networking sites (LinkedIn, Slide Share, YouTube Channel, Researchgate) which provide the facility of online collaborative learning, a platform at which both faculty members and students engaged in learning activities.

As per report (Nasir, Khatoon, & Bharadwaj, 2018 ), most of the social media user in India are college-going students, 33% girls followed by 27% boys students, and this reports also forecasted that India is going to become the highest 370.77 million internet users in 2022. Additionally, the majority of the faculty members use smartphone 44% to connect with the students for sharing material content. Technological advantages were the pivotal motivational force which stimulates faculty members and students to exploits the opportunities of resource materials (Nasir & Khan, 2018 ) (Fig. 2 ).

figure 2

Reasons for Using Social Media

When the students were asked for what reason did they use social media, it was seen that rarely using for self-promotion, very frequently using for self-education, often used for passing the time with friends, and so many fruitful information the image mentioned above depicting.

Instrument validation

The structural model was applied to scrutinize the potency and statistically significant relationship among unobserved variables. The present measurement model was evaluated using Confirmatory Factor Analysis (CFA), and allied procedures to examine the relationship among hypothetical latent variables has acceptable reliability and validity. This study used both SPSS 20.0 and AMOS to check measurement and structural model (Field, 2013 ; Hair, Anderson, et al., 1992 ; Mooi & Sarstedt, 2011 ; Norusis, 2011 ).

The Confirmatory Factor Analysis (CFA) was conducted to ensure whether the widely accepted criterion of discriminant and convergent validity met or not. The loading of all the indicators should be 0.70 or more it should be statistically significant at least at the 0.05 (Field, 2011 ; Hair, Anderson, et al., 1992 ).

CR or CA-based tests measured the reliability of the proposed measurement model. The CA provides an estimate of the indicators intercorrelation (Henseler & Sarstedt, 2013 . The benchmark limits of the CA is 0.7 or more (Nunnally & Bernstein, 1994 ). As per Table 2 , all latent variables in this study above the recommended threshold limit. Although, Average Variance Extracted (AVE) has also been demonstrated which exceed the benchmark limit 0.5. Thus all the above-specified values revealed that our instrument is valid and effective. (See Table 2 for the additional information) (Table 3 ).

In a nutshell, the measurement model clear numerous stringent tests of convergent validity, discriminant validity, reliability, and absence of multi-collinearity. The finding demonstrated that our model meets widely accepted data validation criteria. (Schumacker & Lomax, 2010 ).

The model fit was evaluated through the Chi-Square/degree of freedom (CMIN/DF), Root Mean Residual (RMR), Root Mean Square Error of Approximation (RMSEA), Comparative Fit Index (CFI), and Goodness of fit index (GFI) and Tucker-Lewis Index (TLI). The benchmark limit of the CFI, TLI, and GFI 0.90or more (Hair et al., 2016 ; Kock, 2011 ). The model study demonstrated in the table, as mentioned above 4 that the minimum threshold limit was achieved (See Table 4 for additional diagnosis).

Path coefficient of several hypotheses has been demonstrated in Fig.  3 , which is a variable par relationship. β (beta) Coefficients, standardised partial regression coefficients signify the powers of the multivariate relationship among latent variables in the model. Remarkably, it was observed that seven out of the seven proposed hypotheses were accepted and 78% of the explained variance in students’ academic performance, 60% explained variance in interactivity with teachers, 48% variance in interactivity with peers, 43% variance in online knowledge sharing behaviour and 79% variance in students’ engagement. Social media collaborative learning has a significant association with teacher interactivity(β = .693, P  < 0.001), demonstrating that there is a direct effect on interaction with the teacher by social media when other variables are controlled. On the other hand, use of social media for collaborative learning has noticed statistically significant positive relationship with peers interactivity (β = .704, p  < 0.001) meaning thereby, collaborative learning on social media by university students, leads to the high degree of interaction with peers, colleagues. Implied 10% rise in social media use for learning purposes, expected 7.04% increase in interaction with peers.

figure 3

Path Diagram

Use of social media for collaborating learning has a significant positive association with online knowledge sharing behaviour (β = .583, p  < 0.001), meaning thereby that the more intense use of social media for collaborative learning by university students, the more knowledge sharing between peers and colleagues. Also, interaction with the teacher seen the significant statistical positive association with students engagement (β = .450, p  < 0.001), telling that the more conversation with teachers, leads to a high level of students engagement. Similarly, the practical interpretation of this result is that there is an expected 4.5% increase in student’s participation for every 10% increase in interaction with teachers. Interaction with peers has a significant positive association with students engagement (β = .210, p  < 0.001). Practically, the finding revealed that 10% upturn in student’s involvement, there is a 2.1% increase in peer’s interaction. There is a significant positive association between online knowledge sharing behaviour and students engagement (β = 0.247, p  < 0.001), and finally students engagement has been a statistically significant positive relationship with students’ academic performance (β = .972, p  < 0.001), this is the clear indication that more engaged students in collaborative learning via social media leads to better students’ academic performance.

Discussion and implication

There is a continuing discussion in the academic literature that use of such social media and social networking sites would facilitate collaborative learning. It is human psychology generally that such communication media technology seems only for entertainment, but it should be noted here carefully that if such communication technology would be followed with due attention prove productive. It is essential to acknowledge that most university students nowadays adopting social media communication to interact with colleagues, teachers and also making the group be in touch with old friends and even a convenient source of transferring the resources. In the present era, the majority of the university students having diversified social media community groups like Whatsapp, Facebook pages following different academic web pages to upgrade their knowledge.

Practically for every 10% rise in students’ engagement, expected to be 2.1% increase in peer interaction. As the study suggested that students engage in different sites, they start discussing with colleagues. More engaged students in collaborative learning through social media lead better students’ academic performance. The present study revealed that for every 10% increase in student’s engagement, there would be an expected increase in student academic performance at a rate of 9.72. This extensive research finding revealed that the application of online social media would facilitate the students to become more creative, dynamics and connect to the worldwide instructor for collaborative learning.

Accordingly, the use of online social media for collaborative learning, interaction with mentors and colleagues leadbetter student’s engagement which consequently affects student’s academic performance. The higher education authority should provide such a platform which can nurture the student’s intellectual talents. Based on the empirical investigation, it would be said that students’ engagement, social media communication devices facilitate students to retrieve information and interact with others in real-time regarding sharing teaching materials contents. Additionally, such sophisticated communication devices would prove to be more useful to those students who feel too shy in front of peers; teachers may open up on the web for the collaborative learning and teaching in the global scenario and also beneficial for physically challenged students. It would also make sense that intensive use of such sophisticated technology in teaching pedagogical in higher education further facilitates the teachers and students to interact digitally, web-based learning, creating discussion group, etc. The result of this investigation confirmed that use of social media for collaborative learning purposes, interaction with peers, and teacher affect their academic performance positively, meaning at this moment that implementation of such sophisticated communication technology would bring revolutionary, drastic changes in higher education for international collaborative learning (Table 5 ).

Limitations and future direction

Like all the studies, this study is also not exempted from the pitfalls, lacunas, and drawbacks. The first and foremost research limitation is it ignores the addiction of social media; excess use may lead to destruction, deviation from the focal point. The study only confined to only one academic institution. Hence, the finding of the project cannot be generalised as a whole. The significant positive results were found in this study due to the fact that the social media and mobile devices are frequently used by the university going students not only as a means of gratification but also for educational purposes.

Secondly, this study was conducted on university students, ignoring the faculty members, it might be possible that the faculty members would not have been interested in interacting with the students. Thus, future research could be possible towards faculty members in different higher education institutions. To the authors’ best reliance, this is the first and prime study to check the usefulness and applicability of social media in the higher education system in the Indian context.

Concluding observations

Based on the empirical investigation, it could be noted that application and usefulness of the social media in transferring the resource materials, collaborative learning and interaction with the colleagues as well as teachers would facilitate students to be more enthusiastic and dynamic. This study provides guidelines to the corporate world in formulating strategies regarding the use of social media for collaborative learning.

Availability of data and materials

The corresponding author declared here all types of data used in this study available for any clarification. The author of this manuscript ready for any justification regarding the data set. To make publically available of the data used in this study, the seeker must mail to the mentioned email address. The profile of the respondents was completely confidential.

Akçayır, G. (2017). Why do faculty members use or not use social networking sites for education? Computers in Human Behavior, 71 , 378–385.

Article   Google Scholar  

Al-Alwan, A. F. (2014). Modeling the relations among parental involvement, school engagement and academic performance of high school students. International Education Studies, 7 (4), 47–56.

Al-Mukhaini, E. M., Al-Qayoudhi, W. S., & Al-Badi, A. H. (2014). Adoption of social networking in education: A study of the use of social networks by higher education students in Oman. Journal of International Education Research, 10 (2), 143–154.

Google Scholar  

Al-Rahmi, W. M., Alias, N., Othman, M. S., Marin, V. I., & Tur, G. (2018). A model of factors affecting learning performance through the use of social media in Malaysian higher education. Computers & Education, 121 , 59–72.

Al-Rahmi, W. M., & Othman, M. S. (2013). Evaluating student’s satisfaction of using social media through collaborative learning in higher education. International Journal of Advances in Engineering & Technology, 6 (4), 1541–1551.

Arbuckle, J. (2008). Amos 17.0 user's guide . Chicago: SPSS Inc..

Barnes, N. G., & Lescault, A. M. (2011). Social media adoption soars as higher-ed experiments and reevaluates its use of new communications tools . North Dartmouth: Center for Marketing Research. University of Massachusetts Dartmouth.

Bertheussen, B. A., & Myrland, Ø. (2016). Relation between academic performance and students’ engagement in digital learning activities. Journal of Education for Business, 91 (3), 125–131.

Bogdanov, E., Limpens, F., Li, N., El Helou, S., Salzmann, C., & Gillet, D. (2012). A social media platform in higher education. In Proceedings of the 2012 IEEE Global Engineering Education Conference (EDUCON) (pp. 1–8). IEEE.

Byrne, B. M. (1994). Structural equation modeling with EQS and EQS/windows: basic concepts, applications, and programming . Thousand Oaks: Sage.

Cain, J. (2008). Online social networking issues within academia and pharmacy education. American Journal of Pharmaceutical Education. https://doi.org/10.5688/aj720110 .

Cao, Y., Ajjan, H., & Hong, P. (2013). Using social media applications for educational outcomes in college teaching: a structural equation analysis. British Journal of Educational Technology, 44 (4), 581–593. https://doi.org/10.1111/bjet.12066 .

Cavus, N., & Ibrahim, D. (2008). A mobile tool for learning English words, Online Submission (pp. 6–9) Retrieved from http://libezproxy.open.ac.uk/login?url=http://search.ebscohost.com/login.aspx?direct=true&db=eric&AN=ED504283&site=ehost-live&scope=site .

Cavus, N., & Ibrahim, D. (2009). M-learning: An experiment in using SMS to support learning new English language words. British Journal of Educational Technology, 40 (1), 78–91.

Chemers, M. M., Hu, L. T., & Garcia, B. F. (2001). Academic self-efficacy and first-year college student performance and adjustment. Journal of Educational Psychology, 93 (1), 55–64. https://doi.org/10.1037/0022-0663.93.1.55 .

Chen, I. Y. L., Chen, N.-S., & Kinshuk. (2009). International forum of Educational Technology & Society Examining the factors influencing participants’ knowledge sharing behavior in virtual learning communities published by : International forum of Educational Technology & Society Examining the factor. Educational Technology & Society, 12 (1), 134–148.

Chickering, A. W., & Gamson, Z. F. (1987). Seven principles for good practise in undergraduate education. AAHE bulletin, 3 , 7.

Coelho, J., & Duarte, C. (2016). A literature survey on older adults' use of social network services and social applications. Computers in Human Behavior, 58 , 187–205.

Connell, J. P., Spencer, M. B., & Aber, J. L. (1994). Educational risk and resilience in African-American youth: Context, self, action, and outcomes in school. Child Development, 65 (2), 493–506.

Daniel, E. A., Isaac, E. N., & Janet, A. K. (2017). Influence of Facebook usage on employee productivity: A case of university of cape coast staff. African Journal of Business Management, 11 (6), 110–116. https://doi.org/10.5897/AJBM2017.8265 .

DeBell, M., & Chapman, C. (2006). Computer and internet use by students in 2003. Statistical analysis report. NCES 2006-065. National Center for education statistics.

Dziuban, C., & Walker, J. D. (2012). ECAR Study of Undergraduate Students and Information Technology, 2012 (Research Report) . Louisville: EDUCAUSE Centre for Applied Research.

Eid, M. I. M., & Al-Jabri, I. M. (2016). Social networking, knowledge sharing, and student learning: The case of university students. Computers and Education, 99 , 14–27. https://doi.org/10.1016/j.compedu.2016.04.007 .

Elkaseh, A. M., Wong, K. W., & Fung, C. C. (2016). Perceived ease of use and perceived usefulness of social media for e-learning in Libyan higher education: A structural equation modeling analysis. International Journal of Information and Education Technology, 6 (3), 192.

Eow, Y. L., Ali, W. Z. b. W., Mahmud, R. b., & Baki, R. (2009). Form one students’ engagement with computer games and its effect on their academic achievement in a Malaysian secondary school. Computers and Education, 53 (4), 1082–1091. https://doi.org/10.1016/j.compedu.2009.05.013 .

Evans, C. (2014). Twitter for teaching: Can social media be used to enhance the process of learning? British Journal of Educational Wiley Online Library, 45 (5), 902–915. https://doi.org/10.1111/bjet.12099 .

Fan, W., & Williams, C. M. (2010). The effects of parental involvement on students’ academic self-efficacy, engagement and intrinsic motivation. Educational Psychology, 30 (1), 53–74. https://doi.org/10.1080/01443410903353302 .

Field, A. (2011). Discovering statistics using SPSS: (and sex and drugs and rock'n'roll) (Vol. 497). London: Sage.

Field, A. (2013). Factor analysis using SPSS. Scientific Research and Essays, 22 (June), 1–26. https://doi.org/10.1016/B978-0-444-52272-6.00519-5 .

Fisher, M., & Baird, D. E. (2006). Making mLearning work: Utilizing mobile technology for active exploration, collaboration, assessment, and reflection in higher education. Journal of Educational Technology Systems, 35 (1), 3–30.

Gikas, J., & Grant, M. M. (2013). Mobile computing devices in higher education: Student perspectives on learning with cellphones, smartphones &amp; social media. Internet and Higher Education Mobile, 19 , 18–26. https://doi.org/10.1016/j.iheduc.2013.06.002 .

Greenhow, C. (2011a). Online social networks and learning. On the horizon, 19 (1), 4–12.

Greenhow, C. (2011b). Youth, learning, and social media. Journal of Educational Computing Research, 45 (2), 139–146. https://doi.org/10.2190/EC.45.2.a .

Hair Anderson, R. E., Tatham, R. L., & Black, W. C. (1992). Multivariate data analysis. International Journal of Pharmaceutics . https://doi.org/10.1016/j.ijpharm.2011.02.019 .

Hair Jr., J. F., Sarstedt, M., Matthews, L. M., & Ringle, C. M. (2016). Identifying and treating unobserved heterogeneity with FIMIX-PLS: part I–method. European Business Review.

Harrington, D. (2009). Confirmatory factor analysis . Oxford university press.

Haryono, S., & Wardoyo, P. (2012). Structural Equation Modeling (Vol. 331).

Henseler, J., & Sarstedt, M. (2013). Goodness-of-fit indices for partial least squares path modeling. Computational Statistics, 28 (2), 565–580.

Jackson, C. (2011). Your students love social media… and so can you. Teaching Tolerance, 39 , 38–41.

Junco, R., Heiberger, G., & Loken, E. (2011). The effect of twitter on college student engagement and grades. Journal of Computer Assisted Learning, 27 (2), 119–132.

Kabilan, M. K., Ahmad, N., & Abidin, M. J. Z. (2010). Facebook: An online environment for learning of English in institutions of higher education? The Internet and Higher Education, 13 (4), 179–187.

Kaplan, A. M., & Haenlein, M. (2010). Users of the world, unite! The challenges and opportunities of social media. Business Horizons, 53 (1), 59–68.

Kock, N. (2011). Using WarpPLS in e-collaboration studies: Mediating effects, control and second order variables, and algorithm choices. International Journal of e-Collaboration (IJeC), 7 (3), 1–13.

Kuh, G. D. (2007). What student engagement data tell us about college readiness. Peer Review, 9 (1), 4–8.

Kukulska-Hulme, A., & Shield, L. (2008). An overview of mobile assisted language learning: From content delivery to supported collaboration and interaction. ReCALL, 20 (3), 271–289.

Lee, S.-Y. (2007). Structural equation modeling: A Bayesian approach (Wiley series in probability and statistics). Ecotoxicology and Environmental Safety, 73 . https://doi.org/10.1016/j.ecoenv.2009.09.012 .

Leece, R. (2011). Engaging students through social media. Journal of the Australian and New Zealand Student Services Association, 38 , 10–14 Retrieved from https://www.researchgate.net/profile/Anthony_Jorm/publication/235003484_Introduction_to_guidelines_for_tertiary_education_institutions_to_assist_them_in_supporting_students_with_mental_health_problems/links/0c96052ba5314e1202000000.pdf#page=67 .

Lenhart, A., Arafeh, S., & Smith, A. (2008). Writing, technology and teens . Pew Internet & American Life Project.

Lenhart, A., Madden, M., & Hitlin, P. (2005). Teens and technology (p. 2008). Washington, DC: Pew Charitable Trusts Retrieved September 29.

Liccardi, I., Ounnas, A., Pau, R., Massey, E., Kinnunen, P., Lewthwaite, S., …, Sarkar, C. (2007). The role of social networks in students’ learning experiences. In ACM Sigcse Bulletin (39, 4, 224–237).

Ma, W. W. K., & Yuen, A. H. K. (2011). Understanding online knowledge sharing: An interpersonal relationship perspective. Computers & Education, 56 (1), 210–219.

Madden, M., & Zickuhr, K. (2011). 65% of online adults use social networking sites. Pew Internet & American Life Project, 1 , 14.

Meyer, K. A. (2010). A comparison of web 2.0 tools in a doctoral course. The Internet and Higher Education, 13 (4), 226–232.

Mirela Mabić, D. G. (2014). Facebook as a learning tool. Igarss, 2014 (1), 1–5. https://doi.org/10.1007/s13398-014-0173-7.2 .

Mooi, E., & Sarstedt, M. (2011). A concise guide to market research: The process, data, and methods using IBM SPSS statistics . Springeringer. https://doi.org/10.1007/978-3-642-12541-6 .

Moqbel, M., Nevo, S., & Kock, N. (2013). Organizational members’ use of social networking sites and job performance. Information Technology & People, 26 (3), 240–264. https://doi.org/10.1108/ITP-10-2012-0110 .

Moran, M., Seaman, J., & Tinti-Kane, H. (2011). Teaching, learning, and sharing: How Today’s higher education faculty use social media (pp. 1–16). Babson survey research group, (April. https://doi.org/10.1016/j.chb.2013.06.015 .

Nasir, J. A., & Khan, N. A. (2018). Faculty member usage of social media and mobile devices in higher education institution. International Journal of Advance and Innovative Research, 6 (1), 17–25.

Nasir, J. A., Khatoon, A., & Bharadwaj, S. (2018). Social media users in India: A futuristic approach. International Journal of Research and Analytical Reviews, 5 (4), 762–765 Retrieved from http://ijrar.com/ .

Nihalani, P. K., & Mayrath, M. C. (2010). Statistics I. Findings from using an iPhone app in a higher education course. In White Paper .

Norusis, M. (2011). IBM SPSS statistics 20 brief guide (pp. 1–170). IBM Corporation Retrieved from http://www.ibm.com/support .

Novak, E., Razzouk, R., & Johnson, T. E. (2012). The educational use of social annotation tools in higher education: A literature review. The Internet and Higher Education, 15 (1), 39–49.

Nunnally, J. C., & Bernstein, I. H. (1994). Psychological theory .

Pineda-Báez, C., José-Javier, B. A., Rubiano-Bello, Á., Pava-García, N., Suárez-García, R., & Cruz-Becerra, F. (2014). Student engagement and academic performance in the Colombian University context. RELIEVE-Revista Electrónica de Investigación y Evaluación Educativa, 20 (2), 1–19.

Raykov, T., & Marcoulides, G. A. (2000). A First Course in Structural Equation Modeling .

Redecker, C., Ala-Mutka, K., & Punie, Y. (2010). Learning 2.0-the impact of social media on learning in Europe. Policy brief. JRC scientific and technical report. EUR JRC56958 EN, Available from http://bit.ly/cljlpq . Accessed 6 Feb 2011.

Reuben, B. R. (2008). The use of social Media in Higher Education for marketing and communications : A guide for professionals in higher education (Vol. 5) Retrieved from httpdoteduguru comwpcontentuploads200808socialmediainhighereducation pdf)). https://doi.org/10.1108/S2044-9968(2012)0000005018 .

Book   Google Scholar  

Reyes, M. R., Brackett, M. A., Rivers, S. E., White, M., & Salovey, P. (2012). Classroom emotional climate, student engagement, and academic achievement. Journal of Educational Psychology, 104 (3), 700–712. https://doi.org/10.1037/a0027268 .

Richardson, J., & Lenarcic, J. (2008). Text Messaging as a Catalyst for Mobile Student Administration: The “Trigger” Experience. International Journal of Emerging Technologies & Society, 6 (2), 140–155.

Roblyer, M. D., McDaniel, M., Webb, M., Herman, J., & Witty, J. V. (2010). Findings on Facebook in higher education: A comparison of college faculty and student uses and perceptions of social networking sites. The Internet and Higher Education, 13 (3), 134–140.

Rock, M. L., & Thead, B. K. (2007). The effects of fading a strategic self-monitoring intervention on students’ academic engagement, accuracy, and productivity. Journal of Behavioral Education, 16 (4), 389–412. https://doi.org/10.1007/s10864-007-9049-7 .

Rodriguez, J. E. (2011). Social media use in higher education : Key areas to consider for educators. MERLOT Journal of Online Learning and Teaching, 7 (4), 539–550 https://doi.org/ISSN1558-9528 .

Rutherford, C. (2010). Using online social media to support Preservice student engagement. MERLOT Journal of Online Learning and Teaching, 6 (4), 703–711 Retrieved from http://jolt.merlot.org/vol6no4/rutherford_1210.pdf .

Schumacker, R. E., & Lomax, R. G. (2010). A Beginner’s Guide to structural equation modeling (3rd ed.). New York: Taylor & Francis Group.

Selwyn, N. (2012). Making sense of young people, education and digital technology: The role of sociological theory. Oxford Review of Education, 38 (1), 81–96.

Shih, Y. E. (2007). Setting the new standard with mobile computing in online learning. The International Review of Research in Open and Distributed Learning, 8 (2), 1–16.

Skinner, E. A., & Belmont, M. J. (1993). Motivation in the classroom: Reciprocal effects of teacher behavior and student engagement across the school year. Journal of educational psychology, 85 (4), 571.

Tabachnick, B. G., Fidell, L. S., & Ullman, J. B. (2007). Using multivariate statistics (Vol. 5). Boston: Pearson.

Voorn, R. J., & Kommers, P. A. (2013). Social media and higher education: Introversion and collaborative learning from the student’s perspective. International Journal of Social Media and Interactive Learning Environments, 1 (1), 59–73.

Wankel, C. (2009). Management education using social media. Organization Management Journal, 6 (4), 251–262.

Williams, M. D., Rana, N. P., & Dwivedi, Y. K. (2015). The unified theory of acceptance and use of technology (UTAUT): a literature review. Journal of Enterprise Information Management, 28 (3), 443–488.

Zhu, C. (2012). Student satisfaction, performance, and knowledge construction in online collaborative learning. Journal of Educational Technology & Society, 15 (1), 127–136.

Download references

Acknowledgements

We want to express our special gratitude to the Almighty who has blessed us with such hidden talent to give the shape of this research paper.

The authors of this manuscript, solemnly declared that no funding agency was supported to execute this research project.

Author information

Authors and affiliations.

Department of Commerce, Aligarh Muslim University, Aligarh, 202002, India

Jamal Abdul Nasir Ansari & Nawab Ali Khan

You can also search for this author in PubMed   Google Scholar

Contributions

Jamal Abdul Nasir Ansari: The first author of this manuscript has performed all sorts of necessary works like the collection of data from respondents, administration of the questionnaire. Collection of information from the respondents was quite challenging. The author faced a lot of difficulties while collecting data. The main contribution of the author in this manuscript is that the entire work, like data analysis and its interpretation performed by him. Additionally, the author has tried to explore and usefulness of social media and its applicability in transferring the course contents. Nawab Ali Khan: The second author of this manuscript has checked all types of grammatical issues, and necessary corrections wherever required. The author(s) read and approved the final manuscript.

Corresponding author

Correspondence to Jamal Abdul Nasir Ansari .

Ethics declarations

Competing interests.

The authors declare that they have no competing interests.

Additional information

Publisher’s note.

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License ( http://creativecommons.org/licenses/by/4.0/ ), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.

Reprints and permissions

About this article

Cite this article.

Ansari, J.A.N., Khan, N.A. Exploring the role of social media in collaborative learning the new domain of learning. Smart Learn. Environ. 7 , 9 (2020). https://doi.org/10.1186/s40561-020-00118-7

Download citation

Received : 27 November 2019

Accepted : 18 February 2020

Published : 16 March 2020

DOI : https://doi.org/10.1186/s40561-020-00118-7

Share this article

Anyone you share the following link with will be able to read this content:

Sorry, a shareable link is not currently available for this article.

Provided by the Springer Nature SharedIt content-sharing initiative

  • Social media
  • Higher education
  • Faculty members

social media platforms research paper

  • - Google Chrome

Intended for healthcare professionals

  • My email alerts
  • BMA member login
  • Username * Password * Forgot your log in details? Need to activate BMA Member Log In Log in via OpenAthens Log in via your institution

Home

Search form

  • Advanced search
  • Search responses
  • Search blogs
  • News & Views
  • Understanding the...

Understanding the complex links between social media and health behaviour

How are social media influencing vaccination read the full collection.

  • Related content
  • Peer review
  • Fabiana Zollo , professor 1 2 ,
  • Cornelia Betsch , professor 4 5 ,
  • Marco Delmastro , research fellow 1 6 ,
  • 1 Ca’ Foscari University of Venice, Venice, Italy
  • 2 New Institute Centre for Environmental Humanities, Venice, Italy
  • 3 City University of London, London, United Kingdom
  • 4 University of Erfurt, Erfurt, Germany
  • 5 Bernhard Nocht Institute for Tropical Medicine, Hamburg, Germany
  • 6 Enrico Fermi Center for Study and Research, Rome, Italy
  • 7 Sapienza University, Rome, Italy
  • Correspondence to: F Zollo fabiana.zollo{at}unive.it

Fabiana Zollo and colleagues call for comprehensive, robust research on the influence of social media on health behaviour in order to improve public health responses

Key messages

Monitoring social media is important to understand public perceptions, biases, and false beliefs

Drawing conclusions on how social media affects health behaviour is difficult because measures are unstandardised, sources are limited, and data are incomplete and biased

Rigorous research is needed from varied settings and demographics to improve understanding of the effect of social media on health behaviour

Over 90% of people connected to the internet are active on social media, with a total of 4.76 billion users worldwide in January 2023. 1 The digital revolution has reshaped the news landscape and changed the way users interact with information. Social media’s targeted communication rapidly reaches vast audiences, who in turn actively participate in shaping and engaging with content. This marks a departure from the more passive consumption patterns associated with traditional media.

Over the past few years, social media have emerged as a primary source of news for many people, despite widespread user concerns about potential misinformation ( box 1 ) and the necessity to discern between reliable and untrustworthy information. 4 Data from six continents also indicate a preference among users for content that reflects their reading or viewing history, rather than content selected by journalists, suggesting a shift towards personalised and user driven content curation. In this evolving landscape, celebrities, influencers, and social media personalities are increasingly assuming roles as news sources, especially on platforms such as TikTok, Instagram, and Snapchat.

What is misinformation?

The term “misinformation” is commonly used, yet its definitions can vary between studies, methods, and scholars, leading to disagreements on its precise meaning 2

Misinformation encompasses false, inaccurate, or misleading information, and is often distinguished from disinformation by its lack of deliberate creation and dissemination with the intent to deceive. Classifications may also extend to conspiracy theories and propaganda

Misinformation poses a substantial risk to public health since it can undermine compliance with important public health measures such as vaccination uptake or physical distancing guidelines 3

Public health organisations have recognised the crucial role of social media in shaping the public debate and are working to utilise social media platforms to inform the public, combat misinformation, and improve health knowledge, attitudes, and behaviour. However, causal research on how social media information affects actual health behaviour is inconclusive, primarily because of methodological challenges associated with connecting online activity to offline actions and accurately measuring behavioural outcomes. 5 Thus, exploring the complex relations between information consumption, personal beliefs, and societal effects remains an important area of study. The development of vaccines against covid-19 was accompanied by an infodemic—an overabundance of information, not all of which is accurate. 6 Study of this phenomenon provides useful insight into the interplay between social media and health behaviours and the opportunities and challenges for research and practice.

Access to data on misinformation and health behaviour

Social media have provided unprecedented opportunities through which health information, including misinformation, can be amplified and spread. However, the impact of exposure to and interaction with misinformation on health behaviour remains a subject of debate within the scientific community. 7 While considerable evidence exists from research indicating that misinformation can affect knowledge, attitudes, or behavioural intentions, reaching a consensus in the scientific community on the links between social media and actual health behaviour has been challenging due to lack of data and inherent limitations in study design.

A recent systematic review of randomised controlled trials, for example, highlighted the need for more conceptual and theoretical work on the causal pathways through which misinformation shapes people’s beliefs and behaviours. 8 This influence is often indirect, meaning that exposure to misinformation may affect changes in health behaviour by shaping psychological factors such as beliefs, feelings, and motivations (the so called psychological antecedents) which are commonly used to explain and predict behaviours. However, the roles of potential mediators such as emotions, social norms, and trust are still poorly understood. While all the studies in that review assessed the effect of misinformation on antecedents (intentions, attitudes, and subjective norms), only two of them measured actual behaviour. These studies included behavioural measures of activism, such as the act of signing petitions, yet none examined the effects of misinformation exposure on direct health measures or behaviours, such as vaccination. Indeed, the literature is unclear about the causal effect of individual online activity on behaviour. For example, while some research shows that risk perceptions and vaccination intentions can be affected by short visits to antivaccination websites, 9 exposure to antivaccination comments posted on news stories online appears to have little influence on individuals’ perspectives regarding vaccines, although it could potentially undermine individuals’ trust in important health communication institutions. 10

Furthermore, drawing links or establishing causality is not a trivial endeavour. One important obstacle to understanding the effect of social media on behaviour lies in the challenge of linking online activity with offline behaviour. This difficulty stems from factors such as data scarcity and privacy concerns, particularly regarding personal and sensitive information, that complicate efforts to assess how online interactions translate into real word actions. Establishing a clear connection between information consumption on social media platforms and tangible behavioural outcomes, while excluding the influence of external variables, is a complex task, especially when examining behaviour over medium to long term periods. Examining behaviour in the long term requires longitudinal data, which are often lacking due to the resources (such as costs and time) required for such research. Adding to these challenges is the lack of standardised measures and definitions across studies. As we have seen, misinformation is not unanimously defined ( box 1 ), and health behaviours also encompass a variety of actions—in the covid-19 pandemic alone, behaviours ranged from adherence to nonpharmaceutical measures such as physical distancing to lockdowns, from handwashing to vaccinations. Even with clear definitions, measuring health behaviours reliably and accurately remains a considerable challenge. 11 Many studies rely heavily on self reported data, which may have a low correlation with objectively measured behaviour. Moreover, many differences exist between countries in terms of how data of this nature are collected. This variability makes it difficult to extrapolate definitive findings from different settings and contexts.

The covid-19 pandemic presented a unique opportunity to further investigate the potential effect of infodemics on health behaviour, especially on vaccine hesitancy and refusal. The evidence in the literature paints a complex picture of the relationship between social media misinformation and vaccination. On the one hand, researchers have identified a negative relationship between sharing misinformation online and vaccination uptake in the United States of America. 12 Similarly, a study in the UK and US suggests that exposure to misinformation reduces individuals’ intention to vaccinate for their own and others’ protection. 13 These findings highlight the potentially detrimental effect of misinformation on public health efforts. A large review of 205 articles looked more specifically into conspiracies around vaccination (under review by journal not yet accepted). While some studies showed causal evidence of exposure to conspiracies on vaccination intentions, most studies were correlative and behaviour was not investigated. Thus, the findings of many studies suggest that uncertainty persists on the causality of this relationship. Further investigation into the association between social media behaviour and attitudes towards covid-19 vaccines showed that vaccine hesitancy was associated with interaction and consumption of low quality information online. 14 These results remained significant even after accounting for relevant variables, suggesting that social media behaviour may play an important role in predicting vaccine attitudes. Supporting this finding, a recent systematic review examining the role of social media as a predictor of covid-19 vaccine outcomes showed predominantly negative associations between social media predictors and vaccine perceptions, in particular concerning vaccine hesitancy. However, the evidence suggests a multifaceted landscape, with findings varying across different social media predictors, populations, and platforms. 15 Moreover, while concerns about infodemics shaping individuals’ behavioural intentions are prevalent, some findings suggest a more nuanced reality. Despite the proliferation of information and debate on covid-19 vaccines, the relatively stable and positive trend in vaccine acceptance rates at an aggregated level challenges simplistic explanations about the effect of misinformation. 16

Overall, despite the importance of the effect of social media and misinformation on health behaviour and the extensive assumptions within policy debates, the literature fails to provide definitive conclusions on a clear association between social media and health behaviour. As discussed earlier, measuring behavioural change is challenging due to the scarcity of studies incorporating actual behavioural measures, limitations in laboratory experiments, and difficulties in establishing connections with online activity. Studies are often confined to specific geographical areas, primarily Western countries (notably the US), or limited to specific time periods. In addition, data samples are often constrained by the lack of comprehensive information, such as demographics or geolocation. Furthermore, longitudinal studies are required with extensive access to social media behaviour as well as access to actual behavioural data.

Therefore, further studies are needed to assess the causal effect of social media on offline behaviour. This will require overcoming the ethical issues of data linkage and protection. Such studies will need to integrate social media data with information from different sources, adjusting statistical methods to handle sampling biases, and accounting for the inherent dynamics of social media discussions, which are often characterised by extreme polarisation and user segregation.

Social media dynamics and health

Social media debates are often marked by intense segregation. Users tend to seek out information that aligns with their existing beliefs while dismissing opposing viewpoints. Social media platforms, especially those employing content filtering algorithms, tend to exploit this natural tendency by favouring content aligned with the user’s history and preferences. 17 After all, platforms such as Facebook are built on the foundational unit of the “like,” which represents the most fundamental action a user can take within the environment. Selective exposure to like minded content can contribute to the formation of echo chambers—that is, well separated groups of like minded users—where individuals are surrounded by others who share similar opinions. This phenomenon can act as a breeding ground for the spread of misinformation and hinder its correction. 18 Analysis of Facebook users has shown the existence of opposing and separate communities— provaccine and antivaccine—with the latter group generally being more active. 19 Another study on the public discussion on covid-19 vaccines found similar results, showing users’ inclination to interact with like minded individuals and the presence of segregated communities, with antivaccine groups exhibiting greater cohesiveness and stability over time. 20 Recent research has found that, on Facebook, like minded sources—that is, sources that align with users’ political leanings— are indeed prevalent in what people see on the platform, although they do not seem to affect polarisation. In other words, no measurable effect on polarisation was seen when exposure to content from like minded sources was reduced. 21 Echo chambers and user segregation are crucial factors, as provaccination campaigns, for example, may become confined to individuals who already support vaccination, thus limiting their overall effectiveness. Recent research has explored how users engaged with covid-19 information on social media, and how such engagement changed over time. 22 Despite earlier findings suggesting that false news might spread faster than trustworthy information, analysis of various platforms indicates no substantial difference in the spread of reliable versus questionable information. Posts and interactions with misinformation sources follow similar growth patterns to those of reliable ones, although scaling factors specific to the platform apply. Mainstream platforms and Reddit have a smaller proportion of posts from questionable sources relative to reliable ones, while Gab stands out by notably amplifying posts from questionable sources. These results suggest that the primary drivers behind the spreading of reliable information and misinformation are the specific rules of the platform and the behaviours of groups and individuals engaged in the discussion, rather than the nature of the content.

Opportunities and challenges for using social media to improve health

The public actively engages in public debates through social media platforms based on their prior perceptions and beliefs. Identity is important; the extent to which people identify with their vaccination status is linked to the way the social media platforms are used. People who identify more strongly with being unvaccinated are less likely to use traditional news sources and rely more on information from social media and messaging services. 23 In this context, monitoring social media has become an essential and powerful tool for a dynamic and real time understanding of the information available to large parts of the public, their perceptions, and the presence of biases and false beliefs. The vast amount of data generated online enables the exploration and analysis of sociocognitive factors underlying the consumption and processing of information. When examined and aggregated, these data can provide valuable insights and reveal hidden patterns on people’s perspectives. These insights can, in turn, support public communication efforts, ranging from monitoring public sentiment, concerns, and reactions to helping identify the informational needs of the population. Ultimately, this information can drive the development of recommendations aimed at improving the effectiveness of communication strategies and health measures. For instance, a recent World Health Organization manual offers a guide to addressing the gap between health guidance recommendations and population behaviour using social listening. 24 Social media sources can be used to respond to specific question of concern, such as understanding why a certain community remains undervaccinated despite widespread availability of vaccines and strong recommendations for vaccination. This approach may facilitate a deeper understanding of the information environment of the population, their behaviour in seeking health information, and their health behaviours, thereby enabling the development of tailored strategies and recommendations.

Social media analyses usually rely on large amounts of data. However, it is important to acknowledge that these data may relate to unrepresentative segments of the population. 25 Therefore, it is crucial to pay careful attention to sample creation, which involves selecting a smaller subset of data from a larger population using a predefined selection method. This statistical challenge, known as sample selection bias, must be duly considered when seeking information about the overall population or specific groups who are less inclined to use social media. Although often oversimplified, social media presents a varied landscape, and the extent of sample selection problems may vary across countries and platforms. 26 For instance, Facebook usually covers a broader spectrum of the population in terms of both audience size and diversity of social groups, while X (formerly Twitter) and TikTok predominantly cater to specific subgroups, such as professionals and younger individuals. Additionally, the varying levels of user engagement in actively participating in conversations on social networks through comments, posts, likes, and other forms of interaction can also contribute to sample bias. Combining social data with information from other sources (for example, census data, electoral rolls, surveys, and health data) and employing statistical methods to adjust for sampling biases is thus crucial to obtain solid research outcomes (see, in another context, previous work on the Brexit referendum 27 ). Such considerations are important for health communication campaigns that are inclusive and resonate with target audiences.

Learning the lessons

The covid-19 pandemic has heightened concerns about the potential effect of misinformation in posing risks to public health. Yet, the issue extends beyond the recent crisis, and is important in shaping our response to future pandemics. Ensuring dissemination of accurate information is essential not only to safeguard public health in the present but also to mitigate risks and enhance preparedness for potential future crises. Assessing the effect of social media use on health behaviour is a complex task, with current evidence yet to be consolidated. To avoid biased outcomes, a comprehensive, multidimensional, and causal approach is necessary when investigating the interplay between online information and real world behaviour. Understanding causal relationships and their drivers will allow interventions to be developed to reduce the detrimental effects of online information on health. It is also essential to define clear outcomes. Indeed, online information about health can cover various aspects, including the formation of public opinion, effects on public discourse and agenda setting, interactions between doctors and patients, as well as influences on health behaviours in the short, medium, or long term. 28

Further research is required to identify vulnerable populations and gain a better understanding of sociodemographic and ideological factors influencing users’ behaviour. Additionally, cultural differences in information consumption and behaviours must also be considered to develop targeted and effective interventions and mitigate the influence of health misinformation. 29

Addressing these questions requires robust data and study designs, with collaboration from digital platforms being crucial in accessing such data. A recent cooperation with Meta 30 allowed researchers to conduct multiple experiments and provided extensive access to user data from Facebook and Instagram. However, the success of this model relies entirely on the willingness of social media companies to participate. This highlights the need for an ethical, transparent collaboration and advocates for the democratisation of social media research through equitable data access. 31 Future studies should replicate these efforts in contexts other than politics, such as health, and expand research beyond the US to achieve a more comprehensive understanding of the effect of social media on behaviour globally.

Acknowledgments

AB, FZ, and WQ acknowledge support from the IRIS Infodemic Coalition (UK government, grant No SCH-00001-3391).

Contributors and sources: The authors have collective experience in studying social dynamics and misinformation. AB, FZ, and WQ have expertise in data science and are cofounders of the IRIS Academic Research Group, dedicated to understanding infodemics and fostering healthy information ecosystems through cross disciplinary collaboration. CB specialises in health communication and decision making. MD is an economist with expertise in investigating social issues, including public discourse and health related decisions. All authors contributed to the writing of the paper and developing the list of references. FZ is the guarantor.

Competing interests: We have read and understood the BMJ policy on declaration of interests and have no interests to declare.

Provenance and peer review Commissioned; externally peer reviewed.

This article is part of a collection that was proposed by the Advancing Health Online Initiative (AHO), a consortium of partners including Meta and MSD, and several non-profit collaborators ( https://www.bmj.com/social-media-influencing-vaccination ). Research articles were submitted following invitations by The BMJ and associated BMJ journals, after consideration by an internal BMJ committee. Non-research articles were independently commissioned by The BMJ with advice from Sander van der Linden, Alison Buttenheim, Briony Swire-Thompson, and Charles Shey Wiysonge. Peer review, editing, and decisions to publish articles were carried out by the respective BMJ journals. Emma Veitch was the editor for this collection.

This is an Open Access article distributed in accordance with the Creative Commons Attribution Non Commercial (CC BY-NC 4.0) license, which permits others to distribute, remix, adapt, build upon this work non-commercially, and license their derivative works on different terms, provided the original work is properly cited and the use is non-commercial. See: http://creativecommons.org/licenses/by-nc/4.0/ .

  • ↵ Kemp S. Digital 2023: global overview report. DataReportal–Global Digital Insights, 2023. https://datareportal.com/reports/digital-2023-global-overview-report
  • Kozyreva A ,
  • Lewandowsky S ,
  • van der Linden S
  • ↵ Reuters Institute digital news report. 2023. https://reutersinstitute.politics.ox.ac.uk/sites/default/files/2023-06/Digital_News_Report_2023.pdf
  • Miller BS ,
  • Manning EM ,
  • ↵ World Health Organization. Infodemic. 2023 https://www.who.int/health-topics/infodemic
  • Briand SC ,
  • Cinelli M ,
  • Renkewitz F ,
  • DeVerna MR ,
  • de Figueiredo A ,
  • Piatek SJ ,
  • de Graaf K ,
  • Roozenbeek J ,
  • Van Bavel JJ ,
  • McKinley CJ ,
  • ↵ Valensise CM, Cinelli M, Nadini M, Galeazzi A, Peruzzi A, Etta G, et al. Lack of evidence for correlation between COVID-19 infodemic and vaccine acceptance. arXiv 2021:2107.07946. [Preprint.] doi: 10.48550/arXiv.2107.07946
  • De Francisci Morales G ,
  • Galeazzi A ,
  • Quattrociocchi W ,
  • Del Vicario M ,
  • Schmidt AL ,
  • Quattrociocchi W
  • Santoro A ,
  • Scantamburlo T ,
  • Baronchelli A ,
  • Thorson E ,
  • Sprengholz P ,
  • ↵ World Health Organization, United Nations Children’s Fund. How to build an infodemic insights report in six steps. 2023. https://www.who.int/publications/i/item/9789240075658
  • Olteanu A ,
  • Castillo C ,
  • Cherepnalkoski D ,
  • Mozetič I ,
  • Kralj Novak P
  • Torbica A ,
  • Lorenz-Spreen P ,
  • ↵ Tollefson J. Tweaking Facebook feeds is no easy fix for polarization, studies find. Nature 2023 Jul 27. doi: 10.1038/d41586-023-02420-z

social media platforms research paper

Social media analytics: a survey of techniques, tools and platforms

  • Open access
  • Published: 26 July 2014
  • Volume 30 , pages 89–116, ( 2015 )

Cite this article

You have full access to this open access article

social media platforms research paper

  • Bogdan Batrinca 1 &
  • Philip C. Treleaven 1  

238k Accesses

335 Citations

75 Altmetric

Explore all metrics

This paper is written for (social science) researchers seeking to analyze the wealth of social media now available. It presents a comprehensive review of software tools for social networking media, wikis, really simple syndication feeds, blogs, newsgroups, chat and news feeds. For completeness, it also includes introductions to social media scraping, storage, data cleaning and sentiment analysis. Although principally a review, the paper also provides a methodology and a critique of social media tools. Analyzing social media, in particular Twitter feeds for sentiment analysis, has become a major research and business activity due to the availability of web-based application programming interfaces (APIs) provided by Twitter, Facebook and News services. This has led to an ‘explosion’ of data services, software tools for scraping and analysis and social media analytics platforms. It is also a research area undergoing rapid change and evolution due to commercial pressures and the potential for using social media data for computational (social science) research. Using a simple taxonomy, this paper provides a review of leading software tools and how to use them to scrape, cleanse and analyze the spectrum of social media. In addition, it discussed the requirement of an experimental computational environment for social media research and presents as an illustration the system architecture of a social media (analytics) platform built by University College London. The principal contribution of this paper is to provide an overview (including code fragments) for scientists seeking to utilize social media scraping and analytics either in their research or business. The data retrieval techniques that are presented in this paper are valid at the time of writing this paper (June 2014), but they are subject to change since social media data scraping APIs are rapidly changing.

Similar content being viewed by others

social media platforms research paper

A survey on sentiment analysis methods, applications, and challenges

Advances in social media research: past, present and future.

social media platforms research paper

Artificial intelligence in E-Commerce: a bibliometric study and literature review

Avoid common mistakes on your manuscript.

1 Introduction

Social media is defined as web-based and mobile-based Internet applications that allow the creation, access and exchange of user-generated content that is ubiquitously accessible (Kaplan and Haenlein 2010 ). Besides social networking media (e.g., Twitter and Facebook), for convenience, we will also use the term ‘social media’ to encompass really simple syndication (RSS) feeds, blogs, wikis and news, all typically yielding unstructured text and accessible through the web. Social media is especially important for research into computational social science that investigates questions (Lazer et al. 2009 ) using quantitative techniques (e.g., computational statistics, machine learning and complexity) and so-called big data for data mining and simulation modeling (Cioffi-Revilla 2010 ).

This has led to numerous data services, tools and analytics platforms. However, this easy availability of social media data for academic research may change significantly due to commercial pressures. In addition, as discussed in Sect. 2 , the tools available to researchers are far from ideal. They either give superficial access to the raw data or (for non-superficial access) require researchers to program analytics in a language such as Java.

1.1 Terminology

We start with definitions of some of the key techniques related to analyzing unstructured textual data:

Natural language processing —(NLP) is a field of computer science, artificial intelligence and linguistics concerned with the interactions between computers and human (natural) languages. Specifically, it is the process of a computer extracting meaningful information from natural language input and/or producing natural language output.

News analytics —the measurement of the various qualitative and quantitative attributes of textual (unstructured data) news stories. Some of these attributes are: sentiment , relevance and novelty .

Opinion mining —opinion mining (sentiment mining, opinion/sentiment extraction) is the area of research that attempts to make automatic systems to determine human opinion from text written in natural language.

Scraping —collecting online data from social media and other Web sites in the form of unstructured text and also known as site scraping, web harvesting and web data extraction.

Sentiment analysis —sentiment analysis refers to the application of natural language processing, computational linguistics and text analytics to identify and extract subjective information in source materials.

Text analytics —involves information retrieval (IR), lexical analysis to study word frequency distributions, pattern recognition, tagging/annotation, information extraction, data mining techniques including link and association analysis, visualization and predictive analytics.

1.2 Research challenges

Social media scraping and analytics provides a rich source of academic research challenges for social scientists, computer scientists and funding bodies. Challenges include:

Scraping —although social media data is accessible through APIs, due to the commercial value of the data, most of the major sources such as Facebook and Google are making it increasingly difficult for academics to obtain comprehensive access to their ‘raw’ data; very few social data sources provide affordable data offerings to academia and researchers. News services such as Thomson Reuters and Bloomberg typically charge a premium for access to their data. In contrast, Twitter has recently announced the Twitter Data Grants program, where researchers can apply to get access to Twitter’s public tweets and historical data in order to get insights from its massive set of data (Twitter has more than 500 million tweets a day).

Data cleansing —cleaning unstructured textual data (e.g., normalizing text), especially high-frequency streamed real-time data, still presents numerous problems and research challenges.

Holistic data sources —researchers are increasingly bringing together and combining novel data sources: social media data, real-time market & customer data and geospatial data for analysis.

Data protection —once you have created a ‘big data’ resource, the data needs to be secured, ownership and IP issues resolved (i.e., storing scraped data is against most of the publishers’ terms of service), and users provided with different levels of access; otherwise, users may attempt to ‘suck’ all the valuable data from the database.

Data analytics —sophisticated analysis of social media data for opinion mining (e.g., sentiment analysis) still raises a myriad of challenges due to foreign languages, foreign words, slang, spelling errors and the natural evolving of language.

Analytics dashboards —many social media platforms require users to write APIs to access feeds or program analytics models in a programming language, such as Java. While reasonable for computer scientists, these skills are typically beyond most (social science) researchers. Non-programming interfaces are required for giving what might be referred to as ‘deep’ access to ‘raw’ data, for example, configuring APIs, merging social media feeds, combining holistic sources and developing analytical models.

Data visualization —visual representation of data whereby information that has been abstracted in some schematic form with the goal of communicating information clearly and effectively through graphical means. Given the magnitude of the data involved, visualization is becoming increasingly important.

1.3 Social media research and applications

Social media data is clearly the largest, richest and most dynamic evidence base of human behavior, bringing new opportunities to understand individuals, groups and society. Innovative scientists and industry professionals are increasingly finding novel ways of automatically collecting, combining and analyzing this wealth of data. Naturally, doing justice to these pioneering social media applications in a few paragraphs is challenging. Three illustrative areas are: business, bioscience and social science.

The early business adopters of social media analysis were typically companies in retail and finance. Retail companies use social media to harness their brand awareness, product/customer service improvement, advertising/marketing strategies, network structure analysis, news propagation and even fraud detection. In finance, social media is used for measuring market sentiment and news data is used for trading. As an illustration, Bollen et al. ( 2011 ) measured sentiment of random sample of Twitter data, finding that Dow Jones Industrial Average (DJIA) prices are correlated with the Twitter sentiment 2–3 days earlier with 87.6 percent accuracy. Wolfram ( 2010 ) used Twitter data to train a Support Vector Regression (SVR) model to predict prices of individual NASDAQ stocks, finding ‘significant advantage’ for forecasting prices 15 min in the future.

In the biosciences, social media is being used to collect data on large cohorts for behavioral change initiatives and impact monitoring, such as tackling smoking and obesity or monitoring diseases. An example is Penn State University biologists (Salathé et al. 2012 ) who have developed innovative systems and techniques to track the spread of infectious diseases, with the help of news Web sites, blogs and social media.

Computational social science applications include: monitoring public responses to announcements, speeches and events especially political comments and initiatives; insights into community behavior; social media polling of (hard to contact) groups; early detection of emerging events, as with Twitter. For example, Lerman et al. ( 2008 ) use computational linguistics to automatically predict the impact of news on the public perception of political candidates. Yessenov and Misailovic ( 2009 ) use movie review comments to study the effect of various approaches in extracting text features on the accuracy of four machine learning methods—Naive Bayes, Decision Trees, Maximum Entropy and K-Means clustering. Lastly, Karabulut ( 2013 ) found that Facebook’s Gross National Happiness (GNH) exhibits peaks and troughs in-line with major public events in the USA.

1.4 Social media overview

For this paper, we group social media tools into:

Social media data —social media data types (e.g., social network media, wikis, blogs, RSS feeds and news, etc.) and formats (e.g., XML and JSON). This includes data sets and increasingly important real-time data feeds, such as financial data, customer transaction data, telecoms and spatial data.

Social media programmatic access —data services and tools for sourcing and scraping (textual) data from social networking media, wikis, RSS feeds, news, etc. These can be usefully subdivided into:

Data sources, services and tools —where data is accessed by tools which protect the raw data or provide simple analytics. Examples include: Google Trends, SocialMention, SocialPointer and SocialSeek, which provide a stream of information that aggregates various social media feeds.

Data feeds via APIs —where data sets and feeds are accessible via programmable HTTP-based APIs and return tagged data using XML or JSON, etc. Examples include Wikipedia, Twitter and Facebook.

Text cleaning and storage tools —tools for cleaning and storing textual data. Google Refine and DataWrangler are examples for data cleaning.

Text analysis tools —individual or libraries of tools for analyzing social media data once it has been scraped and cleaned. These are mainly natural language processing, analysis and classification tools, which are explained below.

Transformation tools —simple tools that can transform textual input data into tables, maps, charts (line, pie, scatter, bar, etc.), timeline or even motion (animation over timeline), such as Google Fusion Tables, Zoho Reports, Tableau Public or IBM’s Many Eyes.

Analysis tools —more advanced analytics tools for analyzing social data, identifying connections and building networks, such as Gephi (open source) or the Excel plug-in NodeXL.

Social media platforms —environments that provide comprehensive social media data and libraries of tools for analytics. Examples include: Thomson Reuters Machine Readable News, Radian 6 and Lexalytics.

Social network media platforms —platforms that provide data mining and analytics on Twitter, Facebook and a wide range of other social network media sources.

News platforms —platforms such as Thomson Reuters providing commercial news archives/feeds and associated analytics.

2 Social media methodology and critique

The two major impediments to using social media for academic research are firstly access to comprehensive data sets and secondly tools that allow ‘deep’ data analysis without the need to be able to program in a language such as Java. The majority of social media resources are commercial and companies are naturally trying to monetize their data. As discussed, it is important that researchers have access to open-source ‘big’ (social media) data sets and facilities for experimentation. Otherwise, social media research could become the exclusive domain of major companies, government agencies and a privileged set of academic researchers presiding over private data from which they produce papers that cannot be critiqued or replicated. Recently, there has been a modest response, as Twitter and Gnip are piloting a new program for data access, starting with 5 all-access data grants to select applicants.

2.1 Methodology

Research requirements can be grouped into: data, analytics and facilities.

Researchers need online access to historic and real-time social media data, especially the principal sources, to conduct world-leading research:

Social network media —access to comprehensive historic data sets and also real-time access to sources, possibly with a (15 min) time delay, as with Thomson Reuters and Bloomberg financial data.

News data —access to historic data and real-time news data sets, possibly through the concept of ‘educational data licenses’ (cf. software license).

Public data —access to scraped and archived important public data; available through RSS feeds, blogs or open government databases.

Programmable interfaces —researchers also need access to simple application programming interfaces (APIs) to scrape and store other available data sources that may not be automatically collected.

2.1.2 Analytics

Currently, social media data is typically either available via simple general routines or require the researcher to program their analytics in a language such as MATLAB, Java or Python. As discussed above, researchers require:

Analytics dashboards —non-programming interfaces are required for giving what might be termed as ‘deep’ access to ‘raw’ data.

Holistic data analysis —tools are required for combining (and conducting analytics across) multiple social media and other data sets.

Data visualization —researchers also require visualization tools whereby information that has been abstracted can be visualized in some schematic form with the goal of communicating information clearly and effectively through graphical means.

2.1.3 Facilities

Lastly, the sheer volume of social media data being generated argues for national and international facilities to be established to support social media research (cf. Wharton Research Data Services https://wrds-web.wharton.upenn.edu ):

Data storage —the volume of social media data, current and projected, is beyond most individual universities and hence needs to be addressed at a national science foundation level. Storage is required both for principal data sources (e.g., Twitter), but also for sources collected by individual projects and archived for future use by other researchers.

Computational facility —remotely accessible computational facilities are also required for: a) protecting access to the stored data; b) hosting the analytics and visualization tools; and c) providing computational resources such as grids and GPUs required for processing the data at the facility rather than transmitting it across a network.

2.2 Critique

Much needs to be done to support social media research. As discussed, the majority of current social media resources are commercial, expensive and difficult for academics to obtain full access.

In general, access to important sources of social media data is frequently restricted and full commercial access is expensive.

Siloed data —most data sources (e.g., Twitter) have inherently isolated information making it difficult to combine with other data sources.

Holistic data —in contrast, researchers are increasingly interested in accessing, storing and combining novel data sources: social media data, real-time financial market & customer data and geospatial data for analysis. This is currently extremely difficult to do even for Computer Science departments.

2.2.2 Analytics

Analytical tools provided by vendors are often tied to a single data set, maybe limited in analytical capability, and data charges make them expensive to use.

2.2.3 Facilities

There are an increasing number of powerful commercial platforms, such as the ones supplied by SAS and Thomson Reuters, but the charges are largely prohibitive for academic research. Either comparable facilities need to be provided by national science foundations or vendors need to be persuaded to introduce the concept of an ‘educational license.’

3 Social media data

Clearly, there is a large and increasing number of (commercial) services providing access to social networking media (e.g., Twitter, Facebook and Wikipedia) and news services (e.g., Thomson Reuters Machine Readable News). Equivalent major academic services are scarce.We start by discussing types of data and formats produced by these services.

3.1 Types of data

Although we focus on social media, as discussed, researchers are continually finding new and innovative sources of data to bring together and analyze. So when considering textual data analysis, we should consider multiple sources (e.g., social networking media, RSS feeds, blogs and news) supplemented by numeric (financial) data, telecoms data, geospatial data and potentially speech and video data. Using multiple data sources is certainly the future of analytics.

Broadly, data subdivides into:

Historic data sets —previously accumulated and stored social/news, financial and economic data.

Real-time feeds —live data feeds from streamed social media, news services, financial exchanges, telecoms services, GPS devices and speech.

Raw data —unprocessed computer data straight from source that may contain errors or may be unanalyzed.

Cleaned data —correction or removal of erroneous (dirty) data caused by disparities, keying mistakes, missing bits, outliers, etc.

Value-added data —data that has been cleaned, analyzed, tagged and augmented with knowledge.

3.2 Text data formats

The four most common formats used to markup text are: HTML, XML, JSON and CSV.

HTML —HyperText Markup Language (HTML) as well-known is the markup language for web pages and other information that can be viewed in a web browser. HTML consists of HTML elements, which include tags enclosed in angle brackets (e.g., <div>), within the content of the web page.

XML —Extensible Markup Language (XML)—the markup language for structuring textual data using <tag>…<\tag> to define elements.

JSON —JavaScript Object Notation (JSON) is a text-based open standard designed for human-readable data interchange and is derived from JavaScript.

CSV —a comma-separated values (CSV) file contains the values in a table as a series of ASCII text lines organized such that each column value is separated by a comma from the next column’s value and each row starts a new line.

For completeness, HTML and XML are so-called markup languages (markup and content) that define a set of simple syntactic rules for encoding documents in a format both human readable and machine readable. A markup comprises start-tags (e.g., <tag>), content text and end-tags (e.g., </tag>).

Many feeds use JavaScript Object Notation (JSON), the lightweight data-interchange format, based on a subset of the JavaScript Programming Language. JSON is a language-independent text format that uses conventions that are familiar to programmers of the C-family of languages, including C, C++, C#, Java, JavaScript, Perl, Python, and many others. JSON’s basic types are: Number, String, Boolean, Array (an ordered sequence of values, comma-separated and enclosed in square brackets) and Object (an unordered collection of key:value pairs). The JSON format is illustrated in Fig.  1 for a query on the Twitter API on the string ‘UCL,’ which returns two ‘text’ results from the Twitter user ‘uclnews.’

JSON Example

Comma-separated values are not a single, well-defined format but rather refer to any text file that: (a) is plain text using a character set such as ASCII, Unicode or EBCDIC; (b) consists of text records (e.g., one record per line); (c) with records divided into fields separated by delimiters (e.g., comma, semicolon and tab); and (d) where every record has the same sequence of fields.

4 Social media providers

Social media data resources broadly subdivide into those providing:

Freely available databases —repositories that can be freely downloaded, e.g., Wikipedia ( http://dumps.wikimedia.org ) and the Enron e-mail data set available via http://www.cs.cmu.edu/~enron/ .

Data access via tools —sources that provide controlled access to their social media data via dedicated tools, both to facilitate easy interrogation and also to stop users ‘sucking’ all the data from the repository. An example is Google’s Trends. These further subdivided into:

Free sources —repositories that are freely accessible, but the tools protect or may limit access to the ‘raw’ data in the repository, such as the range of tools provided by Google.

Commercial sources —data resellers that charge for access to their social media data. Gnip and DataSift provide commercial access to Twitter data through a partnership, and Thomson Reuters to news data.

Data access via APIs —social media data repositories providing programmable HTTP-based access to the data via APIs (e.g., Twitter, Facebook and Wikipedia).

4.1 Open-source databases

A major open source of social media is Wikipedia, which offers free copies of all available content to interested users (Wikimedia Foundation 2014 ). These databases can be used for mirroring, database queries and social media analytics. They include dumps from any Wikimedia Foundation project: http://dumps.wikimedia.org/ , English Wikipedia dumps in SQL and XML: http://dumps.wikimedia.org/enwiki/ , etc.

Another example of freely available data for research is the World Bank data, i.e., the World Bank Databank ( http://databank.worldbank.org/data/databases.aspx ) , which provides over 40 databases, such as Gender Statistics, Health Nutrition and Population Statistics, Global Economic Prospects, World Development Indicators and Global Development Finance, and many others. Most of the databases can be filtered by country/region, series/topics or time (years and quarters). In addition, tools are provided to allow reports to be customized and displayed in table, chart or map formats.

4.2 Data access via tools

As discussed, most commercial services provide access to social media data via online tools, both to control access to the raw data and increasingly to monetize the data.

4.2.1 Freely accessible sources

Google with tools such as Trends and InSights is a good example of this category. Google is the largest ‘scraper’ in the world, but they do their best to ‘discourage’ scraping of their own pages. (For an introduction of how to surreptitious scrape Google—and avoid being ‘banned’—see http://google-scraper.squabbel.com .) Google’s strategy is to provide a wide range of packages, such as Google Analytics, rather than from a researchers’ viewpoint the more useful programmable HTTP-based APIs.

Figure  2 illustrates how Google Trends displays a particular search term, in this case ‘libor.’ Using Google Trends you can compare up to five topics at a time and also see how often those topics have been mentioned and in which geographic regions the topics have been searched for the most.

Google Trends

4.2.2 Commercial sources

There is an increasing number of commercial services that scrape social networking media and then provide paid-for access via simple analytics tools. (The more comprehensive platforms with extensive analytics are reviewed in Sect. 8 .) In addition, companies such as Twitter are both restricting free access to their data and licensing their data to commercial data resellers, such as Gnip and DataSift.

Gnip is the world’s largest provider of social data. Gnip was the first to partner with Twitter to make their social data available, and since then, it was the first to work with Tumblr, Foursquare, WordPress, Disqus, StockTwits and other leading social platforms. Gnip delivers social data to customers in more than 40 countries, and Gnip’s customers deliver social media analytics to more than 95 % of the Fortune 500. Real-time data from Gnip can be delivered as a ‘Firehose’ of every single activity or via PowerTrack, a proprietary filtering tool that allows users to build queries around only the data they need. PowerTrack rules can filter data streams based on keywords, geo boundaries, phrase matches and even the type of content or media in the activity. The company then offers enrichments to these data streams such as Profile Geo (to add significantly more usable geo data for Twitter), URL expansion and language detection to further enhance the value of the data delivered. In addition to real-time data access, the company also offers Historical PowerTrack and Search API access for Twitter which give customers the ability to pull any Tweet since the first message on March 21, 2006.

Gnip provides access to premium (Gnip’s ‘Complete Access’ sources are publishers that have an agreement with Gnip to resell their data) and free data feeds (Gnip’s ‘Managed Public API Access’ sources provide access to normalized and consolidated free data from their APIs, although it requires Gnip’s paid services for the Data Collectors) via its dashboard (see Fig.  3 ). Firstly, the user only sees the feeds in the dashboard that were paid for under a sales agreement. To select a feed, the user clicks on a publisher and then chooses a specific feed from that publisher as shown in Fig.  3 . Different types of feeds serve different types of use cases and correspond to different types of queries and API endpoints on the publisher’s source API. After selecting the feed, the user is assisted by Gnip to configure it with any required parameters before it begins collecting data. This includes adding at least one rule. Under ‘Get Data’ – > ‘Advanced Settings’ you can also configure how often your feed queries the source API for data (the ‘query rate’). Choose between the publisher’s native data format and Gnip’s Activity Streams format (XML for Enterprise Data Collector feeds).

Gnip Dashboard, Publishers and Feeds

4.3 Data feed access via APIs

For researchers, arguably the most useful sources of social media data are those that provide programmable access via APIs, typically using HTTP-based protocols. Given their importance to academics, here, we review individually wikis, social networking media, RSS feeds, news, etc.

4.3.1 Wiki media

Wikipedia (and wikis in general) provides academics with large open-source repositories of user-generated (crowd-sourced) content. What is not widely known is that Wikipedia provides HTTP-based APIs that allows programmable access and searching (i.e., scraping) that returns data in a variety of formats including XML. In fact, the API is not unique to Wikipedia but part of MediaWiki’s ( http://www.mediawiki.org/ ) open-source toolkit and hence can be used with any MediaWiki-based wikis.

The wiki HTTP-based API works by accepting requests containing one or more input arguments and returning strings, often in XML format, that can be parsed and used by the requesting client. Other formats supported include JSON, WDDX, YAML, or PHP serialized. Details can be found at: http://en.wikipedia.org/w/api.php?action=query&list=allcategories&acprop=size&acprefix=hollywood&format=xml .

The HTTP request must contain: a) the requested ‘action,’ such as query, edit or delete operation; b) an authentication request; and c) any other supported actions. For example, the above request returns an XML string listing the first 10 Wikipedia categories with the prefix ‘hollywood.’ Vaswani ( 2011 ) provides a detailed description of how to scrape Wikipedia using an Apache/PHP development environment and an HTTP client capable of transmitting GET and PUT requests and handling responses.

4.3.2 Social networking media

As with Wikipedia, popular social networks, such as Facebook, Twitter and Foursquare, make a proportion of their data accessible via APIs.

Although many social networking media sites provide APIs, not all sites (e.g., Bing, LinkedIn and Skype) provide API access for scraping data. While more and more social networks are shifting to publicly available content, many leading networks are restricting free access, even to academics. For example, Foursquare announced in December 2013 that it will no longer allow private check-ins on iOS 7, and has now partnered with Gnip to provide a continuous stream of anonymized check-in data. The data is available in two packages: the full Firehose access level and a filtered version via Gnip’s PowerTrack service. Here, we briefly discuss the APIs provided by Twitter and Facebook.

4.3.2.1 Twitter

The default account setting keeps users’ Tweets public, although users can protect their Tweets and make them visible only to their approved Twitter followers. However, less than 10 % of all the Twitter accounts are private. Tweets from public accounts (including replies and mentions) are available in JSON format through Twitter’s Search API for batch requests of past data and Streaming API for near real-time data.

Search API —Query Twitter for recent Tweets containing specific keywords. It is part of the Twitter REST API v1.1 (it attempts to comply with the design principles of the REST architectural style, which stands for Representational State Transfer) and requires an authorized application (using oAuth, the open standard for authorization) before retrieving any results from the API.

Streaming API —A real-time stream of Tweets, filtered by user ID, keyword, geographic location or random sampling.

One may retrieve recent Tweets containing particular keywords through Twitter’s Search API (part of REST API v1.1) with the following API call: https://api.twitter.com/1.1/search/tweets.json?q=APPLE and real-time data using the streaming API call: https://stream.twitter.com/1/statuses/sample.json .

Twitter’s Streaming API allows data to be accessed via filtering (by keywords, user IDs or location) or by sampling of all updates from a select amount of users. Default access level ‘Spritzer’ allows sampling of roughly 1 % of all public statuses, with the option to retrieve 10 % of all statuses via the ‘Gardenhose’ access level (more suitable for data mining and research applications). In social media, streaming APIs are often called Firehose—a syndication feed that publishes all public activities as they happen in one big stream. Twitter has recently announced the Twitter Data Grants program, where researchers can apply to get access to Twitter’s public tweets and historical data in order to get insights from its massive set of data (Twitter has more than 500 million tweets a day); research institutions and academics will not get the Firehose access level; instead, they will only get the data set needed for their research project. Researchers can apply for it at the following address: https://engineering.twitter.com/research/data-grants .

Twitter results are stored in a JSON array of objects containing the fields shown in Fig.  4 . The JSON array consists of a list of objects matching the supplied filters and the search string, where each object is a Tweet and its structure is clearly specified by the object’s fields, e.g., ‘created_at’ and ‘from_user’. The example in Fig.  4 consists of the output of calling Twitter’s GET search API via http://search.twitter.com/search.json?q=financial%20times&rpp=1&include_entities=true&result_type=mixed where the parameters specify that the search query is ‘financial times,’ one result per page, each Tweet should have a node called ‘entities’ (i.e., metadata about the Tweet) and list ‘mixed’ results types, i.e., include both popular and real-time results in the response.

Example Output in JSON for Twitter REST API v1

4.3.2.2 Facebook

Facebook’s privacy issues are more complex than Twitter’s, meaning that a lot of status messages are harder to obtain than Tweets, requiring ‘open authorization’ status from users. Facebook currently stores all data as objects Footnote 1 and has a series of APIs, ranging from the Graph and Public Feed APIs to Keyword Insight API. In order to access the properties of an object, its unique ID must be known to make the API call. Facebook’s Search API (part of Facebook’s Graph API) can be accessed by calling https://graph.facebook.com/search?q=QUERY&type=page . The detailed API query format is shown in Fig.  5 . Here, ‘QUERY’ can be replaced by any search term, and ‘page’ can be replaced with ‘post,’ ‘user,’ ‘page,’ ‘event,’ ‘group,’ ‘place,’ ‘checkin,’ ‘location’ or ‘placetopic.’ The results of this search will contain the unique ID for each object. When returning the individual ID for a particular search result, one can use https://graph.facebook.com/ID to obtain further page details such as number of ‘likes.’ This kind of information is of interest to companies when it comes to brand awareness and competition monitoring.

Facebook Graph API Search Query Format

The Facebook Graph API search queries require an access token included in the request. Searching for pages and places requires an ‘app access token’, whereas searching for other types requires a user access token.

Replacing ‘page’ with ‘post’ in the aforementioned search URL will return all public statuses containing this search term. Footnote 2 Batch requests can be sent by following the procedure outlined here: https://developers.facebook.com/docs/reference/api/batch/ . Information on retrieving real-time updates can be found here: https://developers.facebook.com/docs/reference/api/realtime/ . Facebook also returns data in JSON format and so can be retrieved and stored using the same methods as used with data from Twitter, although the fields are different depending on the search type, as illustrated in Fig.  6 .

Facebook Graph API Search Results for q=’Centrica’ and type=’page’

4.3.3 RSS feeds

A large number of Web sites already provide access to content via RSS feeds. This is the syndication standard for publishing regular updates to web-based content based on a type of XML file that resides on an Internet server. For Web sites, RSS feeds can be created manually or automatically (with software).

An RSS Feed Reader reads the RSS feed file, finds what is new converts it to HTML and displays it. The program fragment in Fig.  7 shows the code for the control and channel statements for the RSS feed. The channel statements define the overall feed or channel, one set of channel statements in the RSS file.

Example RSS Feed Control and Channel Statements

4.3.4 Blogs, news groups and chat services

Blog scraping is the process of scanning through a large number of blogs, usually daily, searching for and copying content. This process is conducted through automated software. Figure  8 illustrates example code for Blog Scraping. This involves getting a Web site’s source code via Java’s URL Class, which can eventually be parsed via Regular Expressions to capture the target content.

Example Code for Blog Scraping

4.3.5 News feeds

News feeds are delivered in a variety of textual formats, often as machine-readable XML documents, JSON or CSV files. They include numerical values, tags and other properties that tend to represent underlying news stories. For testing purposes, historical information is often delivered via flat files, while live data for production is processed and delivered through direct data feeds or APIs. Figure  9 shows a snippet of the software calls to retrieve filtered NY Times articles.

Scraping New York Times Articles

Having examined the ‘classic’ social media data feeds, as an illustration of scraping innovative data sources, we will briefly look at geospatial feeds.

4.3.6 Geospatial feeds

Much of the ‘geospatial’ social media data come from mobile devices that generate location- and time-sensitive data. One can differentiate between four types of mobile social media feeds (Kaplan 2012 ):

Location and time sensitive —exchange of messages with relevance for one specific location at one specific point-in time (e.g., Foursquare).

Location sensitive only —exchange of messages with relevance for one specific location, which are tagged to a certain place and read later by others (e.g., Yelp and Qype)

Time sensitive only —transfer of traditional social media applications to mobile devices to increase immediacy (e.g., posting Twitter messages or Facebook status updates)

Neither location or time sensitive —transfer of traditional social media applications to mobile devices (e.g., watching a YouTube video or reading a Wikipedia entry)

With increasingly advanced mobile devices, notably smartphones, the content (photos, SMS messages, etc.) has geographical identification added, called ‘geotagged.’ These geospatial metadata are usually latitude and longitude coordinates, though they can also include altitude, bearing, distance, accuracy data or place names. GeoRSS is an emerging standard to encode the geographic location into a web feed, with two primary encodings: GeoRSS Geography Markup Language (GML) and GeoRSS Simple.

Example tools are GeoNetwork Opensource—a free comprehensive cataloging application for geographically referenced information, and FeedBurner—a web feed provider that can also provide geotagged feeds, if the specified feeds settings allow it.

As an illustration Fig.  10 shows the pseudo-code for analyzing a geospatial feed.

Pseudo-code for Analyzing a Geospatial Feed

5 Text cleaning, tagging and storing

The importance of ‘quality versus quantity’ of data in social media scraping and analytics cannot be overstated (i.e., garbage in and garbage out ). In fact, many details of analytics models are defined by the types and quality of the data. The nature of the data will also influence the database and hardware used.

Naturally, unstructured textual data can be very noisy (i.e., dirty). Hence, data cleaning (or cleansing, scrubbing) is an important area in social media analytics. The process of data cleaning may involve removing typographical errors or validating and correcting values against a known list of entities. Specifically, text may contain misspelled words, quotations, program codes, extra spaces, extra line breaks, special characters, foreign words, etc. So in order to achieve high-quality text mining, it is necessary to conduct data cleaning at the first step: spell checking, removing duplicates, finding and replacing text, changing the case of text, removing spaces and non-printing characters from text, fixing numbers, number signs and outliers, fixing dates and times, transforming and rearranging columns, rows and table data, etc.

Having reviewed the types and sources of raw data, we now turn to ‘cleaning’ or ‘cleansing’ the data to remove incorrect, inconsistent or missing information. Before discussing strategies for data cleaning, it is essential to identify possible data problems (Narang 2009 ):

Missing data —when a piece of information existed but was not included for whatever reason in the raw data supplied. Problems occur with: a) numeric data when ‘blank’ or a missing value is erroneously substituted by ‘zero’ which is then taken (for example) as the current price; and b) textual data when a missing word (like ‘not’) may change the whole meaning of a sentence.

Incorrect data —when a piece of information is incorrectly specified (such as decimal errors in numeric data or wrong word in textual data) or is incorrectly interpreted (such as a system assuming a currency value is in $ when in fact it is in £ or assuming text is in US English rather than UK English).

Inconsistent data —when a piece of information is inconsistently specified. For example, with numeric data, this might be using a mixture of formats for dates: 2012/10/14, 14/10/2012 or 10/14/2012. For textual data, it might be as simple as: using the same word in a mixture of cases, mixing English and French in a text message, or placing Latin quotes in an otherwise English text.

5.1 Cleansing data

A traditional approach to text data cleaning is to ‘pull’ data into a spreadsheet or spreadsheet-like table and then reformat the text. For example, Google Refine Footnote 3 is a standalone desktop application for data cleaning and transformation to various formats. Transformation expressions are written in proprietary Google Refine Expression Language (GREL) or JYTHON (an implementation of the Python programming language written in Java). Figure  11 illustrates text cleansing.

Text Cleansing Pseudo-code

5.2 Tagging unstructured data

Since most of the social media data is generated by humans and therefore is unstructured (i.e., it lacks a pre-defined structure or data model), an algorithm is required to transform it into structured data to gain any insight. Therefore, unstructured data need to be preprocessed, tagged and then parsed in order to quantify/analyze the social media data.

Adding extra information to the data (i.e., tagging the data) can be performed manually or via rules engines, which seek patterns or interpret the data using techniques such as data mining and text analytics. Algorithms exploit the linguistic, auditory and visual structure inherent in all of the forms of human communication. Tagging the unstructured data usually involve tagging the data with metadata or part-of-speech (POS) tagging. Clearly, the unstructured nature of social media data leads to ambiguity and irregularity when it is being processed by a machine in an automatic fashion.

Using a single data set can provide some interesting insights. However, combining more data sets and processing the unstructured data can result in more valuable insights, allowing us to answer questions that were impossible beforehand.

5.3 Storing data

As discussed, the nature of the social media data is highly influential on the design of the database and possibly the supporting hardware. It would also be very important to note that each social platform has very specific (and narrow) rules around how their respective data can be stored and used. These can be found in the Terms of Service for each platform.

For completeness, databases comprise:

Flat file —a flat file is a two-dimensional database (somewhat like a spreadsheet) containing records that have no structured interrelationship, that can be searched sequentially.

Relational database —a database organized as a set of formally described tables to recognize relations between stored items of information, allowing more complex relationships among the data items. Examples are row-based SQL databases and column-based kdb + used in finance.

noSQL databases —a class of database management system (DBMS) identified by its non-adherence to the widely used relational database management system (RDBMS) model. noSQL/newSQL databases are characterized as: being non-relational, distributed, open-source and horizontally scalable.

5.3.1 Apache (noSQL) databases and tools

The growth of ultra-large Web sites such as Facebook and Google has led to the development of noSQL databases as a way of breaking through the speed constraints that relational databases incur. A key driver has been Google’s MapReduce, i.e., the software framework that allows developers to write programs that process massive amounts of unstructured data in parallel across a distributed cluster of processors or stand-alone computers (Chandrasekar and Kowsalya 2011 ). It was developed at Google for indexing Web pages and replaced their original indexing algorithms and heuristics in 2004. The model is inspired by the ‘Map’ and ‘Reduce’ functions commonly used in functional programming. MapReduce (conceptually) takes as input a list of records, and the ‘Map’ computation splits them among the different computers in a cluster. The result of the Map computation is a list of key/value pairs. The corresponding ‘Reduce’ computation takes each set of values that has the same key and combines them into a single value. A MapReduce program is composed of a ‘Map()’ procedure for filtering and sorting and a ‘Reduce()’ procedure for a summary operation (e.g., counting and grouping).

Figure  12 provides a canonical example application of MapReduce. This example is a process to count the appearances of each different word in a set of documents (MapReduce 2011 ).

The Canonical Example Application of MapReduce

5.3.1.1 Apache open-source software

The research community is increasingly using Apache software for social media analytics. Within the Apache Software Foundation, three levels of software are relevant:

Cassandra/hive databases —Apache Cassandra is an open source (noSQL) distributed DBMS providing a structured ‘key-value’ store. Key-value stores allow an application to store its data in a schema-less way. Related noSQL database products include: Apache Hive, Apache Pig and MongoDB, a scalable and high-performance open-source database designed to handle document-oriented storage. Since noSQL databases are ‘structure-less,’ it is necessary to have a companion SQL database to retain and map the structure of the corresponding data.

Hadoop platform —is a Java-based programming framework that supports the processing of large data sets in a distributed computing environment. An application is broken down into numerous small parts (also called fragments or blocks) that can be run on systems with thousands of nodes involving thousands of terabytes of storage.

Mahout —provides implementations of distributed or otherwise scalable analytics (machine learning) algorithms running on the Hadoop platform. Mahout Footnote 4 supports four classes of algorithms: a) clustering (e.g., K-Means, Fuzzy C-Means) that groups text into related groups; b) classification (e.g., Complementary Naive Bayes classifier) that uses supervised learning to classify text; c) frequent itemset mining takes a set of item groups and identifies which individual items usually appear together; and d) recommendation mining (e.g., user- and item-based recommenders) that takes users’ behavior and from that tries to find items users might like.

6 Social media analytics techniques

As discussed, opinion mining (or sentiment analysis) is an attempt to take advantage of the vast amounts of user-generated text and news content online. One of the primary characteristics of such content is its textual disorder and high diversity. Here, natural language processing, computational linguistics and text analytics are deployed to identify and extract subjective information from source text. The general aim is to determine the attitude of a writer (or speaker) with respect to some topic or the overall contextual polarity of a document.

6.1 Computational science techniques

Automated sentiment analysis of digital texts uses elements from machine learning such as latent semantic analysis, support vector machines, bag-of-words model and semantic orientation (Turney 2002 ). In simple terms, the techniques employ three broad areas:

Computational statistics —refers to computationally intensive statistical methods including resampling methods, Markov chain Monte Carlo methods, local regression, kernel density estimation and principal components analysis.

Machine learning —a system capable of the autonomous acquisition and integration of knowledge learnt from experience, analytical observation, etc. (Murphy 2012 ). These sub-symbolic systems further subdivide into:

Supervised learning such as Regression Trees, Discriminant Function Analysis, Support Vector Machines.

Unsupervised learning such as Self-Organizing Maps (SOM), K-Means.

Machine Learning aims to solve the problem of having huge amounts of data with many variables and is commonly used in areas such as pattern recognition (speech, images), financial algorithms (credit scoring, algorithmic trading) (Nuti et al. 2011 ), energy forecasting (load, price) and biology (tumor detection, drug discovery). Figure  13 illustrates the two learning types of machine learning and their algorithm categories.

Machine Learning Overview

Complexity science —complex simulation models of difficult-to-predict systems derived from statistical physics, information theory and nonlinear dynamics. The realm of physicists and mathematicians.

These techniques are deployed in two ways:

Data mining —knowledge discovery that extracts hidden patterns from huge quantities of data, using sophisticated differential equations, heuristics, statistical discriminators (e.g., hidden Markov models), and artificial intelligence machine learning techniques (e.g., neural networks, genetic algorithms and support vector machines).

Simulation modeling —simulation-based analysis that tests hypotheses. Simulation is used to attempt to predict the dynamics of systems so that the validity of the underlying assumption can be tested.

6.1.1 Stream processing

Lastly, we should mention stream processing (Botan et al 2010 ). Increasingly, analytics applications that consume real-time social media, financial ‘ticker’ and sensor networks data need to process high-volume temporal data with low latency. These applications require support for online analysis of rapidly changing data streams. However, traditional database management systems (DBMSs) have no pre-defined notion of time and cannot handle data online in near real time. This has led to the development of Data Stream Management Systems (DSMSs) (Hebrail 2008 )—processing in main memory without storing the data on disk—that handle transient data streams on-line and process continuous queries on these data streams. Example commercial systems include: Oracle CEP engine, StreamBase and Microsoft’s StreamInsight (Chandramouli et al. 2010 ).

6.2 Sentiment analysis

Sentiment is about mining attitudes, emotions, feelings—it is subjective impressions rather than facts. Generally speaking, sentiment analysis aims to determine the attitude expressed by the text writer or speaker with respect to the topic or the overall contextual polarity of a document (Mejova 2009 ). Pang and Lee ( 2008 ) provide a thorough documentation on the fundamentals and approaches of sentiment classification and extraction, including sentiment polarity, degrees of positivity, subjectivity detection, opinion identification, non-factual information, term presence versus frequency, POS (parts of speech), syntax, negation, topic-oriented features and term-based features beyond term unigrams.

6.2.1 Sentiment classification

Sentiment analysis divides into specific subtasks:

Sentiment context —to extract opinion, one needs to know the ‘context’ of the text, which can vary significantly from specialist review portals/feeds to general forums where opinions can cover a spectrum of topics (Westerski 2008 ).

Sentiment level —text analytics can be conducted at the document, sentence or attribute level.

Sentiment subjectivity —deciding whether a given text expresses an opinion or is factual (i.e., without expressing a positive/negative opinion).

Sentiment orientation/polarity —deciding whether an opinion in a text is positive , neutral or negative .

Sentiment strength —deciding the ‘strength’ of an opinion in a text: weak , mild or strong .

Perhaps, the most difficult analysis is identifying sentiment orientation/polarity and strength— positive (wonderful, elegant, amazing, cool), neutral (fine, ok) and negative (horrible, disgusting, poor, flakey, sucks) due to slang.

A popular approach is to assign orientation/polarity scores (+1, 0, −1) to all words: positive opinion (+1), neutral opinion (0) and negative opinion (−1). The overall orientation/polarity score of the text is the sum of orientation scores of all ‘opinion’ words found. However, there are various potential problems in this simplistic approach, such as negation (e.g., there is nothing I hate about this product). One method of estimating sentiment orientation/polarity of the text is pointwise mutual information (PMI) a measure of association used in information theory and statistics.

6.2.2 Supervised learning methods

There are a number of popular computational statistics and machine learning techniques used for sentiment analysis. For a good introduction, see (Khan et al 2010 ). Techniques include:

Naïve Bayes (NB) —a simple probabilistic classifier based on applying Bayes’ theorem with strong (naive) independence assumptions (when features are independent of one another within each class).

Maximum entropy (ME) —the probability distribution that best represents the current state of knowledge is the one with largest information-theoretical entropy.

Support vector machines (SVM) —are supervised learning models with associated learning algorithms that analyze data and recognize patterns, used for classification and regression analysis.

Logistic regression (LR) model —is a type of regression analysis used for predicting the outcome of a categorical (a variable that can take on a limited number of categories) criterion variable based on one or more predictor variables.

Latent semantic analysis —an indexing and retrieval method that uses a mathematical technique called singular value decomposition (SVD) to identify patterns in the relationships between the terms and concepts contained in an unstructured collection of text (Kobayashi and Takeda 2000 ).

The bag-of-words model is a simplifying representation commonly used in natural language processing and IR, where a sentence or a document is represented as an unordered collection of words, disregarding grammar and even word order. This is a model traditionally applied to sentiment analysis thanks to its simplicity.

6.2.2.1 Naïve Bayes classifier (NBC)

As an example of sentiment analysis, we will describe briefly a Naive Bayes classifier (Murphy 2006 ). The Naive Bayes classifier is general purpose, simple to implement and works well for a range of applications. It classifies data in two steps:

Training step —using the training samples, the method estimates the parameters of a probability distribution, assuming features are conditionally independent given the class.

Analysis/testing step —For any unseen test sample, the method computes the posterior probability of that sample belonging to each class. The method then classifies the test sample according to the largest posterior probability.

Using the Naïve Bayes classifier, the classifier calculates the probability for a text to belong to each of the categories you test against. The category with the highest probability for the given text wins:

Figure  14 provides an example of sentiment classification using a Naïve Bayes classifier in Python. There are a number of Naïve Bayes classifier programs available in Java, including the jBNC toolkit ( http://jbnc.sourceforge.net ), WEKA ( www.cs.waikato.ac.nz/ml/weka ) and Alchemy API ( www.alchemyapi.com/api/demo.html ).

Sentiment Classification Example using Python

We next look at the range of Social Media tools available, starting with ‘tools’ and ‘toolkits,’ and in the subsequent chapter at ‘comprehensive’ social media platforms. Since there are a large number of social media textual data services, tools and platforms, we will restrict ourselves examining a few leading examples.

7 Social media analytics tools

Opinion mining tools are crowded with (commercial) providers, most of which are skewed toward sentiment analysis of customer feedback about products and services. Fortunately, there is a vast spectrum of tools for textual analysis ranging from simple open-source tools to libraries, multi-function commercial toolkits and platforms. This section focuses on individual tools and toolkits for scraping, cleaning and analytics, and the next chapter looks at what we call social media platforms that provide both archive data and real-time feeds, and as well as sophisticated analytics tools.

7.1 Scientific programming tools

Popular scientific analytics libraries and tools have been enhanced to provide support for sourcing, searching and analyzing text. Examples include: R—used for statistical programming, MATLAB—used for numeric scientific programming, and Mathematica—used for symbolic scientific programming (computer algebra).

Data processing and data modeling, e.g., regression analysis, are straightforward using MATLAB, which provides time-series analysis, GUI and array-based statistics. MATLAB is significantly faster than the traditional programming languages and can be used for a wide range of applications. Moreover, the exhaustive built-in plotting functions make it a complex analytics toolkit. More computationally powerful algorithms can be developed using it in conjunction with the packages (e.g., FastICA in order to perform independent component analysis).

Python can be used for (natural) language detection, title and content extraction, query matching and, when used in conjunction with a module such as scikit-learn, it can be trained to perform sentiment analysis, e.g., using a Naïve Bayes classifier.

Another example, Apache UIMA (Unstructured Information Management Applications) is an open-source project that analyzes ‘big data’ and discovers information that is relevant to the user.

7.2 Business toolkits

Business Toolkits are commercial suites of tools that allow users to source, search and analyze text for a range of commercial purposes.

SAS Sentiment Analysis Manager, part of the SAS Text Analytics program, can be used for scraping content sources, including mainstream Web sites and social media outlets, as well as internal organizational text sources, and creates reports that describe the expressed feelings of consumers, customers and competitors in real time.

RapidMiner (Hirudkar and Sherekar 2013 ), a popular toolkit offering an open-source Community Edition released under the GNU AGPL and also an Enterprise Edition offered under a commercial license. RapidMiner provides data mining and machine learning procedures including: data loading and transformation (Extract, Transform, Load, a.k.a. ETL), data preprocessing and visualization, modeling, evaluation, and deployment. RapidMiner is written in Java and uses learning schemes and attribute evaluators from the Weka machine learning environment and statistical modeling schemes from the R project.

Other examples are Lexalytics that provides a commercial sentiment analysis engine for many OEM and direct customers; and IBM SPSS Statistics is one of the most used programs for statistical analysis in social science.

7.3 Social media monitoring tools

Social media monitoring tools are sentiment analysis tools for tracking and measuring what people are saying (typically) about a company or its products, or any topic across the web’s social media landscape.

In the area of social media monitoring examples include: Social Mention, ( http://socialmention.com/ ), which provides social media alerts similarly to Google Alerts; Amplified Analytics ( http://www.amplifiedanalytics.com/ ), which focuses on product reviews and marketing information; Lithium Social Media Monitoring; and Trackur, which is an online reputation monitoring tool that tracks what is being said on the Internet.

Google also provides a few useful free tools. Google Trends shows how often a particular search-term input compares to the total search volume. Another tool built around Google Search is Google Alerts—a content change detection tool that provides notifications automatically. Google also acquired FeedBurner—an RSS feeds management—in 2007.

7.4 Text analysis tools

Text analysis tools are broad-based tools for natural language processing and text analysis. Examples of companies in the text analysis area include: OpenAmplify and Jodange whose tools automatically filter and aggregate thoughts, feelings and statements from traditional and social media.

There are also a large number of freely available tools produced by academic groups and non-governmental organizations (NGO) for sourcing, searching and analyzing opinions. Examples include Stanford NLP group tools and LingPipe, a suite of Java libraries for the linguistic analysis of human language (Teufl et al 2010 ).

A variety of open-source text analytics tools are available, especially for sentiment analysis. A popular text analysis tool, which is also open source, is Python NLTK—Natural Language Toolkit ( www.nltk.org/ ), which includes open-source Python modules, linguistic data and documentation for text analytics. Another one is GATE ( http://gate.ac.uk/sentiment ).

We should also mention Lexalytics Sentiment Toolkit which performs automatic sentiment analysis on input documents. It is powerful when used on a large number of documents, but it does not perform data scraping.

Other commercial software for text mining include: AeroText, Attensity, Clarabridge, IBM LanguageWare, SPSS Text Analytics for Surveys, Language Computer Corporation, STATISTICA Text Miner and WordStat.

7.5 Data visualization tools

The data visualization tools provide business intelligence (BI) capabilities and allow different types of users to gain insights from the ‘big’ data. The users can perform exploratory analysis through interactive user interfaces available on the majority of devices, with a recent focus on mobile devices (smartphones and tablets). The data visualization tools help the users identify patterns, trends and relationships in the data which were previously latent. Fast ad hoc visualization on the data can reveal patterns and outliers, and it can be performed on large-scale data sets frameworks, such as Apache Hadoop or Amazon Kinesis. Two notable visualization tools are SAS Visual Analytics and Tableau.

7.6 Case study: SAS Sentiment Analysis and Social Media Analytics

SAS is the leading advanced analytics software for BI, data management and predictive analytics. SAS Sentiment Analysis (SAS Institute 2013 ) automatically rates and classifies opinions. It also performs data scraping from Web sites, social media and internal file systems. Then, it processes in a unified format to evaluate relevance with regard to its pre-defined topics. SAS Sentiment Analysis identifies trends and emotional changes. Experts can refine the sentiment models through an interactive workbench. The tool automatically assigns sentiment scores to the input documents as they are retrieved in real time.

SAS Sentiment Analysis combines statistical modeling and linguistics (rule-based natural language processing techniques) in order to output accurate sentiment analysis results. The tool monitors and evaluates sentiment changes over time; it extracts sentiments in real time as the scraped data is being retrieved and generates reports showing patterns and detailed reactions.

The software identifies where (i.e., on what channel) the topic is being discussed and quantifies perceptions in the market as the software scrapes and analyzes both internal and external content about your organization (or the concept you are analyzing) and competitors, identifying positive, neutral, negative or ‘no sentiment’ texts in real time.

SAS Sentiment Analysis and SAS Social Media Analytics have a user-friendly interface for developing models; users can upload sentiment analysis models directly to the server in order to minimize the manual model deployment. More advanced users can use the interactive workbench to refine their models. The software includes graphics to illustrate instantaneously the text classification (i.e., positive, negative, neutral or unclassified) and point-and-click exploration in order to drill the classified text into detail. The tool also provides some workbench functionality through APIs, allowing for automatic/programmatic integration with other modules/projects. Figure  15 illustrates the SAS Social Media Analytics graphical reports, which provide user-friendly sentiment insights. The SAS software has crawling plugins for the most popular social media sites, including Facebook, Twitter, Bing, LinkedIn, Flickr and Google. It can also be customized to crawl any Web site using the mark-up matcher; this provides a point-and-click interface to indicate what areas need to be extracted from an HTML or XML. SAS Social Media Analytics gathers online conversations from popular networking sites (e.g., Facebook and Twitter), blogs and review sites (e.g., TripAdvisor and Priceline), and scores the data for influence and sentiment. It provides visualization tools for real-time tracking; it allows users to submit customized queries and returns a geographical visualization with brand-specific commentary from Twitter, as illustrated in Fig.  16 .

Graphical Reports with Sentiment Insights

SAS Visualization of Real-Time Tracking via Twitter

8 Social media analytics platforms

Here, we examine comprehensive social media platforms that combine social media archives, data feeds, data mining and data analysis tools. Simply put, the platforms are different from tools and toolkits since platforms are more comprehensive and provide both tools and data.

They broadly subdivide into:

News platforms —platforms such as Thomson Reuters providing news archives/feeds and associated analytics and targeting companies such as financial institutions seeking to monitor market sentiment in news.

Social network media platforms —platforms that provide data mining and analytics on Twitter, Facebook and a wide range of other social network media sources. Providers typically target companies seeking to monitor sentiment around their brands or products.

8.1 News platforms

The two most prominent business news feed providers are Thomson Reuters and Bloomberg.

Computer read news in real time and provide automatically key indicators and meaningful insights. The news items are automatically retrieved, analyzed and interpreted in a few milliseconds. The machine-readable news indicators can potentially improve quantitative strategies, risk management and decision making.

Examples of machine-readable news include: Thomson Reuters Machine Readable News, Bloomberg’s Event-Driven Trading Feed and AlphaFlash (Deutsche Börse’s machine-readable news feed). Thomson Reuters Machine Readable News (Thomson Reuters 2012a , b , c ) has Reuters News content dating back to 1987, and comprehensive news from over 50 third-parties dating back to 2003, such as PR Newswire, Business Wire and the Regulatory News Service (LSE). The feed offers full text and comprehensive metadata via streaming XML.

Thomson Reuters News Analytics uses Natural Language Processing (NLP) techniques to score news items on tens of thousands of companies and nearly 40 commodities and energy topics. Items are measured across the following dimensions:

Author sentiment —metrics for how positive, negative or neutral the tone of the item is, specific to each company in the article.

Relevance —how relevant or substantive the story is for a particular item.

Volume analysis —how much news is happening on a particular company.

Uniqueness —how new or repetitive the item is over various time periods.

Headline analysis —denotes special features such as broker actions, pricing commentary, interviews, exclusives and wrap-ups.

8.2 Social network media platforms

Attensity, Brandwatch, Salesforce Marketing Cloud (previously called Radian6) and Sysomos MAP (Media Analysis Platform) are examples of social media monitoring platforms, which measure demographics, influential topics and sentiments. They include text analytics and sentiment analysis on online consumer conversations and provide user-friendly interfaces for customizing the search query, dashboards, reports and file export features (e.g., to Excel or CSV format). Most of the platforms scrape a range of social network media using a distributed crawler that targets: micro-blogging (Twitter via full Twitter Firehose), blogs (Blogger, WordPress, etc.), social networks (Facebook and MySpace), forums, news sites, images sites (Flickr) and corporate sites. Some of the platforms provide multi-language support for widely used languages (e.g., English, French, German, Italian and Spanish).

Sentiment analysis platforms use two main methodologies. One involves a statistical or model-based approach wherein the system learns to assess sentiment by analyzing large quantities of pre-scored material. The other method utilizes a large dictionary of pre-scored phrases.

RapidMiner Footnote 5 is a platform which combines data mining and data analysis, which, depending on the requirements, can be open source. It uses the WEKA machine learning library and provides access to data sources such as Excel, Access, Oracle, IBM, MySQL, PostgreSQL and Text files.

Mozenda provides a point-and-click user interface for extracting specific information from the Web sites and allows automation and data export to CSV, TSV or XML files.

DataSift provides access to both real-time and historical social data from the leading social networks and millions of other sources, enabling clients to aggregate, filter and gain insights and discover trends from the billions of public social conversations. Once the data is aggregated and processed (i.e., DataSift can filter and add context, such as enrichments—language processing, geodata and demographics—and categorization—spam detection, intent identification and machine learning), the customers can use pre-built integrations with popular BI tools, application and developer tools to deliver the data into their businesses, or use the DataSift APIs to stream real-time data into their applications.

There are a growing number of social media analytics platforms being founded nowadays. Other notable platforms that handle sentiment and semantic analysis of Web and Web 2.0-sourced material include Google Analytics, HP Autonomy IDOL (Intelligent Data Operating Layer), IBM SPSS Modeler, Adobe SocialAnalytics, GraphDive, Keen IO, Mass Relevance, Parse.ly, ViralHeat, Socialbakers, DachisGroup, evolve24, OpenAmplify and AdmantX.

Recently, more and more specific social analytics platforms have emerged. One of them is iSpot.tv which launched its own social media analytics platform that matches television ads with mentions on Twitter and Facebook. It provides real-time reports about when and where an ad appears, together with what people are saying about it on social networks (iSpot.tv monitors almost 80 different networks).

Thomson Reuters has recently announced that it is now incorporating Twitter sentiment analysis for the Thomson Reuters Eikon market analysis and trading platform, providing visualizations and charts based on the sentiment data. In the previous year, Bloomberg incorporated tweets related to specific companies in a wider data stream.

8.3 Case study: Thomson Reuters News Analytics

Thomson Reuters News Analytics (TRNA) provides a huge news archive with analytics to read and interpret news, offering meaningful insights. TRNA scores news items on over 25,000 equities and nearly 40 topics (commodities and energy). The platform scrapes and analyzes news data in real time and feeds the data into other programs/projects or quantitative strategies.

TRNA uses an NLP system from Lexalytics, one of the linguistics technology leaders, that can track news sentiment over time, and scores text across the various dimensions as mentioned in Sect. 8.1 .

The platform’s text scoring and metadata has more than 80 fields (Thomson Reuters 2010 ) such as:

Item type —stage of the story: Alert, Article, Updates or Corrections.

Item genre —classification of the story, i.e., interview, exclusive and wrap-up.

Headline —alert or headline text.

Relevance —varies from 0 to 1.0.

Prevailing sentiment —can be 1, 0 or −1.

Positive, neutral, negative —more detailed sentiment indication.

Broker action —denotes broker actions: upgrade, downgrade, maintain, undefined or whether it is the broker itself

Price/market commentary —used to flag items describing pricing/market commentary

Topic codes —describes what the story is about, i.e., RCH = Research, RES = Results, RESF = Results Forecast, MRG = Mergers and Acquisitions

A snippet of the news sentiment analysis is illustrated in Fig.  17 .

Thomson Reuters News Discovery Application with Sentiment Analysis

In 2012, Thomson Reuters extended its machine-readable news offering to include sentiment analysis and scoring for social media. TRNA’s extension is called Thomson Reuters News Analytics (TRNA) for Internet News and Social Media, which aggregates content from over four million social media channels and 50,000 Internet news sites. The content is then analyzed by TRNA in real time, generating a quantifiable output across dimensions such as sentiment, relevance, novelty volume, category and source ranks. This extension uses the same extensive metadata tagging (across more than 80 fields).

TRNA for Internet News and Social Media is a powerful platform analyzing, tagging and filtering millions of public and premium sources of Internet content, turning big data into actionable ideas. The platform also provides a way to visually analyze the big data. It can be combined with Panopticon Data Visualization Software in order to reach meaningful conclusions more quickly with visually intuitive displays (Thomson Reuters 2012a , b , c ), as illustrated in Fig.  18 .

Combining TRNA for Internet News and Social Media with Panopticon Data Visualization Software

Thomson Reuters also expanded the News Analytics service with MarketPsych Indices (Thomson Reuters 2012a , b , c ), which allows for real-time psychological analysis of news and social media. The Thomson Reuters MarketPsych Indices (TRMI) service gains a quantitative view of market psychology as it attempts to identify human emotion and sentiment. It is a complement to TRNA and uses NLP processing created by MarketPsych ( http://www.marketpsych.com ), a leading company in behavioral psychology in financial markets.

Behavioral economists have extensively investigated whether emotions affect markets in predictable ways, and TRMI attempts to measure the state of ‘emotions’ in real time in order to identify patterns as they emerge. TRMI has two key indicator types:

Emotional indicators (sentiments) —emotions such as Gloom , Fear , Trust , Uncertainty , Innovation , Anger , Stress , Urgency , Optimism and Joy .

Buzz metrics —they indicate how much something is being discussed in the news and social media and include macroeconomic themes (e.g., Litigation, Mergers, Volatility, Financials sector, Airlines sector and Clean Technology sector )

The platform from Thomson Reuters allows the exploitation of news and social media to be used to spot opportunities and capitalize on market inefficiencies (Thomson Reuters 2013 ).

9 Experimental computational environment for social media

As we have discussed in Sect. 2 methodology and critique, researchers arguably require a comprehensive experimental computational environment/facility for social media research with the following attributes:

Data scraping —the ability through easily programmable APIs to scrape any type of social media (social networking media, RSS feeds, blogs, wikis, news, etc.).

Data streaming —to access and combine real-time feeds and archived data for analytics.

Data storage —a major facility for storing principal data sources and for archiving data collected for specific projects.

Data protection/security —the stored data needs to be protected to stop users attempting to ‘suck it out’ off the facility. Access to certain data sets may need to be restricted and charges may be levied on access (cf. Wharton Research Data Services).

Programmable interfaces —researchers need access to simple application programming interfaces (APIs) to scrape and store other available data sources that may not be automatically collected.

9.2 Analytics

Analytics dashboards —non-programming interfaces are required for giving what might be referred to as ‘deep’ access to ‘raw’ data.

Programmable analytics —programming interfaces are also required so users can deploy advanced data mining and computer simulation models using MATLAB, Java and Python.

Stream processing —facilities are required to support analytics on streamed real-time data feeds, such as Twitter feeds, news feeds and financial tick data.

High-performance computing —lastly the environment needs to support non-programming interfaces to MapReduce/Hadoop, NoSQL databases and Grids of processors.

Decentralized analytics —if researchers are to combine social media data with highly sensitive/valuable proprietary data held by governments, financial institutions, retailers and other commercial organizations, then the environment needs in the future to support decentralized analytics across distributed data sources and in a highly secure way.

Realistically, this is best facilitated at a national or international level.

To provide some insight into the structure of an experimental computational environment for social media (analytics), below we present the system architecture of the UCL SocialSTORM analytics platform developed by Dr. Michal Galas and his colleagues (Galas et al. 2012 ) to University College London (UCL).

University College London’s social media streaming, storage and analytics platform (SocialSTORM) is a cloud-based ‘central hub’ platform, which facilitates the acquisition of text-based data from online sources such as Twitter, Facebook, RSS media and news. The system includes facilities to upload and run Java-coded simulation models to analyze the aggregated data, which may comprise scraped social data and/or users’ own proprietary data.

9.3 System architecture

Figure  19 shows the architecture of the SocialSTORM platform, and the following section outlines the key components of the overall system. The basic idea is that each external feed has a dedicated connectivity engine (API) and this streams data to the message bus, which handles internal communication, analytics and storage.

SocialSTORM Platform Architecture

Connectivity engines —the connectivity modules communicate with the external data sources, including Twitter and Facebook’s APIs, financial blogs, various RSS and news feeds. The platform’s APIs are continually being expanded to incorporate other social media sources as required. Data is fed into SocialSTORM in real time, including a random sample of all public updates from Twitter, providing gigabytes of text-based data every day.

Messaging bus —the message bus serves as the internal communication layer which accepts the incoming data streams (messages) from the various connectivity engines, parses these (from either JSON or XML format) to an internal representation of data in the platform, distributes the information across all the interested modules and writes the various data to the appropriate tables of the main database.

Data warehouse —the database supports terabytes of text-based entries, which are accompanied by various types of metadata to expand the potential avenues of research. Entries are organized by source and accurately time-stamped with the time of publication, as well as being tagged with topics for easy retrieval by simulation models. The platform currently uses HBase, but in future might use Apache Cassandra or Hive.

Simulation manager —the simulation manager provides an external API for clients to interact with the data for research purposes, including a web-based GUI whereby users can select various filters to apply to the data sets before uploading a Java-coded simulation model to perform the desired analysis on the data. This facilitates all client-access to the data warehouse and also allows users to upload their own data sets for aggregation with UCL’s social data for a particular simulation. There is also the option to switch between historical mode (which mines data existing at the time the simulation is started) and live mode (which ‘listens’ to incoming data streams and performs analysis in real time).

9.4 Platform components

The platform comprises the following modules, which are illustrated in Fig.  20 :

Environment System Architecture and Modules

Back-end services —this provides the core of the platform functionalities. It is a set of services that allow connections to data providers, propagation processing and aggregation of data feeds, execution and maintenance of models, as well as their management in a multiuser environment.

Front-end client APIs —this provides a set of programmatic and graphical interfaces that can be used to interact with a platform to implement and test analytical models. The programmatic access provides model templates to simplify access to some of the functionalities and defines generic structure of every model in the platform. The graphic user interface allows visual management of analytical models. It enables the user to visualize data in various forms, provides data watch grid capabilities, provides a dynamic visualization of group behavior of data and allows users to observe information on events relevant to the user’s environment.

Connectivity engine —this functionality provides a means of communication with the outside world, with financial brokers, data providers and others. Each of the outside venues utilized by the platform has a dedicated connector object responsible for control of communication. This is possible due to the fact that each of the outside institutions provide either a dedicated API or is using a communication protocol (e.g., the FIX protocol and the JSON/XML-based protocol). The platform provides a generalized interface to allow standardization of a variety of connectors.

Internal communication layer —the idea behind the use of the internal messaging system in the platform draws from the concept of event-driven programming. Analytical platforms utilize events as a main means of communication between their elements. The elements, in turn, are either producers or consumers of the events. The approach significantly simplifies the architecture of such system while making it scalable and flexible for further extensions.

Aggregation database —this provides a fast and robust DBMS functionality, for an entry-level aggregation of data, which is then filtered, enriched, restructured and stored in big data facilities. Aggregation facilities enable analytical platforms to store, extract and manipulate large amounts of data. The storage capabilities of the Aggregation element not only allow replay of historical data for modeling purposes, but also enable other, more sophisticated tasks related to functioning of the platform including model risk analysis, evaluation of performance of models and many more.

Client SDK —this is a complete set of APIs (Application Programming Interfaces) that enable development, implementation and testing of new analytical models with use of the developer’s favorite IDE (Integrated Development Environment). The SDK allows connection from the IDE to the server side of the platform to provide all the functionalities the user may need to develop and execute models.

Shared memory —this provides a buffer-type functionality that speeds up the delivery of temporal/historical data to models and the analytics-related elements of the platform (i.e., the statistical analysis library of methods), and, at the same time, reduces the memory usage requirement. The main idea is to have a central point in the memory (RAM) of the platform that will manage and provide a temporal/historical data from the current point of time up to a specified number of timestamps back in history). Since the memory is shared, no model will have to keep and manage history by itself. Moreover, since the memory is kept in RAM rather than in the files or the DBMS, the access to it is instant and bounded only by the performance of hardware and the platform on which the buffers work.

Model templates —the platform supports two generic types of models: push and pull. The push type registers itself to listen to a specified set of data streams during initialization, and the execution of the model logic is triggered each time a new data feed arrives to the platform. This type is dedicated to very quick, low-latency, high-frequency models and the speed is achieved at the cost of small shared memory buffers. The pull model template executes and requests data on its own, based on a schedule. Instead of using the memory buffers, it has a direct connection to the big data facilities and hence can request as much historical data as necessary, at the expense of speed.

10 Conclusions

As discussed, the easy availability of APIs provided by Twitter, Facebook and News services has led to an ‘explosion’ of data services and software tools for scraping and sentiment analysis, and social media analytics platforms. This paper surveys some of the social media software tools, and for completeness introduced social media scraping, data cleaning and sentiment analysis.

Perhaps, the biggest concern is that companies are increasingly restricting access to their data to monetize their content. It is important that researchers have access to computational environments and especially ‘big’ social media data for experimentation. Otherwise, computational social science could become the exclusive domain of major companies, government agencies and a privileged set of academic researchers presiding over private data from which they produce papers that cannot be critiqued or replicated. Arguably what is required are public-domain computational environments and data facilities for quantitative social science, which can be accessed by researchers via a cloud-based facility.

An object may be a person, a page, a picture or an event.

Details of the information retrieved in status updates can be found here: https://developers.facebook.com/docs/reference/api/status/ .

More information about Google Refine is found in its documentation wiki: https://github.com/OpenRefine/OpenRefine/wiki .

Apache Mahout project page: http://mahout.apache.org/ .

http://rapid-i.com/ .

Botan I et al. (2010) SECRET: a model for analysis of the execution semantics of stream processing systems. Proc VLDB Endow 3(1–2):232–243

Salathé M et al. (2012) Digital epidemiology. PLoS Comput Biol 8(7):1–5

Bollen J, Mao H, Zeng X (2011) Twitter mood predicts the stock market. J Comput Sci 2(3):1–8

Article   Google Scholar  

Chandramouli B et al (2010) Data stream management systems for computational finance. IEEE Comput 43(12):45–52

Chandrasekar C, Kowsalya N (2011) Implementation of MapReduce Algorithm and Nutch Distributed File System in Nutch. Int J Comput Appl 1:6–11

Cioffi-Revilla C (2010) Computational social science. Wiley Interdiscip Rev Comput Statistics 2(3):259–271

Galas M, Brown D, Treleaven P (2012) A computational social science environment for financial/economic experiments. In: Proceedings of the Computational Social Science Society of the Americas, vol 1, pp 1–13

Hebrail G (2008) Data stream management and mining. In: Fogelman-Soulié F, Perrotta D, Piskorski J, Steinberger R (eds) Mining Massive Data Sets for Security. IOS Press, pp 89–102

Hirudkar AM, Sherekar SS (2013) Comparative analysis of data mining tools and techniques for evaluating performance of database system. Int J Comput Sci Appl 6(2):232–237

Google Scholar  

Kaplan AM (2012) If you love something, let it go mobile: mobile marketing and mobile social media 4x4. Bus Horiz 55(2):129–139

Kaplan AM, Haenlein M (2010) Users of the world, unite! the challenges and opportunities of social media. Bus Horiz 53(1):59–68

Karabulut Y (2013) Can Facebook predict stock market activity? SSRN eLibrary, pp 1–58. http://ssrn.com/abstract=2017099 . Accessed 2 Feb 2014

Khan A, Baharudin B, Lee LH, Khan K (2010) A review of machine learning algorithms for text-documents classification. J Adv Inf Technol 1(1):4–20

Kobayashi M, Takeda K (2000) Information retrieval on the web. ACM Comput Surv CSUR 32(2):144–173

Lazer D et al (2009) Computational social science. Science 323:721–723

Lerman K, Gilder A, Dredze M, Pereira F (2008) Reading the markets: forecasting public opinion of political candidates by news analysis. In: Proceedings of the 22nd international conference on computational linguistics 1:473–480

MapReduce (2011) What is MapReduce?. http://www.mapreduce.org/what-is-mapreduce.php . Accessed 31 Jan 2014

Mejova Y (2009) Sentiment analysis: an overview, pp 1–34. http://www.academia.edu/291678/Sentiment_Analysis_An_Overview . Accessed 4 Nov 2013

Murphy KP (2006) Naive Bayes classifiers. University of British Columbia, pp 1–8. http://www.ic.unicamp.br/~rocha/teaching/2011s1/mc906/aulas/naivebayes.pdf

Murphy KP (2012) Machine learning: a probabilistic perspective. In: Chapter 1: Introduction. MIT Press, pp 1–26

Narang RK (2009) Inside the black box. Hoboken, New Jersey

Book   Google Scholar  

Nuti G, Mirghaemi M, Treleaven P, Yingsaeree C (2011) Algorithmic trading. IEEE Comput 44(11):61–69

Pang B, Lee L (2008) Opinion mining and sentiment analysis. Found Trends Inf Retr 2(1–2):1–135

SAS Institute Inc (2013) SAS sentiment analysis factsheet. http://www.sas.com/resources/factsheet/sas-sentiment-analysis-factsheet.pdf . Accessed 6 Dec 2013

Teufl P, Payer U, Lackner G (2010) From NLP (natural language processing) to MLP (machine language processing). In: Kotenko I, Skormin V (eds) Computer network security, Springer, Berlin Heidelberg, pp 256–269

Thomson Reuters (2010). Thomson Reuters news analytics. http://thomsonreuters.com/products/financial-risk/01_255/News_Analytics_-_Product_Brochure-_Oct_2010_1_.pdf . Accessed 1 Oct 2013

Thomson Reuters (2012) Thomson Reuters machine readable news. http://thomsonreuters.com/products/financial-risk/01_255/TR_MRN_Overview_10Jan2012.pdf . Accessed 5 Dec 2013

Thomson Reuters (2012) Thomson Reuters MarketPsych Indices. http://thomsonreuters.com/products/financial-risk/01_255/TRMI_flyer_2012.pdf . Accessed 7 Dec 2013

Thomson Reuters (2012) Thomson Reuters news analytics for internet news and social media. http://thomsonreuters.com/business-unit/financial/eurozone/112408/news_analytics_and_social_media . Accessed 7 Dec 2013

Thomson Reuters (2013) Machine readable news. http://thomsonreuters.com/machine-readable-news/?subsector=thomson-reuters-elektron . Accessed 18 Dec 2013

Turney PD (2002) Thumbs up or thumbs down? Semantic orientation applied to unsupervised classification of reviews. In: Proceedings of the 40th Annual Meeting on Association for Computational Linguistics pp. 417–424

Vaswani V (2011) Hook into Wikipedia information using PHP and the MediaWiki API. http://www.ibm.com/developerworks/web/library/x-phpwikipedia/index.html . Accessed 21 Dec 2012

Westerski A (2008) Sentiment analysis: introduction and the state of the art overview. Universidad Politecnica de Madrid, Spain, pp 1–9. http://www.adamwesterski.com/wpcontent/files/docsCursos/sentimentA_doc_TLAW.pdf . Accessed 14 Aug 2013

Wikimedia Foundation (2014) Wikipedia:Database download. http://en.wikipedia.org/wiki/Wikipedia:Database_download . Accessed 18 Apr 2014

Wolfram SMA (2010) Modelling the stock market using Twitter. Dissertation Master of Science thesis, School of Informatics, University of Edinburgh, pp 1–74. http://homepages.inf.ed.ac.uk/miles/msc-projects/wolfram.pdf . Accessed 23 Jul 2013

Yessenov K, Misailovic S (2009) Sentiment analysis of movie review comments, pp 1–17. http://people.csail.mit.edu/kuat/courses/6.863/report.pdf . Accessed 16 Aug 2013

Download references

Acknowledgments

The authors would like to acknowledge Michal Galas who led the design and implementation of the UCL SocialSTORM platform with the assistance of Ilya Zheludev, Kacper Chwialkowski and Dan Brown. Dr. Christian Hesse of Deutsche Bank is also acknowledged for collaboration on News Analytics.

Author information

Authors and affiliations.

Department of Computer Science, University College London, Gower Street, London, WC1E 6BT, UK

Bogdan Batrinca & Philip C. Treleaven

You can also search for this author in PubMed   Google Scholar

Corresponding author

Correspondence to Philip C. Treleaven .

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution License which permits any use, distribution, and reproduction in any medium, provided the original author(s) and the source are credited.

Reprints and permissions

About this article

Batrinca, B., Treleaven, P.C. Social media analytics: a survey of techniques, tools and platforms. AI & Soc 30 , 89–116 (2015). https://doi.org/10.1007/s00146-014-0549-4

Download citation

Received : 25 February 2014

Accepted : 04 July 2014

Published : 26 July 2014

Issue Date : February 2015

DOI : https://doi.org/10.1007/s00146-014-0549-4

Share this article

Anyone you share the following link with will be able to read this content:

Sorry, a shareable link is not currently available for this article.

Provided by the Springer Nature SharedIt content-sharing initiative

  • Social media
  • Behavior economics
  • Sentiment analysis
  • Opinion mining
  • Software platforms
  • Find a journal
  • Publish with us
  • Track your research

U.S. flag

An official website of the United States government

The .gov means it’s official. Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

The site is secure. The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

  • Publications
  • Account settings

Preview improvements coming to the PMC website in October 2024. Learn More or Try it out now .

  • Advanced Search
  • Journal List
  • Front Psychol

The effect of social media on the development of students’ affective variables

1 Science and Technology Department, Nanjing University of Posts and Telecommunications, Nanjing, China

2 School of Marxism, Hohai University, Nanjing, Jiangsu, China

3 Government Enterprise Customer Center, China Mobile Group Jiangsu Co., Ltd., Nanjing, China

The use of social media is incomparably on the rise among students, influenced by the globalized forms of communication and the post-pandemic rush to use multiple social media platforms for education in different fields of study. Though social media has created tremendous chances for sharing ideas and emotions, the kind of social support it provides might fail to meet students’ emotional needs, or the alleged positive effects might be short-lasting. In recent years, several studies have been conducted to explore the potential effects of social media on students’ affective traits, such as stress, anxiety, depression, and so on. The present paper reviews the findings of the exemplary published works of research to shed light on the positive and negative potential effects of the massive use of social media on students’ emotional well-being. This review can be insightful for teachers who tend to take the potential psychological effects of social media for granted. They may want to know more about the actual effects of the over-reliance on and the excessive (and actually obsessive) use of social media on students’ developing certain images of self and certain emotions which are not necessarily positive. There will be implications for pre- and in-service teacher training and professional development programs and all those involved in student affairs.

Introduction

Social media has turned into an essential element of individuals’ lives including students in today’s world of communication. Its use is growing significantly more than ever before especially in the post-pandemic era, marked by a great revolution happening to the educational systems. Recent investigations of using social media show that approximately 3 billion individuals worldwide are now communicating via social media ( Iwamoto and Chun, 2020 ). This growing population of social media users is spending more and more time on social network groupings, as facts and figures show that individuals spend 2 h a day, on average, on a variety of social media applications, exchanging pictures and messages, updating status, tweeting, favoring, and commenting on many updated socially shared information ( Abbott, 2017 ).

Researchers have begun to investigate the psychological effects of using social media on students’ lives. Chukwuere and Chukwuere (2017) maintained that social media platforms can be considered the most important source of changing individuals’ mood, because when someone is passively using a social media platform seemingly with no special purpose, s/he can finally feel that his/her mood has changed as a function of the nature of content overviewed. Therefore, positive and negative moods can easily be transferred among the population using social media networks ( Chukwuere and Chukwuere, 2017 ). This may become increasingly important as students are seen to be using social media platforms more than before and social networking is becoming an integral aspect of their lives. As described by Iwamoto and Chun (2020) , when students are affected by social media posts, especially due to the increasing reliance on social media use in life, they may be encouraged to begin comparing themselves to others or develop great unrealistic expectations of themselves or others, which can have several affective consequences.

Considering the increasing influence of social media on education, the present paper aims to focus on the affective variables such as depression, stress, and anxiety, and how social media can possibly increase or decrease these emotions in student life. The exemplary works of research on this topic in recent years will be reviewed here, hoping to shed light on the positive and negative effects of these ever-growing influential platforms on the psychology of students.

Significance of the study

Though social media, as the name suggests, is expected to keep people connected, probably this social connection is only superficial, and not adequately deep and meaningful to help individuals feel emotionally attached to others. The psychological effects of social media on student life need to be studied in more depth to see whether social media really acts as a social support for students and whether students can use social media to cope with negative emotions and develop positive feelings or not. In other words, knowledge of the potential effects of the growing use of social media on students’ emotional well-being can bridge the gap between the alleged promises of social media and what it actually has to offer to students in terms of self-concept, self-respect, social role, and coping strategies (for stress, anxiety, etc.).

Exemplary general literature on psychological effects of social media

Before getting down to the effects of social media on students’ emotional well-being, some exemplary works of research in recent years on the topic among general populations are reviewed. For one, Aalbers et al. (2018) reported that individuals who spent more time passively working with social media suffered from more intense levels of hopelessness, loneliness, depression, and perceived inferiority. For another, Tang et al. (2013) observed that the procedures of sharing information, commenting, showing likes and dislikes, posting messages, and doing other common activities on social media are correlated with higher stress. Similarly, Ley et al. (2014) described that people who spend 2 h, on average, on social media applications will face many tragic news, posts, and stories which can raise the total intensity of their stress. This stress-provoking effect of social media has been also pinpointed by Weng and Menczer (2015) , who contended that social media becomes a main source of stress because people often share all kinds of posts, comments, and stories ranging from politics and economics, to personal and social affairs. According to Iwamoto and Chun (2020) , anxiety and depression are the negative emotions that an individual may develop when some source of stress is present. In other words, when social media sources become stress-inducing, there are high chances that anxiety and depression also develop.

Charoensukmongkol (2018) reckoned that the mental health and well-being of the global population can be at a great risk through the uncontrolled massive use of social media. These researchers also showed that social media sources can exert negative affective impacts on teenagers, as they can induce more envy and social comparison. According to Fleck and Johnson-Migalski (2015) , though social media, at first, plays the role of a stress-coping strategy, when individuals continue to see stressful conditions (probably experienced and shared by others in media), they begin to develop stress through the passage of time. Chukwuere and Chukwuere (2017) maintained that social media platforms continue to be the major source of changing mood among general populations. For example, someone might be passively using a social media sphere, and s/he may finally find him/herself with a changed mood depending on the nature of the content faced. Then, this good or bad mood is easily shared with others in a flash through the social media. Finally, as Alahmar (2016) described, social media exposes people especially the young generation to new exciting activities and events that may attract them and keep them engaged in different media contexts for hours just passing their time. It usually leads to reduced productivity, reduced academic achievement, and addiction to constant media use ( Alahmar, 2016 ).

The number of studies on the potential psychological effects of social media on people in general is higher than those selectively addressed here. For further insights into this issue, some other suggested works of research include Chang (2012) , Sriwilai and Charoensukmongkol (2016) , and Zareen et al. (2016) . Now, we move to the studies that more specifically explored the effects of social media on students’ affective states.

Review of the affective influences of social media on students

Vygotsky’s mediational theory (see Fernyhough, 2008 ) can be regarded as a main theoretical background for the support of social media on learners’ affective states. Based on this theory, social media can play the role of a mediational means between learners and the real environment. Learners’ understanding of this environment can be mediated by the image shaped via social media. This image can be either close to or different from the reality. In the case of the former, learners can develop their self-image and self-esteem. In the case of the latter, learners might develop unrealistic expectations of themselves by comparing themselves to others. As it will be reviewed below among the affective variables increased or decreased in students under the influence of the massive use of social media are anxiety, stress, depression, distress, rumination, and self-esteem. These effects have been explored more among school students in the age range of 13–18 than university students (above 18), but some studies were investigated among college students as well. Exemplary works of research on these affective variables are reviewed here.

In a cross-sectional study, O’Dea and Campbell (2011) explored the impact of online interactions of social networks on the psychological distress of adolescent students. These researchers found a negative correlation between the time spent on social networking and mental distress. Dumitrache et al. (2012) explored the relations between depression and the identity associated with the use of the popular social media, the Facebook. This study showed significant associations between depression and the number of identity-related information pieces shared on this social network. Neira and Barber (2014) explored the relationship between students’ social media use and depressed mood at teenage. No significant correlation was found between these two variables. In the same year, Tsitsika et al. (2014) explored the associations between excessive use of social media and internalizing emotions. These researchers found a positive correlation between more than 2-h a day use of social media and anxiety and depression.

Hanprathet et al. (2015) reported a statistically significant positive correlation between addiction to Facebook and depression among about a thousand high school students in wealthy populations of Thailand and warned against this psychological threat. Sampasa-Kanyinga and Lewis (2015) examined the relationship between social media use and psychological distress. These researchers found that the use of social media for more than 2 h a day was correlated with a higher intensity of psychological distress. Banjanin et al. (2015) tested the relationship between too much use of social networking and depression, yet found no statistically significant correlation between these two variables. Frison and Eggermont (2016) examined the relationships between different forms of Facebook use, perceived social support of social media, and male and female students’ depressed mood. These researchers found a positive association between the passive use of the Facebook and depression and also between the active use of the social media and depression. Furthermore, the perceived social support of the social media was found to mediate this association. Besides, gender was found as the other factor to mediate this relationship.

Vernon et al. (2017) explored change in negative investment in social networking in relation to change in depression and externalizing behavior. These researchers found that increased investment in social media predicted higher depression in adolescent students, which was a function of the effect of higher levels of disrupted sleep. Barry et al. (2017) explored the associations between the use of social media by adolescents and their psychosocial adjustment. Social media activity showed to be positively and moderately associated with depression and anxiety. Another investigation was focused on secondary school students in China conducted by Li et al. (2017) . The findings showed a mediating role of insomnia on the significant correlation between depression and addiction to social media. In the same year, Yan et al. (2017) aimed to explore the time spent on social networks and its correlation with anxiety among middle school students. They found a significant positive correlation between more than 2-h use of social networks and the intensity of anxiety.

Also in China, Wang et al. (2018) showed that addiction to social networking sites was correlated positively with depression, and this correlation was mediated by rumination. These researchers also found that this mediating effect was moderated by self-esteem. It means that the effect of addiction on depression was compounded by low self-esteem through rumination. In another work of research, Drouin et al. (2018) showed that though social media is expected to act as a form of social support for the majority of university students, it can adversely affect students’ mental well-being, especially for those who already have high levels of anxiety and depression. In their research, the social media resources were found to be stress-inducing for half of the participants, all university students. The higher education population was also studied by Iwamoto and Chun (2020) . These researchers investigated the emotional effects of social media in higher education and found that the socially supportive role of social media was overshadowed in the long run in university students’ lives and, instead, fed into their perceived depression, anxiety, and stress.

Keles et al. (2020) provided a systematic review of the effect of social media on young and teenage students’ depression, psychological distress, and anxiety. They found that depression acted as the most frequent affective variable measured. The most salient risk factors of psychological distress, anxiety, and depression based on the systematic review were activities such as repeated checking for messages, personal investment, the time spent on social media, and problematic or addictive use. Similarly, Mathewson (2020) investigated the effect of using social media on college students’ mental health. The participants stated the experience of anxiety, depression, and suicidality (thoughts of suicide or attempts to suicide). The findings showed that the types and frequency of using social media and the students’ perceived mental health were significantly correlated with each other.

The body of research on the effect of social media on students’ affective and emotional states has led to mixed results. The existing literature shows that there are some positive and some negative affective impacts. Yet, it seems that the latter is pre-dominant. Mathewson (2020) attributed these divergent positive and negative effects to the different theoretical frameworks adopted in different studies and also the different contexts (different countries with whole different educational systems). According to Fredrickson’s broaden-and-build theory of positive emotions ( Fredrickson, 2001 ), the mental repertoires of learners can be built and broadened by how they feel. For instance, some external stimuli might provoke negative emotions such as anxiety and depression in learners. Having experienced these negative emotions, students might repeatedly check their messages on social media or get addicted to them. As a result, their cognitive repertoire and mental capacity might become limited and they might lose their concentration during their learning process. On the other hand, it should be noted that by feeling positive, learners might take full advantage of the affordances of the social media and; thus, be able to follow their learning goals strategically. This point should be highlighted that the link between the use of social media and affective states is bi-directional. Therefore, strategic use of social media or its addictive use by students can direct them toward either positive experiences like enjoyment or negative ones such as anxiety and depression. Also, these mixed positive and negative effects are similar to the findings of several other relevant studies on general populations’ psychological and emotional health. A number of studies (with general research populations not necessarily students) showed that social networks have facilitated the way of staying in touch with family and friends living far away as well as an increased social support ( Zhang, 2017 ). Given the positive and negative emotional effects of social media, social media can either scaffold the emotional repertoire of students, which can develop positive emotions in learners, or induce negative provokers in them, based on which learners might feel negative emotions such as anxiety and depression. However, admittedly, social media has also generated a domain that encourages the act of comparing lives, and striving for approval; therefore, it establishes and internalizes unrealistic perceptions ( Virden et al., 2014 ; Radovic et al., 2017 ).

It should be mentioned that the susceptibility of affective variables to social media should be interpreted from a dynamic lens. This means that the ecology of the social media can make changes in the emotional experiences of learners. More specifically, students’ affective variables might self-organize into different states under the influence of social media. As for the positive correlation found in many studies between the use of social media and such negative effects as anxiety, depression, and stress, it can be hypothesized that this correlation is induced by the continuous comparison the individual makes and the perception that others are doing better than him/her influenced by the posts that appear on social media. Using social media can play a major role in university students’ psychological well-being than expected. Though most of these studies were correlational, and correlation is not the same as causation, as the studies show that the number of participants experiencing these negative emotions under the influence of social media is significantly high, more extensive research is highly suggested to explore causal effects ( Mathewson, 2020 ).

As the review of exemplary studies showed, some believed that social media increased comparisons that students made between themselves and others. This finding ratifies the relevance of the Interpretation Comparison Model ( Stapel and Koomen, 2000 ; Stapel, 2007 ) and Festinger’s (1954) Social Comparison Theory. Concerning the negative effects of social media on students’ psychology, it can be argued that individuals may fail to understand that the content presented in social media is usually changed to only represent the attractive aspects of people’s lives, showing an unrealistic image of things. We can add that this argument also supports the relevance of the Social Comparison Theory and the Interpretation Comparison Model ( Stapel and Koomen, 2000 ; Stapel, 2007 ), because social media sets standards that students think they should compare themselves with. A constant observation of how other students or peers are showing their instances of achievement leads to higher self-evaluation ( Stapel and Koomen, 2000 ). It is conjectured that the ubiquitous role of social media in student life establishes unrealistic expectations and promotes continuous comparison as also pinpointed in the Interpretation Comparison Model ( Stapel and Koomen, 2000 ; Stapel, 2007 ).

Implications of the study

The use of social media is ever increasing among students, both at school and university, which is partly because of the promises of technological advances in communication services and partly because of the increased use of social networks for educational purposes in recent years after the pandemic. This consistent use of social media is not expected to leave students’ psychological, affective and emotional states untouched. Thus, it is necessary to know how the growing usage of social networks is associated with students’ affective health on different aspects. Therefore, we found it useful to summarize the research findings in recent years in this respect. If those somehow in charge of student affairs in educational settings are aware of the potential positive or negative effects of social media usage on students, they can better understand the complexities of students’ needs and are better capable of meeting them.

Psychological counseling programs can be initiated at schools or universities to check upon the latest state of students’ mental and emotional health influenced by the pervasive use of social media. The counselors can be made aware of the potential adverse effects of social networking and can adapt the content of their inquiries accordingly. Knowledge of the potential reasons for student anxiety, depression, and stress can help school or university counselors to find individualized coping strategies when they diagnose any symptom of distress in students influenced by an excessive use of social networking.

Admittedly, it is neither possible to discard the use of social media in today’s academic life, nor to keep students’ use of social networks fully controlled. Certainly, the educational space in today’s world cannot do without the social media, which has turned into an integral part of everybody’s life. Yet, probably students need to be instructed on how to take advantage of the media and to be the least affected negatively by its occasional superficial and unrepresentative content. Compensatory programs might be needed at schools or universities to encourage students to avoid making unrealistic and impartial comparisons of themselves and the flamboyant images of others displayed on social media. Students can be taught to develop self-appreciation and self-care while continuing to use the media to their benefit.

The teachers’ role as well as the curriculum developers’ role are becoming more important than ever, as they can significantly help to moderate the adverse effects of the pervasive social media use on students’ mental and emotional health. The kind of groupings formed for instructional purposes, for example, in social media can be done with greater care by teachers to make sure that the members of the groups are homogeneous and the tasks and activities shared in the groups are quite relevant and realistic. The teachers cannot always be in a full control of students’ use of social media, and the other fact is that students do not always and only use social media for educational purposes. They spend more time on social media for communicating with friends or strangers or possibly they just passively receive the content produced out of any educational scope just for entertainment. This uncontrolled and unrealistic content may give them a false image of life events and can threaten their mental and emotional health. Thus, teachers can try to make students aware of the potential hazards of investing too much of their time on following pages or people that publish false and misleading information about their personal or social identities. As students, logically expected, spend more time with their teachers than counselors, they may be better and more receptive to the advice given by the former than the latter.

Teachers may not be in full control of their students’ use of social media, but they have always played an active role in motivating or demotivating students to take particular measures in their academic lives. If teachers are informed of the recent research findings about the potential effects of massively using social media on students, they may find ways to reduce students’ distraction or confusion in class due to the excessive or over-reliant use of these networks. Educators may more often be mesmerized by the promises of technology-, computer- and mobile-assisted learning. They may tend to encourage the use of social media hoping to benefit students’ social and interpersonal skills, self-confidence, stress-managing and the like. Yet, they may be unaware of the potential adverse effects on students’ emotional well-being and, thus, may find the review of the recent relevant research findings insightful. Also, teachers can mediate between learners and social media to manipulate the time learners spend on social media. Research has mainly indicated that students’ emotional experiences are mainly dependent on teachers’ pedagogical approach. They should refrain learners from excessive use of, or overreliance on, social media. Raising learners’ awareness of this fact that individuals should develop their own path of development for learning, and not build their development based on unrealistic comparison of their competences with those of others, can help them consider positive values for their activities on social media and, thus, experience positive emotions.

At higher education, students’ needs are more life-like. For example, their employment-seeking spirits might lead them to create accounts in many social networks, hoping for a better future. However, membership in many of these networks may end in the mere waste of the time that could otherwise be spent on actual on-campus cooperative projects. Universities can provide more on-campus resources both for research and work experience purposes from which the students can benefit more than the cyberspace that can be tricky on many occasions. Two main theories underlying some negative emotions like boredom and anxiety are over-stimulation and under-stimulation. Thus, what learners feel out of their involvement in social media might be directed toward negative emotions due to the stimulating environment of social media. This stimulating environment makes learners rely too much, and spend too much time, on social media or use them obsessively. As a result, they might feel anxious or depressed. Given the ubiquity of social media, these negative emotions can be replaced with positive emotions if learners become aware of the psychological effects of social media. Regarding the affordances of social media for learners, they can take advantage of the potential affordances of these media such as improving their literacy, broadening their communication skills, or enhancing their distance learning opportunities.

A review of the research findings on the relationship between social media and students’ affective traits revealed both positive and negative findings. Yet, the instances of the latter were more salient and the negative psychological symptoms such as depression, anxiety, and stress have been far from negligible. These findings were discussed in relation to some more relevant theories such as the social comparison theory, which predicted that most of the potential issues with the young generation’s excessive use of social media were induced by the unfair comparisons they made between their own lives and the unrealistic portrayal of others’ on social media. Teachers, education policymakers, curriculum developers, and all those in charge of the student affairs at schools and universities should be made aware of the psychological effects of the pervasive use of social media on students, and the potential threats.

It should be reminded that the alleged socially supportive and communicative promises of the prevalent use of social networking in student life might not be fully realized in practice. Students may lose self-appreciation and gratitude when they compare their current state of life with the snapshots of others’ or peers’. A depressed or stressed-out mood can follow. Students at schools or universities need to learn self-worth to resist the adverse effects of the superficial support they receive from social media. Along this way, they should be assisted by the family and those in charge at schools or universities, most importantly the teachers. As already suggested, counseling programs might help with raising students’ awareness of the potential psychological threats of social media to their health. Considering the ubiquity of social media in everybody’ life including student life worldwide, it seems that more coping and compensatory strategies should be contrived to moderate the adverse psychological effects of the pervasive use of social media on students. Also, the affective influences of social media should not be generalized but they need to be interpreted from an ecological or contextual perspective. This means that learners might have different emotions at different times or different contexts while being involved in social media. More specifically, given the stative approach to learners’ emotions, what learners emotionally experience in their application of social media can be bound to their intra-personal and interpersonal experiences. This means that the same learner at different time points might go through different emotions Also, learners’ emotional states as a result of their engagement in social media cannot be necessarily generalized to all learners in a class.

As the majority of studies on the psychological effects of social media on student life have been conducted on school students than in higher education, it seems it is too soon to make any conclusive remark on this population exclusively. Probably, in future, further studies of the psychological complexities of students at higher education and a better knowledge of their needs can pave the way for making more insightful conclusions about the effects of social media on their affective states.

Suggestions for further research

The majority of studies on the potential effects of social media usage on students’ psychological well-being are either quantitative or qualitative in type, each with many limitations. Presumably, mixed approaches in near future can better provide a comprehensive assessment of these potential associations. Moreover, most studies on this topic have been cross-sectional in type. There is a significant dearth of longitudinal investigation on the effect of social media on developing positive or negative emotions in students. This seems to be essential as different affective factors such as anxiety, stress, self-esteem, and the like have a developmental nature. Traditional research methods with single-shot designs for data collection fail to capture the nuances of changes in these affective variables. It can be expected that more longitudinal studies in future can show how the continuous use of social media can affect the fluctuations of any of these affective variables during the different academic courses students pass at school or university.

As already raised in some works of research reviewed, the different patterns of impacts of social media on student life depend largely on the educational context. Thus, the same research designs with the same academic grade students and even the same age groups can lead to different findings concerning the effects of social media on student psychology in different countries. In other words, the potential positive and negative effects of popular social media like Facebook, Snapchat, Twitter, etc., on students’ affective conditions can differ across different educational settings in different host countries. Thus, significantly more research is needed in different contexts and cultures to compare the results.

There is also a need for further research on the higher education students and how their affective conditions are positively and negatively affected by the prevalent use of social media. University students’ psychological needs might be different from other academic grades and, thus, the patterns of changes that the overall use of social networking can create in their emotions can be also different. Their main reasons for using social media might be different from school students as well, which need to be investigated more thoroughly. The sorts of interventions needed to moderate the potential negative effects of social networking on them can be different too, all requiring a new line of research in education domain.

Finally, there are hopes that considering the ever-increasing popularity of social networking in education, the potential psychological effects of social media on teachers be explored as well. Though teacher psychology has only recently been considered for research, the literature has provided profound insights into teachers developing stress, motivation, self-esteem, and many other emotions. In today’s world driven by global communications in the cyberspace, teachers like everyone else are affecting and being affected by social networking. The comparison theory can hold true for teachers too. Thus, similar threats (of social media) to self-esteem and self-worth can be there for teachers too besides students, which are worth investigating qualitatively and quantitatively.

Probably a new line of research can be initiated to explore the co-development of teacher and learner psychological traits under the influence of social media use in longitudinal studies. These will certainly entail sophisticated research methods to be capable of unraveling the nuances of variation in these traits and their mutual effects, for example, stress, motivation, and self-esteem. If these are incorporated within mixed-approach works of research, more comprehensive and better insightful findings can be expected to emerge. Correlational studies need to be followed by causal studies in educational settings. As many conditions of the educational settings do not allow for having control groups or randomization, probably, experimental studies do not help with this. Innovative research methods, case studies or else, can be used to further explore the causal relations among the different features of social media use and the development of different affective variables in teachers or learners. Examples of such innovative research methods can be process tracing, qualitative comparative analysis, and longitudinal latent factor modeling (for a more comprehensive view, see Hiver and Al-Hoorie, 2019 ).

Author contributions

Both authors listed have made a substantial, direct, and intellectual contribution to the work, and approved it for publication.

This study was sponsored by Wuxi Philosophy and Social Sciences bidding project—“Special Project for Safeguarding the Rights and Interests of Workers in the New Form of Employment” (Grant No. WXSK22-GH-13). This study was sponsored by the Key Project of Party Building and Ideological and Political Education Research of Nanjing University of Posts and Telecommunications—“Research on the Guidance and Countermeasures of Network Public Opinion in Colleges and Universities in the Modern Times” (Grant No. XC 2021002).

Conflict of interest

Author XX was employed by China Mobile Group Jiangsu Co., Ltd. The remaining author declares that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

  • Aalbers G., McNally R. J., Heeren A., de Wit S., Fried E. I. (2018). Social media and depression symptoms: A network perspective. J. Exp. Psychol. Gen. 148 1454–1462. 10.1037/xge0000528 [ PubMed ] [ CrossRef ] [ Google Scholar ]
  • Abbott J. (2017). Introduction: Assessing the social and political impact of the internet and new social media in Asia. J. Contemp. Asia 43 579–590. 10.1080/00472336.2013.785698 [ CrossRef ] [ Google Scholar ]
  • Alahmar A. T. (2016). The impact of social media on the academic performance of second year medical students at College of Medicine, University of Babylon, Iraq. J. Med. Allied Sci. 6 77–83. 10.5455/jmas.236927 [ CrossRef ] [ Google Scholar ]
  • Banjanin N., Banjanin N., Dimitrijevic I., Pantic I. (2015). Relationship between internet use and depression: Focus on physiological mood oscillations, social networking and online addictive behavior. Comp. Hum. Behav. 43 308–312. 10.1016/j.chb.2014.11.013 [ CrossRef ] [ Google Scholar ]
  • Barry C. T., Sidoti C. L., Briggs S. M., Reiter S. R., Lindsey R. A. (2017). Adolescent social media use and mental health from adolescent and parent perspectives. J. Adolesc. 61 1–11. 10.1016/j.adolescence.2017.08.005 [ PubMed ] [ CrossRef ] [ Google Scholar ]
  • Chang Y. (2012). The relationship between maladaptive perfectionism with burnout: Testing mediating effect of emotion-focused coping. Pers. Individ. Differ. 53 635–639. 10.1016/j.paid.2012.05.002 [ CrossRef ] [ Google Scholar ]
  • Charoensukmongkol P. (2018). The impact of social media on social comparison and envy in teenagers: The moderating role of the parent comparing children and in-group competition among friends. J. Child Fam. Stud. 27 69–79. 10.1007/s10826-017-0872-8 [ CrossRef ] [ Google Scholar ]
  • Chukwuere J. E., Chukwuere P. C. (2017). The impact of social media on social lifestyle: A case study of university female students. Gender Behav. 15 9966–9981. [ Google Scholar ]
  • Drouin M., Reining L., Flanagan M., Carpenter M., Toscos T. (2018). College students in distress: Can social media be a source of social support? Coll. Stud. J. 52 494–504. [ Google Scholar ]
  • Dumitrache S. D., Mitrofan L., Petrov Z. (2012). Self-image and depressive tendencies among adolescent Facebook users. Rev. Psihol. 58 285–295. [ Google Scholar ]
  • Fernyhough C. (2008). Getting Vygotskian about theory of mind: Mediation, dialogue, and the development of social understanding. Dev. Rev. 28 225–262. 10.1016/j.dr.2007.03.001 [ CrossRef ] [ Google Scholar ]
  • Festinger L. (1954). A Theory of social comparison processes. Hum. Relat. 7 117–140. 10.1177/001872675400700202 [ CrossRef ] [ Google Scholar ]
  • Fleck J., Johnson-Migalski L. (2015). The impact of social media on personal and professional lives: An Adlerian perspective. J. Individ. Psychol. 71 135–142. 10.1353/jip.2015.0013 [ CrossRef ] [ Google Scholar ]
  • Fredrickson B. L. (2001). The role of positive emotions in positive psychology: The broaden-and-build theory of positive emotions. Am. Psychol. 56 218–226. 10.1037/0003-066X.56.3.218 [ PMC free article ] [ PubMed ] [ CrossRef ] [ Google Scholar ]
  • Frison E., Eggermont S. (2016). Exploring the relationships between different types of Facebook use, perceived online social support, and adolescents’ depressed mood. Soc. Sci. Compu. Rev. 34 153–171. 10.1177/0894439314567449 [ CrossRef ] [ Google Scholar ]
  • Hanprathet N., Manwong M., Khumsri J., Yingyeun R., Phanasathit M. (2015). Facebook addiction and its relationship with mental health among Thai high school students. J. Med. Assoc. Thailand 98 S81–S90. [ PubMed ] [ Google Scholar ]
  • Hiver P., Al-Hoorie A. H. (2019). Research Methods for Complexity Theory in Applied Linguistics. Bristol: Multilingual Matters. 10.21832/HIVER5747 [ CrossRef ] [ Google Scholar ]
  • Iwamoto D., Chun H. (2020). The emotional impact of social media in higher education. Int. J. High. Educ. 9 239–247. 10.5430/ijhe.v9n2p239 [ CrossRef ] [ Google Scholar ]
  • Keles B., McCrae N., Grealish A. (2020). A systematic review: The influence of social media on depression, anxiety and psychological distress in adolescents. Int. J. Adolesc. Youth 25 79–93. 10.1080/02673843.2019.1590851 [ CrossRef ] [ Google Scholar ]
  • Ley B., Ogonowski C., Hess J., Reichling T., Wan L., Wulf V. (2014). Impacts of new technologies on media usage and social behavior in domestic environments. Behav. Inform. Technol. 33 815–828. 10.1080/0144929X.2013.832383 [ CrossRef ] [ Google Scholar ]
  • Li J.-B., Lau J. T. F., Mo P. K. H., Su X.-F., Tang J., Qin Z.-G., et al. (2017). Insomnia partially mediated the association between problematic Internet use and depression among secondary school students in China. J. Behav. Addict. 6 554–563. 10.1556/2006.6.2017.085 [ PMC free article ] [ PubMed ] [ CrossRef ] [ Google Scholar ]
  • Mathewson M. (2020). The impact of social media usage on students’ mental health. J. Stud. Affairs 29 146–160. [ Google Scholar ]
  • Neira B. C. J., Barber B. L. (2014). Social networking site use: Linked to adolescents’ social self-concept, self-esteem, and depressed mood. Aus. J. Psychol. 66 56–64. 10.1111/ajpy.12034 [ CrossRef ] [ Google Scholar ]
  • O’Dea B., Campbell A. (2011). Online social networking amongst teens: Friend or foe? Ann. Rev. CyberTher. Telemed. 9 108–112. [ PubMed ] [ Google Scholar ]
  • Radovic A., Gmelin T., Stein B. D., Miller E. (2017). Depressed adolescents positive and negative use of social media. J. Adolesc. 55 5–15. 10.1016/j.adolescence.2016.12.002 [ PMC free article ] [ PubMed ] [ CrossRef ] [ Google Scholar ]
  • Sampasa-Kanyinga H., Lewis R. F. (2015). Frequent use of social networking sites is associated with poor psychological functioning among children and adolescents. Cyberpsychol. Behav. Soc. Network. 18 380–385. 10.1089/cyber.2015.0055 [ PubMed ] [ CrossRef ] [ Google Scholar ]
  • Sriwilai K., Charoensukmongkol P. (2016). Face it, don’t Facebook it: Impacts of social media addiction on mindfulness, coping strategies and the consequence on emotional exhaustion. Stress Health 32 427–434. 10.1002/smi.2637 [ PubMed ] [ CrossRef ] [ Google Scholar ]
  • Stapel D. A. (2007). “ In the mind of the beholder: The interpretation comparison model of accessibility effects ,” in Assimilation and Contrast in Social Psychology , eds Stapel D. A., Suls J. (London: Psychology Press; ), 143–164. [ Google Scholar ]
  • Stapel D. A., Koomen W. (2000). Distinctiveness of others, mutability of selves: Their impact on self-evaluations. J. Pers. Soc. Psychol. 79 1068–1087. 10.1037//0022-3514.79.6.1068 [ PubMed ] [ CrossRef ] [ Google Scholar ]
  • Tang F., Wang X., Norman C. S. (2013). An investigation of the impact of media capabilities and extraversion on social presence and user satisfaction. Behav. Inform. Technol. 32 1060–1073. 10.1080/0144929X.2013.830335 [ CrossRef ] [ Google Scholar ]
  • Tsitsika A. K., Tzavela E. C., Janikian M., Ólafsson K., Iordache A., Schoenmakers T. M., et al. (2014). Online social networking in adolescence: Patterns of use in six European countries and links with psychosocial functioning. J. Adolesc. Health 55 141–147. 10.1016/j.jadohealth.2013.11.010 [ PubMed ] [ CrossRef ] [ Google Scholar ]
  • Vernon L., Modecki K. L., Barber B. L. (2017). Tracking effects of problematic social networking on adolescent psychopathology: The mediating role of sleep disruptions. J. Clin. Child Adolesc. Psychol. 46 269–283. 10.1080/15374416.2016.1188702 [ PubMed ] [ CrossRef ] [ Google Scholar ]
  • Virden A., Trujillo A., Predeger E. (2014). Young adult females’ perceptions of high-risk social media behaviors: A focus-group approach. J. Commun. Health Nurs. 31 133–144. 10.1080/07370016.2014.926677 [ PubMed ] [ CrossRef ] [ Google Scholar ]
  • Wang P., Wang X., Wu Y., Xie X., Wang X., Zhao F., et al. (2018). Social networking sites addiction and adolescent depression: A moderated mediation model of rumination and self-esteem. Pers. Individ. Differ. 127 162–167. 10.1016/j.paid.2018.02.008 [ CrossRef ] [ Google Scholar ]
  • Weng L., Menczer F. (2015). Topicality and impact in social media: Diverse messages, focused messengers. PLoS One 10 : e0118410 . 10.1371/journal.pone.0118410 [ PMC free article ] [ PubMed ] [ CrossRef ] [ Google Scholar ]
  • Yan H., Zhang R., Oniffrey T. M., Chen G., Wang Y., Wu Y., et al. (2017). Associations among screen time and unhealthy behaviors, academic performance, and well-being in Chinese adolescents. Int. J. Environ. Res. Public Health 14 : 596 . 10.3390/ijerph14060596 [ PMC free article ] [ PubMed ] [ CrossRef ] [ Google Scholar ]
  • Zareen N., Karim N., Khan U. A. (2016). Psycho-emotional impact of social media emojis. ISRA Med. J. 8 257–262. [ Google Scholar ]
  • Zhang R. (2017). The stress-buffering effect of self-disclosure on Facebook: An examination of stressful life events, social support, and mental health among college students. Comp. Hum. Behav. 75 527–537. 10.1016/j.chb.2017.05.043 [ CrossRef ] [ Google Scholar ]

IMAGES

  1. How does social media impact self-esteem? by thisisfkhan

    social media platforms research paper

  2. Sports Marketing Term Papers : Components of Sport Marketing and Term Paper

    social media platforms research paper

  3. Custom Essay

    social media platforms research paper

  4. How to Use Social Media Data to Boost Content Marketing Strategy

    social media platforms research paper

  5. New Social Media Platforms

    social media platforms research paper

  6. Graph Of The Most Popular Social Media Platforms 2002

    social media platforms research paper

VIDEO

  1. AI in Social Media: The Algorithms Behind the Platforms

  2. Social-Media-Influencer കാണാൻ പാടില്ലാത്ത വിധം 'അവനെ' കണ്ടു

  3. The 5 Most Popular Social Media Platforms in 2023

  4. Top 10 Social Media Platforms Transforming News Consumption

  5. How social media is shaping public opinion on scientific research

  6. Social Media Platforms: Influence and Ethics

COMMENTS

  1. (PDF) Social media platforms: A primer for researchers

    Abstract. Social media platforms play an increasingly important role in research, education, and clinical practice. As an inseparable part of open science, these platforms may increase the ...

  2. The Role of Social Media Content Format and Platform in Users

    The purpose of this study is to understand the role of social media content on users' engagement behavior. More specifically, we investigate: (i)the direct effects of format and platform on users' passive and active engagement behavior, and (ii) we assess the moderating effect of content context on the link between each content type (rational, emotional, and transactional content) and ...

  3. Social media platforms: a primer for researchers

    Social media platforms play an increasingly important role in research, education, and clinical practice. As an inseparable part of open science, these platforms may increase the visibility of research outputs and facilitate scholarly networking. The editors who ethically moderate Twitter, Facebook, and other popular social media accounts for ...

  4. Advances in Social Media Research: Past, Present and Future

    Social media comprises communication websites that facilitate relationship forming between users from diverse backgrounds, resulting in a rich social structure. User generated content encourages inquiry and decision-making. Given the relevance of social media to various stakeholders, it has received significant attention from researchers of various fields, including information systems. There ...

  5. Effects of Social Media Use on Psychological Well-Being: A Mediated

    Given this research gap, this paper's main objective is to shed light on the effect of social media use on psychological well-being. ... of the respondents reported between 5 and 8 h a day, and over one-quarter (27.2%) 9-12 h a day. Regarding the social media platforms used, over 38.5 and 39.6% reported Facebook and WhatsApp, respectively. Of ...

  6. Social media in marketing research: Theoretical bases, methodological

    Search for more papers by this author. Jorma Larimo, Jorma Larimo. ... This was followed by data obtained from social media platforms (23.4%), experimental data (11.5%), and in-depth interviews (8.9%). ... In light of the increasingly heterogeneous consumer preferences and the diversification of social media platforms, more research needs to be ...

  7. Conceptualising and measuring social media engagement: A ...

    The spread of social media platforms enhanced academic and professional debate on social media engagement that attempted to better understand its theoretical foundations and measurements. This paper aims to systematically contribute to this academic debate by analysing, discussing, and synthesising social media engagement literature in the perspective of social media metrics. Adopting a ...

  8. PDF Understanding the impacts of social media platforms on students

    Social media platforms provide students with continuous opportunities in their learning progress. ... Ghandforoush and Khan's (2016) research findings, social media benefits higher education institutions' teaching and learning processes. Lecturers and students can learn through social media, which assists them in sharing learning content and

  9. Social media for knowledge-sharing: A systematic literature review

    Social media platforms incorporate a wide range of online media, including word-of-mouth forums such as Social Networking (SNS) (like Myspace and ... The five quality assessment questions listed above were employed to evaluate the 121 selected research papers, in order to strengthen the researchers' confidence in the credibility of their ...

  10. Social media platforms: a primer for researchers

    Social media platforms play an increasingly important role in research, education, and clinical practice. As an inseparable part of open science, these platforms may increase the visibility of research outputs and facilitate scholarly networking. The editors who ethically moderate Twitter, Facebook, and other popular social media accounts for ...

  11. Social media in qualitative research: Challenges and ...

    Social media platforms contain many different kinds of digital texts and images (Urquhart & Vaast, 2012). 3. Images (such as screen shots and videos) may need to be gathered and analyzed (sometimes while actively participating in the online fieldwork) ... We hope that future research methods papers about the use of social media data will ...

  12. Social Media Use and Its Connection to Mental Health: A Systematic

    Of the 16 selected research papers, there were a research focus on adults, gender, and preadolescents [10-19]. ... Lately, studies have found that using social media platforms can have a detrimental effect on the psychological health of its users. However, the extent to which the use of social media impacts the public is yet to be determined. ...

  13. (PDF) The Impact of social media on Mental Health ...

    PDF | This paper examines the impact of social media on mental health, focusing on the role of online platforms in shaping psychological well-being. The... | Find, read and cite all the research ...

  14. Social media usage to share information in communication ...

    Social media platforms (blogs, social networking sites, microblogging, etc.) contain all Web 2.0-based services. ... Social media research is encouraged in the field of communication, as people today present themselves through digitally networked platforms. ... Larivière V. Characterizing social media metrics of scholarly papers: The effect of ...

  15. Social Media Platforms and User Engagement: A Multi-Platform ...

    There have been multiple research studies in recent days that have analyzed the growing role of social media in firms' communication strategy as well as the role of social media in shaping a firm's reputation. However, most of these studies focus on one of the multiple social media platforms that firms use. In this study, we argue that there are nuances in firms' social media ...

  16. Visual media analysis for Instagram and other online platforms

    Abstract. Instagram is currently the social media platform most associated with online images (and their analysis), but images from other platforms also can be collected and grouped, arrayed by similarity, stacked, matched, stained, labelled, depicted as network, placed side by side and otherwise analytically displayed.

  17. Exploring the role of social media in collaborative learning the new

    This study is an attempt to examine the application and usefulness of social media and mobile devices in transferring the resources and interaction with academicians in higher education institutions across the boundary wall, a hitherto unexplained area of research. This empirical study is based on the survey of 360 students of a university in eastern India, cognising students' perception on ...

  18. The use of social media and online communications in times of pandemic

    The term social media describes 'interactive computer-mediated technologies that facilitate the creation or sharing of information, ideas, career interests and other forms of expression via virtual communities and networks'. 1 This definition includes a wide variety of popular platforms, including Twitter™, Facebook™, Instagram ...

  19. The Impact of Social Media on the Mental Health of Adolescents and

    Introduction and background. Humans are naturally social species that depend on the companionship of others to thrive in life. Thus, while being socially linked with others helps alleviate stress, worry, and melancholy, a lack of social connection can pose major threats to one's mental health [].Over the past 10 years, the rapid emergence of social networking sites like Facebook, Twitter ...

  20. Understanding the complex links between social media and health

    Fabiana Zollo and colleagues call for comprehensive, robust research on the influence of social media on health behaviour in order to improve public health responses ### Key messages Over 90% of people connected to the internet are active on social media, with a total of 4.76 billion users worldwide in January 2023.1 The digital revolution has reshaped the news landscape and changed the way ...

  21. Social media analytics: a survey of techniques, tools and platforms

    This paper is written for (social science) researchers seeking to analyze the wealth of social media now available. It presents a comprehensive review of software tools for social networking media, wikis, really simple syndication feeds, blogs, newsgroups, chat and news feeds. For completeness, it also includes introductions to social media scraping, storage, data cleaning and sentiment ...

  22. Social Media and Emotional Well-being: Pursuit of Happiness or Pleasure

    Social media platforms like Facebook, Instagram, Twitter, YouTube, Snapchat, etc. helps in maintaining social interactions and social relationships with like-minded peers who share a common interest and hobbies ... The theoretical base for this research paper is based on the Emotional State theory of Happiness proposed by Haybron . According to ...

  23. PDF Use of Social Media and its Impact on Academic Performance of Tertiary

    i. To examine the impact of social media on academic performance among the students. ii. To identify the benefits obtained from using the social media. iii. To ascertain what students use social media sites for 1.4 Research Questions i. Does the use of social media sites have any impact on student's academic performance? ii.

  24. The effect of social media on the development of students' affective

    In recent years, several studies have been conducted to explore the potential effects of social media on students' affective traits, such as stress, anxiety, depression, and so on. The present paper reviews the findings of the exemplary published works of research to shed light on the positive and negative potential effects of the massive use ...