Variety and Mainstays of the R Developer Community

Saved in:
Bibliographic Details
Title: Variety and Mainstays of the R Developer Community
Language: English
Authors: Lijin Zhang (ORCID 0000-0002-4222-8850), Xueyang Li, Zhiyong Zhang (ORCID 0000-0003-0590-2196)
Source: Grantee Submission. 2023 15(3):5-25.
Peer Reviewed: Y
Page Count: 22
Publication Date: 2023
Sponsoring Agency: Institute of Education Sciences (ED)
Contract Number: R305D210023
Document Type: Journal Articles
Reports - Research
Descriptors: Computer Software, Programming Languages, Data Analysis, Visual Aids, Models, Word Frequency, Phrase Structure, Classification, Information Retrieval, Network Analysis, Computational Linguistics, Probability, Bayesian Statistics, Authors
DOI: 10.32614/RJ-2023-060
ISSN: 2073-4859
Abstract: The thriving developer community has a significant impact on the widespread use of R software. To better understand this community, we conducted a study analyzing all R packages available on CRAN. We identified the most popular topics of R packages by text mining the package descriptions. Additionally, using network centrality measures, we discovered the important packages in the package dependency network and influential developers in the global R community. Our analysis showed that among the 20 topics identified in the topic model, "Data Import, Export, and Wrangling," as well as "Data Visualization, Result Presentation, and Interactive Web Applications," were particularly popular among influential packages and developers. These findings provide valuable insights into the R community.
Abstractor: As Provided
IES Funded: Yes
Entry Date: 2024
Accession Number: ED645169
Database: ERIC
Description
Abstract:The thriving developer community has a significant impact on the widespread use of R software. To better understand this community, we conducted a study analyzing all R packages available on CRAN. We identified the most popular topics of R packages by text mining the package descriptions. Additionally, using network centrality measures, we discovered the important packages in the package dependency network and influential developers in the global R community. Our analysis showed that among the 20 topics identified in the topic model, "Data Import, Export, and Wrangling," as well as "Data Visualization, Result Presentation, and Interactive Web Applications," were particularly popular among influential packages and developers. These findings provide valuable insights into the R community.
ISSN:2073-4859
DOI:10.32614/RJ-2023-060