What Can We Learn from College Students' Network Transactions? Constructing Useful Features for Student Success Prediction

Saved in:
Bibliographic Details
Title: What Can We Learn from College Students' Network Transactions? Constructing Useful Features for Student Success Prediction
Language: English
Authors: Pytlarz, Ian, Pu, Shi, Patel, Monal, Prabhu, Rajini
Source: International Educational Data Mining Society. 2018.
Availability: International Educational Data Mining Society. e-mail: admin@educationaldatamining.org; Web site: http://www.educationaldatamining.org
Peer Reviewed: Y
Page Count: 5
Publication Date: 2018
Document Type: Speeches/Meeting Papers
Reports - Research
Education Level: Higher Education
Postsecondary Education
Descriptors: College Freshmen, Grade Point Average, At Risk Students, Academic Achievement, Computer Networks, Attendance, Learner Engagement, On Campus Students, Study Habits, Geographic Location, Student Behavior, Artificial Intelligence, Correlation, Data Analysis, Prediction
Geographic Terms: Indiana
Abstract: Identifying at-risk students at an early stage is a challenging task for colleges and universities. In this paper, we use students' oncampus network traffic volume to construct several useful features in predicting their first semester GPA. In particular, we build proxies for their attendance, class engagement, and out-of-class study hours based on their network traffic volume. We then test how much these network-based features can increase the performance of a model with only conventional features (e.g., demographics, high school GPA, standardized test scores, etc.). We labeled students as "above median" and "below median" students based on their first term GPA. Several machine learning models were then applied, ranging from logistic regression, SVM, and random forests, to AdaBoost. The result shows that the model with network-based features consistently outperforms the ones without, in terms of accuracy, f1 score, and AUC. Given that network activity data is readily available data in most colleges and universities, this study provides practical insights on how to build more powerful models to predict student success. [For the full proceedings, see ED593090.]
Abstractor: As Provided
Entry Date: 2019
Accession Number: ED593202
Database: ERIC
FullText Text:
  Availability: 0
CustomLinks:
  – Url: https://eric.ed.gov/contentdelivery/servlet/ERICServlet?accno=ED593202
    Name: ERIC Full Text
    Category: fullText
    Text: Full Text from ERIC
Header DbId: eric
DbLabel: ERIC
An: ED593202
AccessLevel: 3
PubType: Conference
PubTypeId: conference
PreciseRelevancyScore: 0
IllustrationInfo
Items – Name: Title
  Label: Title
  Group: Ti
  Data: What Can We Learn from College Students' Network Transactions? Constructing Useful Features for Student Success Prediction
– Name: Language
  Label: Language
  Group: Lang
  Data: English
– Name: Author
  Label: Authors
  Group: Au
  Data: <searchLink fieldCode="AR" term="%22Pytlarz%2C+Ian%22">Pytlarz, Ian</searchLink><br /><searchLink fieldCode="AR" term="%22Pu%2C+Shi%22">Pu, Shi</searchLink><br /><searchLink fieldCode="AR" term="%22Patel%2C+Monal%22">Patel, Monal</searchLink><br /><searchLink fieldCode="AR" term="%22Prabhu%2C+Rajini%22">Prabhu, Rajini</searchLink>
– Name: TitleSource
  Label: Source
  Group: Src
  Data: <searchLink fieldCode="SO" term="%22International+Educational+Data+Mining+Society%22"><i>International Educational Data Mining Society</i></searchLink>. 2018.
– Name: Avail
  Label: Availability
  Group: Avail
  Data: International Educational Data Mining Society. e-mail: admin@educationaldatamining.org; Web site: http://www.educationaldatamining.org
– Name: PeerReviewed
  Label: Peer Reviewed
  Group: SrcInfo
  Data: Y
– Name: Pages
  Label: Page Count
  Group: Src
  Data: 5
– Name: DatePubCY
  Label: Publication Date
  Group: Date
  Data: 2018
– Name: TypeDocument
  Label: Document Type
  Group: TypDoc
  Data: Speeches/Meeting Papers<br />Reports - Research
– Name: Audience
  Label: Education Level
  Group: Audnce
  Data: <searchLink fieldCode="EL" term="%22Higher+Education%22">Higher Education</searchLink><br /><searchLink fieldCode="EL" term="%22Postsecondary+Education%22">Postsecondary Education</searchLink>
– Name: Subject
  Label: Descriptors
  Group: Su
  Data: <searchLink fieldCode="DE" term="%22College+Freshmen%22">College Freshmen</searchLink><br /><searchLink fieldCode="DE" term="%22Grade+Point+Average%22">Grade Point Average</searchLink><br /><searchLink fieldCode="DE" term="%22At+Risk+Students%22">At Risk Students</searchLink><br /><searchLink fieldCode="DE" term="%22Academic+Achievement%22">Academic Achievement</searchLink><br /><searchLink fieldCode="DE" term="%22Computer+Networks%22">Computer Networks</searchLink><br /><searchLink fieldCode="DE" term="%22Attendance%22">Attendance</searchLink><br /><searchLink fieldCode="DE" term="%22Learner+Engagement%22">Learner Engagement</searchLink><br /><searchLink fieldCode="DE" term="%22On+Campus+Students%22">On Campus Students</searchLink><br /><searchLink fieldCode="DE" term="%22Study+Habits%22">Study Habits</searchLink><br /><searchLink fieldCode="DE" term="%22Geographic+Location%22">Geographic Location</searchLink><br /><searchLink fieldCode="DE" term="%22Student+Behavior%22">Student Behavior</searchLink><br /><searchLink fieldCode="DE" term="%22Artificial+Intelligence%22">Artificial Intelligence</searchLink><br /><searchLink fieldCode="DE" term="%22Correlation%22">Correlation</searchLink><br /><searchLink fieldCode="DE" term="%22Data+Analysis%22">Data Analysis</searchLink><br /><searchLink fieldCode="DE" term="%22Prediction%22">Prediction</searchLink>
– Name: Subject
  Label: Geographic Terms
  Group: Su
  Data: <searchLink fieldCode="DE" term="%22Indiana%22">Indiana</searchLink>
– Name: Abstract
  Label: Abstract
  Group: Ab
  Data: Identifying at-risk students at an early stage is a challenging task for colleges and universities. In this paper, we use students' oncampus network traffic volume to construct several useful features in predicting their first semester GPA. In particular, we build proxies for their attendance, class engagement, and out-of-class study hours based on their network traffic volume. We then test how much these network-based features can increase the performance of a model with only conventional features (e.g., demographics, high school GPA, standardized test scores, etc.). We labeled students as "above median" and "below median" students based on their first term GPA. Several machine learning models were then applied, ranging from logistic regression, SVM, and random forests, to AdaBoost. The result shows that the model with network-based features consistently outperforms the ones without, in terms of accuracy, f1 score, and AUC. Given that network activity data is readily available data in most colleges and universities, this study provides practical insights on how to build more powerful models to predict student success. [For the full proceedings, see ED593090.]
– Name: AbstractInfo
  Label: Abstractor
  Group: Ab
  Data: As Provided
– Name: DateEntry
  Label: Entry Date
  Group: Date
  Data: 2019
– Name: AN
  Label: Accession Number
  Group: ID
  Data: ED593202
PLink https://search.ebscohost.com/login.aspx?direct=true&site=eds-live&db=eric&AN=ED593202
RecordInfo BibRecord:
  BibEntity:
    Languages:
      – Text: English
    PhysicalDescription:
      Pagination:
        PageCount: 5
    Subjects:
      – SubjectFull: College Freshmen
        Type: general
      – SubjectFull: Grade Point Average
        Type: general
      – SubjectFull: At Risk Students
        Type: general
      – SubjectFull: Academic Achievement
        Type: general
      – SubjectFull: Computer Networks
        Type: general
      – SubjectFull: Attendance
        Type: general
      – SubjectFull: Learner Engagement
        Type: general
      – SubjectFull: On Campus Students
        Type: general
      – SubjectFull: Study Habits
        Type: general
      – SubjectFull: Geographic Location
        Type: general
      – SubjectFull: Student Behavior
        Type: general
      – SubjectFull: Artificial Intelligence
        Type: general
      – SubjectFull: Correlation
        Type: general
      – SubjectFull: Data Analysis
        Type: general
      – SubjectFull: Prediction
        Type: general
      – SubjectFull: Indiana
        Type: general
    Titles:
      – TitleFull: What Can We Learn from College Students' Network Transactions? Constructing Useful Features for Student Success Prediction
        Type: main
  BibRelationships:
    HasContributorRelationships:
      – PersonEntity:
          Name:
            NameFull: Pytlarz, Ian
      – PersonEntity:
          Name:
            NameFull: Pu, Shi
      – PersonEntity:
          Name:
            NameFull: Patel, Monal
      – PersonEntity:
          Name:
            NameFull: Prabhu, Rajini
    IsPartOfRelationships:
      – BibEntity:
          Dates:
            – D: 01
              M: 07
              Type: published
              Y: 2018
          Titles:
            – TitleFull: International Educational Data Mining Society
              Type: main
ResultId 1