ERIC - Search Results

Publication Date

In 2026	0
Since 2025	0
Since 2022 (last 5 years)	5
Since 2017 (last 10 years)	10
Since 2007 (last 20 years)	20

Descriptor

Bayesian Statistics	28
Scores	28
Adaptive Testing	9
Computer Assisted Testing	9
Item Response Theory	8
Correlation	7
Hypothesis Testing	6
Statistical Analysis	6
Test Items	6
Probability	5
Simulation	5
Comparative Analysis	4
Computation	4
Educational Testing	4
Item Banks	4
Psychometrics	4
Scoring	4
Achievement Gains	3
Achievement Tests	3
Classification	3
Data Analysis	3
Educational Research	3
Equations (Mathematics)	3
Error of Measurement	3
Estimation (Mathematics)	3
More ▼

Source

ETS Research Report Series	4
Journal of Educational and…	3
Education and Information…	2
Educational and Psychological…	2
Grantee Submission	2
Measurement:…	2
Applied Measurement in…	1
Computers & Education	1
Journal of Advanced Academics	1
Journal of College Science…	1
Journal of Experimental…	1
National Center for Education…	1
Psychometrika	1
More ▼

Publication Type

Reports - Research	22
Journal Articles	19
Reports - Evaluative	5
Speeches/Meeting Papers	2
Opinion Papers	1
Reports - Descriptive	1

Education Level

Higher Education	4
Postsecondary Education	4
Elementary Education	3
Elementary Secondary Education	2
Grade 5	2
Grade 6	2
Intermediate Grades	2
Grade 3	1
Grade 4	1
Grade 7	1
Grade 8	1
Middle Schools	1
More ▼

Audience

Location

Indiana	1
Netherlands	1
New York	1
Portugal	1

Laws, Policies, & Programs

Assessments and Surveys

Early Childhood Longitudinal…	1
Indiana Statewide Testing for…	1
Wechsler Adult Intelligence…	1
Wechsler Intelligence Scale…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 28 results Save | Export

On Bank Assembly and Block Selection in Multidimensional Forced-Choice Adaptive Assessments

Peer reviewed

Direct link

Kreitchmann, Rodrigo S.; Sorrel, Miguel A.; Abad, Francisco J. – Educational and Psychological Measurement, 2023

Multidimensional forced-choice (FC) questionnaires have been consistently found to reduce the effects of socially desirable responding and faking in noncognitive assessments. Although FC has been considered problematic for providing ipsative scores under the classical test theory, item response theory (IRT) models enable the estimation of…

Descriptors: Measurement Techniques, Questionnaires, Social Desirability, Adaptive Testing

Handling Extreme Scores in Vertically Scaled Fixed-Length Computerized Adaptive Tests

Peer reviewed

Direct link

Wyse, Adam E.; McBride, James R. – Measurement: Interdisciplinary Research and Perspectives, 2022

A common practical challenge is how to assign ability estimates to all incorrect and all correct response patterns when using item response theory (IRT) models and maximum likelihood estimation (MLE) since ability estimates for these types of responses equal -8 or +8. This article uses a simulation study and data from an operational K-12…

Descriptors: Scores, Adaptive Testing, Computer Assisted Testing, Test Length

Prediction of Student Exam Performance Using Data Mining Classification Algorithms

Peer reviewed

Direct link

Dalia Khairy; Nouf Alharbi; Mohamed A. Amasha; Marwa F. Areed; Salem Alkhalaf; Rania A. Abougalala – Education and Information Technologies, 2024

Student outcomes are of great importance in higher education institutions. Accreditation bodies focus on them as an indicator to measure the performance and effectiveness of the institution. Forecasting students' academic performance is crucial for every educational establishment seeking to enhance performance and perseverance of its students and…

Descriptors: Prediction, Tests, Scores, Information Retrieval

Item Pool Quality Control in Educational Testing: Change Point Model, Compound Risk, and Sequential Detection

Peer reviewed

Direct link

Chen, Yunxiao; Lee, Yi-Hsuan; Li, Xiaoou – Journal of Educational and Behavioral Statistics, 2022

In standardized educational testing, test items are reused in multiple test administrations. To ensure the validity of test scores, the psychometric properties of items should remain unchanged over time. In this article, we consider the sequential monitoring of test items, in particular, the detection of abrupt changes to their psychometric…

Descriptors: Standardized Tests, Test Items, Test Validity, Scores

Detecting Test Fraud Using Bayes Factors

Peer reviewed
PDF on ERIC

Download full text

Direct link

Sinharay, Sandip; Johnson, Matthew S. – Grantee Submission, 2019

According to Wollack and Schoenig (2018), score differencing is one of six types of statistical methods used to detect test fraud. In this paper, we suggested the use of Bayes factors (e.g., Kass & Raftery, 1995) for score differencing. A simulation study shows that the suggested approach performs slightly better than an existing frequentist…

Descriptors: Cheating, Deception, Statistical Analysis, Bayesian Statistics

Measuring Language Ability of Students with Compensatory Multidimensional CAT: A Post-Hoc Simulation Study

Peer reviewed

Direct link

Ozdemir, Burhanettin; Gelbal, Selahattin – Education and Information Technologies, 2022

The computerized adaptive tests (CAT) apply an adaptive process in which the items are tailored to individuals' ability scores. The multidimensional CAT (MCAT) designs differ in terms of different item selection, ability estimation, and termination methods being used. This study aims at investigating the performance of the MCAT designs used to…

Descriptors: Scores, Computer Assisted Testing, Test Items, Language Proficiency

Revealing the Comprehension Processes of Underprepared College Students: An Evaluation of the Reading Strategies Assessment Tool

Peer reviewed
PDF on ERIC

Download full text

Magliano, Joseph P.; Lampi, Jodi P.; Ray, Melissa; Chan, Greta – Grantee Submission, 2020

Coherent mental models for successful comprehension require inferences that establish semantic "bridges" between discourse constituents and "elaborations" that incorporate relevant background knowledge. While it is established that individual differences in the extent to which postsecondary students engage in these processes…

Descriptors: Reading Comprehension, Reading Strategies, Inferences, Reading Tests

Using the Bayes Factors to Evaluate Person Fit in the Item Response Theory

Peer reviewed

Direct link

Pan, Tianshu; Yin, Yue – Applied Measurement in Education, 2017

In this article, we propose using the Bayes factors (BF) to evaluate person fit in item response theory models under the framework of Bayesian evaluation of an informative diagnostic hypothesis. We first discuss the theoretical foundation for this application and how to analyze person fit using BF. To demonstrate the feasibility of this approach,…

Descriptors: Bayesian Statistics, Goodness of Fit, Item Response Theory, Monte Carlo Methods

Research and Teaching: Correcting Missed Exam Questions as a Learning Tool in a Physiology Course

Peer reviewed

Direct link

Rozell, Timothy G.; Johnson, Jessica; Sexten, Andrea; Rhodes, Ashley E. – Journal of College Science Teaching, 2017

Students in a junior- and senior-level Anatomy and Physiology course have the opportunity to correct missed exam questions ("regrade") and earn up to half of the original points missed. The three objectives of this study were to determine if: (a) performance on the regrade assignment was correlated with scores on subsequent exams, (b)…

Descriptors: Physiology, Scores, Grades (Scholastic), Exit Examinations

The Case for Informal Argument

Peer reviewed

Direct link

Mislevy, Robert J. – Measurement: Interdisciplinary Research and Perspectives, 2012

Paul E. Newton's "Clarifying the Consensus Definition of Validity" addresses the single most important, yet stubbornly protean, value in educational and psychological assessment. "Standards for Educational and Psychological Testing" (American Educational Research Association, American Psychological Association, & National Council on Measurement in…

Descriptors: Evidence, Validity, Educational Testing, Psychological Evaluation

Effectiveness of Item Response Theory (IRT) Proficiency Estimation Methods under Adaptive Multistage Testing. Research Report. ETS RR-15-11

Peer reviewed
PDF on ERIC

Download full text

Kim, Sooyeon; Moses, Tim; Yoo, Hanwook Henry – ETS Research Report Series, 2015

The purpose of this inquiry was to investigate the effectiveness of item response theory (IRT) proficiency estimators in terms of estimation bias and error under multistage testing (MST). We chose a 2-stage MST design in which 1 adaptation to the examinees' ability levels takes place. It includes 4 modules (1 at Stage 1, 3 at Stage 2) and 3 paths…

Descriptors: Item Response Theory, Computation, Statistical Bias, Error of Measurement

Using Bayesian Networks to Improve Knowledge Assessment

Peer reviewed

Direct link

Millan, Eva; Descalco, Luis; Castillo, Gladys; Oliveira, Paula; Diogo, Sandra – Computers & Education, 2013

In this paper, we describe the integration and evaluation of an existing generic Bayesian student model (GBSM) into an existing computerized testing system within the Mathematics Education Project (PmatE--Projecto Matematica Ensino) of the University of Aveiro. This generic Bayesian student model had been previously evaluated with simulated…

Descriptors: Computer Assisted Testing, Equations (Mathematics), Mathematics Education, Expertise

Statistical Methods for Assessments in Simulations and Serious Games. Research Report. ETS RR-14-12

Peer reviewed
PDF on ERIC

Download full text

Fu, Jianbin; Zapata, Diego; Mavronikolas, Elia – ETS Research Report Series, 2014

Simulation or game-based assessments produce outcome data and process data. In this article, some statistical models that can potentially be used to analyze data from simulation or game-based assessments are introduced. Specifically, cognitive diagnostic models that can be used to estimate latent skills from outcome data so as to scale these…

Descriptors: Simulation, Evaluation Methods, Games, Data Collection

Modeling Change in Large-Scale Longitudinal Studies of Educational Growth: Four Decades of Contributions to the Assessment of Educational Growth. Research Report. ETS RR-12-04. ETS R&D Scientific and Policy Contributions Series. ETS SPC-12-01

Peer reviewed
PDF on ERIC

Download full text

Rock, Donald A. – ETS Research Report Series, 2012

This paper provides a history of ETS's role in developing assessment instruments and psychometric procedures for measuring change in large-scale national assessments funded by the Longitudinal Studies branch of the National Center for Education Statistics. It documents the innovations developed during more than 30 years of working with…

Descriptors: Models, Educational Change, Longitudinal Studies, Educational Development

The Effect of an Out-of-School Enrichment Program on the Academic Achievement of High-Potential Students from Low-Income Families

Peer reviewed

Direct link

Hodges, Jaret; McIntosh, Jason; Gentry, Marcia – Journal of Advanced Academics, 2017

High-potential students from low-income families are at an academic disadvantage compared with their more affluent peers. To address this issue, researchers have suggested novel approaches to mitigate gaps in student performance, including out-of-school enrichment programs. Longitudinal mixed effects modeling was used to analyze the growth of…

Descriptors: After School Programs, Enrichment Activities, Academic Achievement, High Achievement

Previous Page | Next Page »

Pages: 1 | 2

McBride, James R.	3
Chiang, Hanley S.	2
Schochet, Peter Z.	2
Weiss, David J.	2
Abad, Francisco J.	1
Boyd, Donald	1
Bradlow, Eric T.	1
Carvajal, Jorge	1
Castillo, Gladys	1
Chan, Greta	1
Chen, Yunxiao	1
Dalia Khairy	1
DeAyala, R. J.	1
Descalco, Luis	1
Diogo, Sandra	1
Fu, Jianbin	1
Gelbal, Selahattin	1
Gentry, Marcia	1
Hodges, Jaret	1
Hsiung, Chao A.	1
Johnson, Jessica	1
Johnson, Matthew S.	1
Kim, Sooyeon	1
Koch, William R.	1
Kreitchmann, Rodrigo S.	1
More ▼