ERIC - Search Results

Publication Date

In 2025	1
Since 2024	2
Since 2021 (last 5 years)	5
Since 2016 (last 10 years)	10
Since 2006 (last 20 years)	25

Source

Educational and Psychological…

Publication Type

Journal Articles	37
Reports - Research	23
Reports - Evaluative	12
Reports - Descriptive	2

Education Level

Higher Education	3
Elementary Secondary Education	2
Postsecondary Education	2
Elementary Education	1
Grade 5	1
High Schools	1
Intermediate Grades	1
Junior High Schools	1
Middle Schools	1
Secondary Education	1

Audience

Location

Canada	1
Florida	1
Netherlands (Amsterdam)	1
Norway	1
Taiwan	1

Laws, Policies, & Programs

Assessments and Surveys

Bender Gestalt Test	1
Coopersmith Self Esteem…	1
Defining Issues Test	1
Developmental Test of Visual…	1
Embedded Figures Test	1
General Educational…	1
Kohlberg Moral Judgment…	1
Minnesota Multiphasic…	1
Need for Cognition Scale	1
Raven Advanced Progressive…	1
Raven Progressive Matrices	1
Rod and Frame Test	1
Rorschach Test	1
SRA Achievement Series	1
Sarason Test Anxiety Scale…	1
Test for Auditory…	1
Wechsler Adult Intelligence…	1
Wechsler Intelligence Scale…	1
More ▼

What Works Clearinghouse Rating

Showing 1 to 15 of 68 results Save | Export

Added Value of Subscores for Tests with Polytomous Items

Peer reviewed

Direct link

Kylie Gorney; Sandip Sinharay – Educational and Psychological Measurement, 2025

Test-takers, policymakers, teachers, and institutions are increasingly demanding that testing programs provide more detailed feedback regarding test performance. As a result, there has been a growing interest in the reporting of subscores that potentially provide such detailed feedback. Haberman developed a method based on classical test theory…

Descriptors: Scores, Test Theory, Test Items, Testing

Multimodal Data Fusion to Detect Preknowledge Test-Taking Behavior Using Machine Learning

Peer reviewed

Direct link

Kaiwen Man – Educational and Psychological Measurement, 2024

In various fields, including college admission, medical board certifications, and military recruitment, high-stakes decisions are frequently made based on scores obtained from large-scale assessments. These decisions necessitate precise and reliable scores that enable valid inferences to be drawn about test-takers. However, the ability of such…

Descriptors: Prior Learning, Testing, Behavior, Artificial Intelligence

Semisupervised Learning Method to Adjust Biased Item Difficulty Estimates Caused by Nonignorable Missingness in a Virtual Learning Environment

Peer reviewed
PDF on ERIC

Download full text

Direct link

Xue, Kang; Huggins-Manley, Anne Corinne; Leite, Walter – Educational and Psychological Measurement, 2022

In data collected from virtual learning environments (VLEs), item response theory (IRT) models can be used to guide the ongoing measurement of student ability. However, such applications of IRT rely on unbiased item parameter estimates associated with test items in the VLE. Without formal piloting of the items, one can expect a large amount of…

Descriptors: Virtual Classrooms, Artificial Intelligence, Item Response Theory, Item Analysis

Disentangling Item and Testing Effects in Inoculation Research on Online Misinformation: Solomon Revisited

Peer reviewed

Direct link

Roozenbeek, Jon; Maertens, Rakoen; McClanahan, William; van der Linden, Sander – Educational and Psychological Measurement, 2021

Online misinformation is a pervasive global problem. In response, psychologists have recently explored the theory of psychological inoculation: If people are preemptively exposed to a weakened version of a misinformation technique, they can build up cognitive resistance. This study addresses two unanswered methodological questions about a widely…

Descriptors: Games, Intervention, Scores, Pretests Posttests

The Use of Theory of Linear Mixed-Effects Models to Detect Fraudulent Erasures at an Aggregate Level

Peer reviewed
PDF on ERIC

Download full text

Direct link

Peng, Luyao; Sinharay, Sandip – Educational and Psychological Measurement, 2022

Wollack et al. (2015) suggested the erasure detection index (EDI) for detecting fraudulent erasures for individual examinees. Wollack and Eckerly (2017) and Sinharay (2018) extended the index of Wollack et al. (2015) to suggest three EDIs for detecting fraudulent erasures at the aggregate or group level. This article follows up on the research of…

Descriptors: Cheating, Identification, Statistical Analysis, Testing

Does the Effect of a Time Limit for Testing Impair Structural Investigations by Means of Confirmatory Factor Models?

Peer reviewed

Direct link

Schweizer, Karl; Reiß, Siegbert; Troche, Stefan – Educational and Psychological Measurement, 2019

The article reports three simulation studies conducted to find out whether the effect of a time limit for testing impairs model fit in investigations of structural validity, whether the representation of the assumed source of the effect prevents impairment of model fit and whether it is possible to identify and discriminate this method effect from…

Descriptors: Timed Tests, Testing, Barriers, Testing Problems

The Development of MST Test Information for the Prediction of Test Performances

Peer reviewed

Direct link

Park, Ryoungsun; Kim, Jiseon; Chung, Hyewon; Dodd, Barbara G. – Educational and Psychological Measurement, 2017

The current study proposes novel methods to predict multistage testing (MST) performance without conducting simulations. This method, called MST test information, is based on analytic derivation of standard errors of ability estimates across theta levels. We compared standard errors derived analytically to the simulation results to demonstrate the…

Descriptors: Testing, Performance, Prediction, Error of Measurement

Examining Measurement Invariance and Differential Item Functioning with Discrete Latent Construct Indicators: A Note on a Multiple Testing Procedure

Peer reviewed

Direct link

Raykov, Tenko; Dimitrov, Dimiter M.; Marcoulides, George A.; Li, Tatyana; Menold, Natalja – Educational and Psychological Measurement, 2018

A latent variable modeling method for studying measurement invariance when evaluating latent constructs with multiple binary or binary scored items with no guessing is outlined. The approach extends the continuous indicator procedure described by Raykov and colleagues, utilizes similarly the false discovery rate approach to multiple testing, and…

Descriptors: Models, Statistical Analysis, Error of Measurement, Test Bias

A Comparison of Exposure Control Procedures in CATs Using the 3PL Model

Peer reviewed

Direct link

Leroux, Audrey J.; Lopez, Myriam; Hembry, Ian; Dodd, Barbara G. – Educational and Psychological Measurement, 2013

This study compares the progressive-restricted standard error (PR-SE) exposure control procedure to three commonly used procedures in computerized adaptive testing, the randomesque, Sympson-Hetter (SH), and no exposure control methods. The performance of these four procedures is evaluated using the three-parameter logistic model under the…

Descriptors: Computer Assisted Testing, Adaptive Testing, Comparative Analysis, Statistical Analysis

Evaluation of Two Methods for Modeling Measurement Errors When Testing Interaction Effects with Observed Composite Scores

Peer reviewed

Direct link

Hsiao, Yu-Yu; Kwok, Oi-Man; Lai, Mark H. C. – Educational and Psychological Measurement, 2018

Path models with observed composites based on multiple items (e.g., mean or sum score of the items) are commonly used to test interaction effects. Under this practice, researchers generally assume that the observed composites are measured without errors. In this study, we reviewed and evaluated two alternative methods within the structural…

Descriptors: Error of Measurement, Testing, Scores, Models

Factorial Invariance in Multiple Populations: A Multiple Testing Procedure

Peer reviewed

Direct link

Raykov, Tenko; Marcoulides, George A.; Millsap, Roger E. – Educational and Psychological Measurement, 2013

A multiple testing method for examining factorial invariance for latent constructs evaluated by multiple indicators in distinct populations is outlined. The procedure is based on the false discovery rate concept and multiple individual restriction tests and resolves general limitations of a popular factorial invariance testing approach. The…

Descriptors: Testing, Statistical Analysis, Factor Analysis, Statistical Significance

A New Stopping Rule for Computerized Adaptive Testing

Peer reviewed

Direct link

Choi, Seung W.; Grady, Matthew W.; Dodd, Barbara G. – Educational and Psychological Measurement, 2011

The goal of the current study was to introduce a new stopping rule for computerized adaptive testing (CAT). The predicted standard error reduction (PSER) stopping rule uses the predictive posterior variance to determine the reduction in standard error that would result from the administration of additional items. The performance of the PSER was…

Descriptors: Item Banks, Adaptive Testing, Computer Assisted Testing, Evaluation Methods

Improving Measures via Examining the Behavior of Distractors in Multiple-Choice Tests: Assessment and Remediation

Peer reviewed

Direct link

Sideridis, Georgios; Tsaousis, Ioannis; Al Harbi, Khaleel – Educational and Psychological Measurement, 2017

The purpose of the present article was to illustrate, using an example from a national assessment, the value from analyzing the behavior of distractors in measures that engage the multiple-choice format. A secondary purpose of the present article was to illustrate four remedial actions that can potentially improve the measurement of the…

Descriptors: Multiple Choice Tests, Attention Control, Testing, Remedial Instruction

Evaluating Ranking Strategies in Assessing Change when the Measures Differ across Time

Peer reviewed

Direct link

Moses, Tim; Kim, Sooyeon – Educational and Psychological Measurement, 2012

In this study, a ranking strategy was evaluated for comparing subgroups' change using identical, equated, and nonidentical measures. Four empirical data sets were evaluated, each of which contained examinees' scores on two occasions, where the two occasions' scores were obtained on a single identical measure, on two equated tests, and on two…

Descriptors: Testing, Change, Scores, Measures (Individuals)

Effect of Multiple Testing Adjustment in Differential Item Functioning Detection

Peer reviewed

Direct link

Kim, Jihye; Oshima, T. C. – Educational and Psychological Measurement, 2013

In a typical differential item functioning (DIF) analysis, a significance test is conducted for each item. As a test consists of multiple items, such multiple testing may increase the possibility of making a Type I error at least once. The goal of this study was to investigate how to control a Type I error rate and power using adjustment…

Descriptors: Test Bias, Test Items, Statistical Analysis, Error of Measurement

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5

Dodd, Barbara G.	3
Krus, David J.	2
Marcoulides, George A.	2
Raykov, Tenko	2
Abu-Sayf, F. K.	1
Aiken, Lewis R.	1
Al Harbi, Khaleel	1
Ayers, Jerry B.	1
Berdie, Ralph F.	1
Bode, James	1
Bowers, John	1
Brooks, Sarah	1
Brooks, Thomas	1
Calabrese, Frank J.	1
Carbuhn, Wayne M.	1
Carlson, Jerry S.	1
Ceurvorst, Robert W.	1
Chang, Shun-Wen	1
Chissom, Brad S.	1
Choi, Seung W.	1
Chung, Hyewon	1
Cizek, Gregory J.	1
Collins, Jackie	1
Cronbach, Lee J.	1
More ▼

Testing	68
Statistical Analysis	18
Comparative Analysis	15
Test Reliability	12
Test Validity	12
Scores	10
Computer Assisted Testing	9
Test Items	9
Error of Measurement	8
Correlation	7
Factor Analysis	7
Item Analysis	7
Item Response Theory	7
Achievement Tests	6
Computer Programs	6
Higher Education	6
Intelligence Tests	6
Multiple Choice Tests	6
Response Style (Tests)	6
Measurement Techniques	5
Models	5
Psychological Testing	5
Psychometrics	5
Test Construction	5
Testing Problems	5
More ▼