ERIC - Search Results

Showing 1 to 15 of 31 results

Studying the Development of Navigation Using Virtual Environments

Peer reviewed

Nguyen, Kim V.; Tansan, Merve; Newcombe, Nora S. – Journal of Cognition and Development, 2023

Research on spatial navigation is essential to understanding how mobile species adapt to their environments. Such research increasingly uses virtual environments (VEs) because, although VE has drawbacks, it allows for standardization of procedures, precision in measuring behaviors, ease in introducing variation, and cross-investigator…

Descriptors: Computer Simulation, Spatial Ability, Navigation, Research Methodology

Hybrid Threshold-Based Sequential Procedures for Detecting Compromised Items in a Computerized Adaptive Testing Licensure Exam

Peer reviewed

Lee, Chansoon; Qian, Hong – Educational and Psychological Measurement, 2022

Using classical test theory and item response theory, this study applied sequential procedures to a real operational item pool in a variable-length computerized adaptive testing (CAT) to detect items whose security may be compromised. Moreover, this study proposed a hybrid threshold approach to improve the detection power of the sequential…

Descriptors: Computer Assisted Testing, Adaptive Testing, Licensing Examinations (Professions), Item Response Theory

Comparing Different Trend Estimation Approaches in Country Means and Standard Deviations in International Large-Scale Assessment Studies

Peer reviewed

Robitzsch, Alexander; Lüdtke, Oliver – Large-scale Assessments in Education, 2023

One major aim of international large-scale assessments (ILSA) like PISA is to monitor changes in student performance over time. To accomplish this task, a set of common items (i.e., link items) is repeatedly administered in each assessment. Linking methods based on item response theory (IRT) models are used to align the results from the different…

Descriptors: Educational Trends, Trend Analysis, International Assessment, Achievement Tests

Item Calibration Methods with Multiple Subscale Multistage Testing

Peer reviewed

Wang, Chun; Chen, Ping; Jiang, Shengyu – Journal of Educational Measurement, 2020

Many large-scale educational surveys have moved from linear form design to multistage testing (MST) design. One advantage of MST is that it can provide more accurate latent trait (θ) estimates using fewer items than required by linear tests. However, MST generates incomplete response data by design; hence, questions remain as to how to…

Descriptors: Test Construction, Test Items, Adaptive Testing, Maximum Likelihood Statistics

Detecting Fraudulent Erasures at an Aggregate Level

Peer reviewed

Sinharay, Sandip – Journal of Educational and Behavioral Statistics, 2018

Wollack, Cohen, and Eckerly suggested the "erasure detection index" (EDI) to detect fraudulent erasures for individual examinees. Wollack and Eckerly extended the EDI to detect fraudulent erasures at the group level. The EDI at the group level was found to be slightly conservative. This article suggests two modifications of the EDI for…

Descriptors: Deception, Identification, Testing Problems, Cheating

Fine-Tuning Cross-Battery Assessment Procedures: After Follow-Up Testing, Use All Valid Scores, Cohesive or Not

Peer reviewed

Schneider, W. Joel; Roman, Zachary – Journal of Psychoeducational Assessment, 2018

We used data simulations to test whether composites consisting of cohesive subtest scores are more accurate than composites consisting of divergent subtest scores. We demonstrate that when multivariate normality holds, divergent and cohesive scores are equally accurate. Furthermore, excluding divergent scores results in biased estimates of…

Descriptors: Statistical Data, Simulation, Testing, Scores

Are the Nonparametric Person-Fit Statistics More Powerful than Their Parametric Counterparts? Revisiting the Simulations in Karabatsos (2003)

Peer reviewed

Sinharay, Sandip – Applied Measurement in Education, 2017

Karabatsos compared the power of 36 person-fit statistics using receiver operating characteristics curves and found the "H^T" statistic to be the most powerful in identifying aberrant examinees. He found three statistics, "C", "MCI", and "U3", to be the next most powerful. These four statistics,…

Descriptors: Nonparametric Statistics, Goodness of Fit, Simulation, Comparative Analysis

Evaluating Specification Tests in the Context of Value-Added Estimation

Peer reviewed

Guarino, Cassandra M.; Reckase, Mark D.; Stacy, Brian W.; Wooldridge, Jeffrey M. – Journal of Research on Educational Effectiveness, 2015

We study the properties of two specification tests that have been applied to a variety of estimators in the context of value-added measures (VAMs) of teacher and school quality: the Hausman test for choosing between student-level random and fixed effects, and a test for feedback (sometimes called a "falsification test"). We discuss…

Descriptors: Teacher Effectiveness, Educational Quality, Evaluation Methods, Tests

Modeling Skipped and Not-Reached Items Using IRTrees

Peer reviewed

Debeer, Dries; Janssen, Rianne; De Boeck, Paul – Journal of Educational Measurement, 2017

When dealing with missing responses, two types of omissions can be discerned: items can be skipped or not reached by the test taker. When the occurrence of these omissions is related to the proficiency process the missingness is nonignorable. The purpose of this article is to present a tree-based IRT framework for modeling responses and omissions…

Descriptors: Item Response Theory, Test Items, Responses, Testing Problems

Assessing Individual-Level Impact of Interruptions during Online Testing

Peer reviewed

Sinharay, Sandip; Wan, Ping; Choi, Seung W.; Kim, Dong-In – Journal of Educational Measurement, 2015

With an increase in the number of online tests, the number of interruptions during testing due to unexpected technical issues seems to be on the rise. For example, interruptions occurred during several recent state tests. When interruptions occur, it is important to determine the extent of their impact on the examinees' scores. Researchers such as…

Descriptors: Computer Assisted Testing, Testing Problems, Scores, Statistical Analysis

Determining the Overall Impact of Interruptions during Online Testing

Peer reviewed

Sinharay, Sandip; Wan, Ping; Whitaker, Mike; Kim, Dong-In; Zhang, Litong; Choi, Seung W. – Journal of Educational Measurement, 2014

With an increase in the number of online tests, interruptions during testing due to unexpected technical issues seem unavoidable. For example, interruptions occurred during several recent state tests. When interruptions occur, it is important to determine the extent of their impact on the examinees' scores. There is a lack of research on this…

Descriptors: Computer Assisted Testing, Testing Problems, Scores, Regression (Statistics)

Screening Test Items for Differential Item Functioning

Peer reviewed

Longford, Nicholas T. – Journal of Educational and Behavioral Statistics, 2014

A method for medical screening is adapted to differential item functioning (DIF). Its essential elements are explicit declarations of the level of DIF that is acceptable and of the loss function that quantifies the consequences of the two kinds of inappropriate classification of an item. Instead of a single level and a single function, sets of…

Descriptors: Test Items, Test Bias, Simulation, Hypothesis Testing

The Applicability of Multidimensional Computerized Adaptive Testing for Cognitive Ability Measurement in Organizational Assessment

Peer reviewed

Makransky, Guido; Glas, Cees A. W. – International Journal of Testing, 2013

Cognitive ability tests are widely used in organizations around the world because they have high predictive validity in selection contexts. Although these tests typically measure several subdomains, testing is usually carried out for a single subdomain at a time. This can be ineffective when the subdomains assessed are highly correlated. This…

Descriptors: Foreign Countries, Cognitive Ability, Adaptive Testing, Feedback (Response)

Impact of Diagnosticity on the Adequacy of Models for Cognitive Diagnosis under a Linear Attribute Structure: A Simulation Study

Peer reviewed

de la Torre, Jimmy; Karelitz, Tzur M. – Journal of Educational Measurement, 2009

Compared to unidimensional item response models (IRMs), cognitive diagnostic models (CDMs) based on latent classes represent examinees' knowledge and item requirements using discrete structures. This study systematically examines the viability of retrofitting CDMs to IRM-based data with a linear attribute structure. The study utilizes a procedure…

Descriptors: Simulation, Item Response Theory, Psychometrics, Evaluation Methods

Impact of Missing Data on the Detection of Differential Item Functioning: The Case of Mantel-Haenszel and Logistic Regression Analysis

Peer reviewed

Robitzsch, Alexander; Rupp, Andre A. – Educational and Psychological Measurement, 2009

This article describes the results of a simulation study to investigate the impact of missing data on the detection of differential item functioning (DIF). Specifically, it investigates how four methods for dealing with missing data (listwise deletion, zero imputation, two-way imputation, response function imputation) interact with two methods of…

Descriptors: Test Bias, Simulation, Interaction, Effect Size
