ERIC - Search Results

Publication Date

In 2026	0
Since 2025	0
Since 2022 (last 5 years)	0
Since 2017 (last 10 years)	3
Since 2007 (last 20 years)	7

Descriptor

Measurement	8
Models	8
Test Length	8
Computation	3
Foreign Countries	3
Item Response Theory	3
Accuracy	2
Comparative Analysis	2
Computer Assisted Testing	2
Error of Measurement	2
Simulation	2
Test Items	2
Adaptive Testing	1
Bayesian Statistics	1
Bias	1
Certification	1
Citizenship	1
Comparative Education	1
Construct Validity	1
Correlation	1
Cutting Scores	1
Data Analysis	1
Decision Making	1
Educational Testing	1
Efficiency	1
More ▼

Source

Educational and Psychological…	2
ProQuest LLC	2
Applied Psychological…	1
Pearson	1
Psychometrika	1
Universal Journal of…	1

Author

Chien, Yuehmei	1
Culpepper, Steven Andrew	1
Damrongpanit, Suntonrapot	1
Eggen, Theo J. H. M.	1
Ellis, Jules L.	1
Huang, Hung-Yu	1
Kim, Jihye	1
Qian, Hong	1
Shin, Chingwei David	1
Verelst, Norman D.	1
Way, Walter Denny	1
More ▼

Publication Type

Journal Articles	5
Reports - Research	4
Dissertations/Theses -…	2
Reports - Descriptive	1
Reports - Evaluative	1
Speeches/Meeting Papers	1

Education Level

Higher Education	2
Postsecondary Education	2
Elementary Secondary Education	1

Audience

Location

Taiwan	1
Thailand	1

Laws, Policies, & Programs

Assessments and Surveys

What Works Clearinghouse Rating

Showing all 8 results Save | Export

A Simple Model to Determine the Efficient Duration of Exams

Peer reviewed

Direct link

Ellis, Jules L. – Educational and Psychological Measurement, 2021

This study develops a theoretical model for the costs of an exam as a function of its duration. Two kind of costs are distinguished: (1) the costs of measurement errors and (2) the costs of the measurement. Both costs are expressed in time of the student. Based on a classical test theory model, enriched with assumptions on the context, the costs…

Descriptors: Test Length, Models, Error of Measurement, Measurement

Factor Structure and Measurement Invariance of the Self-Discipline Model Using the Different-Length Questionnaires: Application of Multiple Matrix Sampling

Peer reviewed
PDF on ERIC

Download full text

Damrongpanit, Suntonrapot – Universal Journal of Educational Research, 2019

The purposes of this study were to test the structural validity and to test the parameters invariance of the self-discipline measurement model for good student citizenship among the models, using the data from the 1,047 complete questionnaires and the reducing length questionnaires with multiple matrix sampling technique. The sample size of this…

Descriptors: Factor Structure, Questionnaires, Test Length, Citizenship

Mixture IRT Model with a Higher-Order Structure for Latent Traits

Peer reviewed

Direct link

Huang, Hung-Yu – Educational and Psychological Measurement, 2017

Mixture item response theory (IRT) models have been suggested as an efficient method of detecting the different response patterns derived from latent classes when developing a test. In testing situations, multiple latent traits measured by a battery of tests can exhibit a higher-order structure, and mixtures of latent classes may occur on…

Descriptors: Item Response Theory, Models, Bayesian Statistics, Computation

Evaluating EIV, OLS, and SEM Estimators of Group Slope Differences in the Presence of Measurement Error: The Single-Indicator Case

Peer reviewed

Direct link

Culpepper, Steven Andrew – Applied Psychological Measurement, 2012

Measurement error significantly biases interaction effects and distorts researchers' inferences regarding interactive hypotheses. This article focuses on the single-indicator case and shows how to accurately estimate group slope differences by disattenuating interaction effects with errors-in-variables (EIV) regression. New analytic findings were…

Descriptors: Evidence, Test Length, Interaction, Regression (Statistics)

A Comparison of Three Content Balancing Methods for Fixed and Variable Length Computerized Adaptive Tests

Direct link

Shin, Chingwei David; Chien, Yuehmei; Way, Walter Denny – Pearson, 2012

Content balancing is one of the most important components in the computerized adaptive testing (CAT) especially in the K to 12 large scale tests that complex constraint structure is required to cover a broad spectrum of content. The purpose of this study is to compare the weighted penalty model (WPM) and the weighted deviation method (WDM) under…

Descriptors: Computer Assisted Testing, Elementary Secondary Education, Test Content, Models

Controlling Type I Error Rate in Evaluating Differential Item Functioning for Four DIF Methods: Use of Three Procedures for Adjustment of Multiple Item Testing

Direct link

Kim, Jihye – ProQuest LLC, 2010

In DIF studies, a Type I error refers to the mistake of identifying non-DIF items as DIF items, and a Type I error rate refers to the proportion of Type I errors in a simulation study. The possibility of making a Type I error in DIF studies is always present and high possibility of making such an error can weaken the validity of the assessment.…

Descriptors: Test Bias, Test Length, Simulation, Testing

Three Essays on Teacher Education Programs and Test-Takers' Response Times on Test Items

Direct link

Qian, Hong – ProQuest LLC, 2013

This dissertation includes three essays: one essay focuses on the effect of teacher preparation programs on teacher knowledge while the other two focus on test-takers' response times on test items. Essay One addresses the problem of how opportunities to learn in teacher preparation programs influence future elementary mathematics teachers'…

Descriptors: Teacher Education Programs, Pedagogical Content Knowledge, Preservice Teacher Education, Preservice Teachers

Loss of Information in Estimating Item Parameters in Incomplete Designs

Peer reviewed

Direct link

Eggen, Theo J. H. M.; Verelst, Norman D. – Psychometrika, 2006

In this paper, the efficiency of conditional maximum likelihood (CML) and marginal maximum likelihood (MML) estimation of the item parameters of the Rasch model in incomplete designs is investigated. The use of the concept of F-information (Eggen, 2000) is generalized to incomplete testing designs. The scaled determinant of the F-information…

Descriptors: Test Length, Computation, Maximum Likelihood Statistics, Models