ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	1
Since 2016 (last 10 years)	2
Since 2006 (last 20 years)	7

Descriptor

Error of Measurement	7
Guidelines	7
Item Analysis	3
Probability	3
Comparative Analysis	2
Computation	2
Item Response Theory	2
Models	2
Scores	2
Statistical Distributions	2
Test Construction	2
Administration	1
Alternative Assessment	1
Benchmarking	1
College Instruction	1
College Mathematics	1
College Students	1
Computer Assisted Testing	1
Computer Simulation	1
Correlation	1
Cutting Scores	1
Data	1
Data Collection	1
Data Processing	1
Decision Making	1
More ▼

Source

Journal of Educational and…	2
Assessment & Evaluation in…	1
Educational and Psychological…	1
Journal of Educational…	1
National Center for Research…	1
Review of Educational Research	1

Author

Brennan, Robert L.	1
Cheema, Jehanzeb R.	1
Clauser, Brian E.	1
Clauser, Jerome C.	1
Ferrao, Maria	1
French, Brian F.	1
Griffin, Noelle	1
Kane, Michael	1
Kolen, Michael J.	1
Lee, Won-Chan	1
Maller, Susan J.	1
Niemi, David	1
Vallone, Julia	1
Wallin, Gabriel	1
Wang, Haiwen	1
Wang, Jia	1
Wiberg, Marie	1
More ▼

Publication Type

Reports - Evaluative	7
Journal Articles	6
Tests/Questionnaires	2

Education Level

Elementary Secondary Education	1
Higher Education	1
Postsecondary Education	1

Audience

Location

Mississippi	1
Portugal	1

Laws, Policies, & Programs

Assessments and Surveys

What Works Clearinghouse Rating

Showing all 7 results Save | Export

Model Misspecification and Robustness of Observed-Score Test Equating Using Propensity Scores

Peer reviewed

Direct link

Wallin, Gabriel; Wiberg, Marie – Journal of Educational and Behavioral Statistics, 2023

This study explores the usefulness of covariates on equating test scores from nonequivalent test groups. The covariates are captured by an estimated propensity score, which is used as a proxy for latent ability to balance the test groups. The objective is to assess the sensitivity of the equated scores to various misspecifications in the…

Descriptors: Models, Error of Measurement, Robustness (Statistics), Equated Scores

Examining the Precision of Cut Scores within a Generalizability Theory Framework: A Closer Look at the Item Effect

Peer reviewed

Direct link

Clauser, Brian E.; Kane, Michael; Clauser, Jerome C. – Journal of Educational Measurement, 2020

An Angoff standard setting study generally yields judgments on a number of items by a number of judges (who may or may not be nested in panels). Variability associated with judges (and possibly panels) contributes error to the resulting cut score. The variability associated with items plays a more complicated role. To the extent that the mean item…

Descriptors: Cutting Scores, Generalization, Decision Making, Standard Setting

A Review of Missing Data Handling Methods in Education Research

Direct link

Cheema, Jehanzeb R. – Review of Educational Research, 2014

Missing data are a common occurrence in survey-based research studies in education, and the way missing values are handled can significantly affect the results of analyses based on such data. Despite known problems with performance of some missing data handling methods, such as mean imputation, many researchers in education continue to use those…

Descriptors: Educational Research, Data, Data Collection, Data Processing

Iterative Purification and Effect Size Use with Logistic Regression for Differential Item Functioning Detection

Peer reviewed

Direct link

French, Brian F.; Maller, Susan J. – Educational and Psychological Measurement, 2007

Two unresolved implementation issues with logistic regression (LR) for differential item functioning (DIF) detection include ability purification and effect size use. Purification is suggested to control inaccuracies in DIF detection as a result of DIF items in the ability estimate. Additionally, effect size use may be beneficial in controlling…

Descriptors: Effect Size, Test Bias, Guidelines, Error of Measurement

E-Assessment within the Bologna Paradigm: Evidence from Portugal

Peer reviewed

Direct link

Ferrao, Maria – Assessment & Evaluation in Higher Education, 2010

The Bologna Declaration brought reforms into higher education that imply changes in teaching methods, didactic materials and textbooks, infrastructures and laboratories, etc. Statistics and mathematics are disciplines that traditionally have the worst success rates, particularly in non-mathematics core curricula courses. This research project,…

Descriptors: Foreign Countries, Computer Assisted Testing, Educational Technology, Educational Assessment

Interval Estimation for True Raw and Scale Scores under the Binomial Error Model

Peer reviewed

Direct link

Lee, Won-Chan; Brennan, Robert L.; Kolen, Michael J. – Journal of Educational and Behavioral Statistics, 2006

Assuming errors of measurement are distributed binomially, this article reviews various procedures for constructing an interval for an individual's true number-correct score; presents two general interval estimation procedures for an individual's true scale score (i.e., normal approximation and endpoints conversion methods); compares various…

Descriptors: Probability, Intervals, Guidelines, Computer Simulation

Recommendations for Building a Valid Benchmark Assessment System: Second Report to the Jackson Public Schools. CRESST Report 724

Download full text

Niemi, David; Wang, Jia; Wang, Haiwen; Vallone, Julia; Griffin, Noelle – National Center for Research on Evaluation, Standards, and Student Testing (CRESST), 2007

There are usually many testing activities going on in a school, with different tests serving different purposes, thus organization and planning are key in creating an efficient system in assessing the most important educational objectives. In the ideal case, an assessment system will be able to inform on student learning, instruction and…

Descriptors: School Administration, Educational Objectives, Administration, Public Schools