ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	3
Since 2006 (last 20 years)	6

Descriptor

Error of Measurement	9
Models	9
Testing	9
Psychometrics	4
Comparative Analysis	3
Item Response Theory	3
Measurement Techniques	3
Probability	3
Statistical Analysis	3
Computation	2
Evaluation Methods	2
Guessing (Tests)	2
Item Analysis	2
Measurement	2
Scores	2
Scoring	2
Test Construction	2
Test Interpretation	2
Test Reliability	2
Academic Achievement	1
Achievement Tests	1
Attention Control	1
Business Administration	1
Chemistry	1
College Entrance Examinations	1

Source

Educational and Psychological…	3
International Journal of…	2
Psicologica: International…	1
Psychological Review	1

Author

Al Harbi, Khaleel	1
Birnbaum, Michael H.	1
Dimitrov, Dimiter M.	1
Dirkzwager, Arie	1
Ferrando, Pere J.	1
Foster, Jeff L.	1
Hsiao, Yu-Yu	1
Kolakowski, Donald	1
Kwok, Oi-Man	1
Lai, Mark H. C.	1
Li, Tatyana	1
Marcoulides, George A.	1
Menold, Natalja	1
Meyer, Kevin D.	1
Millman, Jason	1
Raykov, Tenko	1
Sideridis, Georgios	1
Tsaousis, Ioannis	1

Publication Type

Journal Articles	7
Reports - Research	5
Opinion Papers	1
Reports - Descriptive	1
Reports - Evaluative	1

Education Level

High Schools	1
Higher Education	1
Postsecondary Education	1
Secondary Education	1

Showing all 9 results

Examining Measurement Invariance and Differential Item Functioning with Discrete Latent Construct Indicators: A Note on a Multiple Testing Procedure

Peer reviewed

Raykov, Tenko; Dimitrov, Dimiter M.; Marcoulides, George A.; Li, Tatyana; Menold, Natalja – Educational and Psychological Measurement, 2018

A latent variable modeling method for studying measurement invariance when evaluating latent constructs with multiple binary or binary scored items with no guessing is outlined. The approach extends the continuous indicator procedure described by Raykov and colleagues, utilizes similarly the false discovery rate approach to multiple testing, and…

Descriptors: Models, Statistical Analysis, Error of Measurement, Test Bias

Evaluation of Two Methods for Modeling Measurement Errors When Testing Interaction Effects with Observed Composite Scores

Peer reviewed

Hsiao, Yu-Yu; Kwok, Oi-Man; Lai, Mark H. C. – Educational and Psychological Measurement, 2018

Path models with observed composites based on multiple items (e.g., mean or sum score of the items) are commonly used to test interaction effects. Under this practice, researchers generally assume that the observed composites are measured without errors. In this study, we reviewed and evaluated two alternative methods within the structural…

Descriptors: Error of Measurement, Testing, Scores, Models

Improving Measures via Examining the Behavior of Distractors in Multiple-Choice Tests: Assessment and Remediation

Peer reviewed

Sideridis, Georgios; Tsaousis, Ioannis; Al Harbi, Khaleel – Educational and Psychological Measurement, 2017

The purpose of the present article was to illustrate, using an example from a national assessment, the value from analyzing the behavior of distractors in measures that engage the multiple-choice format. A secondary purpose of the present article was to illustrate four remedial actions that can potentially improve the measurement of the…

Descriptors: Multiple Choice Tests, Attention Control, Testing, Remedial Instruction

Testing Mixture Models of Transitive Preference: Comment on Regenwetter, Dana, and Davis-Stober (2011)

Peer reviewed

Birnbaum, Michael H. – Psychological Review, 2011

This article contrasts 2 approaches to analyzing transitivity of preference and other behavioral properties in choice data. The approach of Regenwetter, Dana, and Davis-Stober (2011) assumes that on each choice, a decision maker samples randomly from a mixture of preference orders to determine whether "A" is preferred to "B." In contrast, Birnbaum…

Descriptors: Evidence, Testing, Computation, Probability

Assessing Short-Term Individual Consistency Using IRT-Based Statistics

Peer reviewed

Ferrando, Pere J. – Psicologica: International Journal of Methodology and Experimental Psychology, 2010

This article proposes a procedure, based on a global statistic, for assessing intra-individual consistency in a test-retest design with a short-term retest interval. The procedure is developed within the framework of parametric item response theory, and the statistic is a likelihood-based measure that can be considered as an extension of the…

Descriptors: Item Response Theory, Intervals, Psychometrics, Testing

Passing Scores and Test Lengths for Domain-Referenced Measures.

Millman, Jason – 1972

Procedures for establishing standards and determining the number of items needed in criterion-referenced measures are reviewed. The discussion of setting a passing score is organized around five factors: performance of others, item content, educational consequences, psychological and financial costs, and measurement error. Classical test theory,…

Descriptors: Academic Achievement, Criterion Referenced Tests, Error of Measurement, Models

Latent Trait Estimation: Theory vs. Practice.

Kolakowski, Donald – 1972

Empirical results are presented as regards the implementation of a latent-trait psychometric model by means of conditional maximum likelihood estimation. Items are scored polychotomously into varying numbers of nominal categories and the test and item characteristic curves and information functions are examined. It is concluded that scoring items…

Descriptors: Error of Measurement, Item Analysis, Item Sampling, Measurement Techniques

Multiple Evaluation: A New Testing Paradigm that Exorcizes Guessing

Peer reviewed

Dirkzwager, Arie – International Journal of Testing, 2003

The crux in psychometrics is how to estimate the probability that a respondent answers an item correctly on one occasion out of many. Under the current testing paradigm this probability is estimated using all kinds of statistical techniques and mathematical modeling. Multiple evaluation is a new testing paradigm using the person's own personal…

Descriptors: Psychometrics, Probability, Models, Measurement

Considerations for Creating Multi-Language Personality Norms: A Three-Component Model of Error

Peer reviewed

Meyer, Kevin D.; Foster, Jeff L. – International Journal of Testing, 2008

With the increasing globalization of human resources practices, a commensurate increase in demand has occurred for multi-language ("global") personality norms for use in selection and development efforts. The combination of data from multiple translations of a personality assessment into a single norm engenders error from multiple sources. This…

Descriptors: Global Approach, Cultural Differences, Norms, Human Resources