Showing all 8 results
Peer reviewed
Yao, Lihua – Applied Psychological Measurement, 2013
Through simulated data, five multidimensional computerized adaptive testing (MCAT) selection procedures with varying test lengths are examined and compared using different stopping rules. Fixed item exposure rates are used for all the items, and the Priority Index (PI) method is used for the content constraints. Two stopping rules, standard error…
Descriptors: Computer Assisted Testing, Adaptive Testing, Test Items, Selection
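
The standard-error stopping rule mentioned in the abstract can be sketched in a few lines. The sketch below is a hedged, unidimensional simplification, not Yao's multidimensional procedure; the helper callables (select_item, update_theta, se_of_theta) are hypothetical placeholders for an item-selection rule, an ability-estimate update, and a standard-error computation:

def administer_adaptive_test(select_item, update_theta, se_of_theta,
                             se_target=0.3, max_items=40):
    administered = []
    theta = 0.0  # provisional ability estimate
    while len(administered) < max_items:
        item = select_item(theta, administered)      # e.g., maximum information
        administered.append(item)
        theta = update_theta(theta, administered)    # e.g., EAP or MLE update
        if se_of_theta(theta, administered) <= se_target:
            break  # precision target reached: stop early (variable-length test)
    return theta, administered

Under this rule the test length varies by examinee, which is exactly why fixed-length and variable-length stopping rules can be compared on measurement precision per item administered.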
Peer reviewed
Greiff, Samuel; Wüstenberg, Sascha; Funke, Joachim – Applied Psychological Measurement, 2012
This article addresses two unsolved measurement issues in dynamic problem solving (DPS) research: (a) unsystematic construction of DPS tests, which makes it difficult to compare results across studies, and (b) the use of time-intensive single tasks, which leads to severe reliability problems. To solve these issues, the MicroDYN approach is…
Descriptors: Problem Solving, Tests, Measurement, Structural Equation Models
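
Tasks in the MicroDYN family are typically built on small linear structural equation systems, a standard formalization from the minimal-complex-systems literature (assumed here rather than quoted from the abstract):

\[ \mathbf{x}_{t+1} = \mathbf{A}\,\mathbf{x}_t + \mathbf{B}\,\mathbf{u}_t, \]

where \mathbf{x}_t holds the endogenous (output) variables, \mathbf{u}_t the exogenous variables the examinee manipulates, and \mathbf{A}, \mathbf{B} are small coefficient matrices that define one task. Keeping each task this small is what allows many short tasks per test instead of one time-intensive scenario.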
Peer reviewed
Bechger, Timo M.; Maris, Gunter; Hsiao, Ya Ping – Applied Psychological Measurement, 2010
The main purpose of this article is to demonstrate how halo effects may be detected and quantified using two independent ratings of the same person. A practical illustration is given to show how halo effects can be avoided. (Contains 2 tables, 7 figures, and 2 notes.)
Descriptors: Performance Based Assessment, Test Reliability, Test Length, Language Tests
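
One way two independent ratings can expose halo, sketched under assumptions (an illustration of the general logic, not necessarily the authors' exact procedure): correlations between traits scored by the same rater contain both true trait overlap and rater-specific halo, while cross-rater correlations do not share the halo component.

import numpy as np

# ratings_a, ratings_b: (n_persons, n_traits) arrays from two independent raters.
def halo_gap(ratings_a, ratings_b):
    n_traits = ratings_a.shape[1]
    within, between = [], []
    for i in range(n_traits):
        for j in range(i + 1, n_traits):
            # trait i vs. trait j, both judged by rater A: true overlap + halo
            within.append(np.corrcoef(ratings_a[:, i], ratings_a[:, j])[0, 1])
            # trait i by rater A vs. trait j by rater B: halo does not carry over
            between.append(np.corrcoef(ratings_a[:, i], ratings_b[:, j])[0, 1])
    # a clearly positive gap suggests rater-specific halo
    return np.mean(within) - np.mean(between)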
Peer reviewed
Wise, Steven L.; DeMars, Christine E. – Applied Psychological Measurement, 2009
Attali (2005) recently demonstrated that Cronbach's coefficient α estimate of reliability for number-right multiple-choice tests will tend to be deflated by speededness, rather than inflated as is commonly believed and taught. Although the methods, findings, and conclusions of Attali (2005) are correct, his article may inadvertently invite a…
Descriptors: Guessing (Tests), Multiple Choice Tests, Test Reliability, Computation
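
For reference, the standard definition of Cronbach's coefficient α for a k-item test, with item variances \sigma_i^2 and total-score variance \sigma_X^2 (the general formula, not anything specific to this article):

\[ \alpha = \frac{k}{k-1}\left(1 - \frac{\sum_{i=1}^{k}\sigma_i^2}{\sigma_X^2}\right). \]

Whether speededness pushes this estimate up or down depends on how it shifts the item variances relative to the total-score variance, which is why the direction of the bias is easy to get wrong by intuition alone.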
Peer reviewed
Wang, Wen-Chung – Applied Psychological Measurement, 2008
Raju and Oshima (2005) proposed two prophecy formulas based on item response theory in order to predict the reliability of ability estimates for a test after a change in its length. The first prophecy formula is equivalent to the classical Spearman-Brown prophecy formula. The second prophecy formula is misleading because of an underlying false…
Descriptors: Test Reliability, Item Response Theory, Computation, Evaluation Methods
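
For reference, the classical Spearman-Brown prophecy formula that the abstract invokes (standard form, not taken from the article itself): for a test with reliability \rho whose length is changed by a factor k, the predicted reliability is

\[ \rho_k = \frac{k\rho}{1 + (k-1)\rho}. \]

For example, doubling (k = 2) a test with \rho = 0.70 predicts a reliability of 1.4 / 1.7 ≈ 0.82.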
Peer reviewed
Lee, Won-Chan – Applied Psychological Measurement, 2007
This article introduces a multinomial error model, which models an examinee's test scores obtained over repeated measurements of an assessment that consists of polytomously scored items. A compound multinomial error model is also introduced for situations in which items are stratified according to content categories and/or prespecified numbers of…
Descriptors: Simulation, Error of Measurement, Scoring, Test Items
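
A multinomial error model of the general kind the abstract describes treats an examinee's category counts x = (x_1, …, x_m) over n polytomously scored items as multinomially distributed (standard form; the article's exact parameterization may differ):

\[ P(X = x \mid \pi) = \frac{n!}{x_1!\cdots x_m!}\,\prod_{c=1}^{m}\pi_c^{x_c}, \qquad \sum_{c=1}^{m} x_c = n, \]

where \pi_c is the examinee's probability of earning score category c. A compound version of the model, as described, would take a product of such terms across content strata, each with its own item count and category probabilities.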
Peer reviewed
Biswas, Ajoy Kumar – Applied Psychological Measurement, 2006
This article studies the ordinal reliability of (total) test scores. This study is based on a classical-type linear model of observed score (X), true score (T), and random error (E). Based on the idea of Kendall's tau-a coefficient, a measure of ordinal reliability for small-examinee populations is developed. This measure is extended to large…
Descriptors: True Scores, Test Theory, Test Reliability, Scores
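
For reference, Kendall's tau-a for n paired observations, with C concordant and D discordant pairs (the standard definition on which the abstract's ordinal reliability measure builds):

\[ \tau_a = \frac{C - D}{\binom{n}{2}} = \frac{C - D}{n(n-1)/2}. \]

An ordinal reliability coefficient in this spirit quantifies how well the rank order of observed scores X reproduces the rank order of true scores T, rather than how well X tracks T linearly (a hedged reading; the abstract does not spell out the exact construction).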
Peer reviewed
Gorin, Joanna S.; Embretson, Susan E. – Applied Psychological Measurement, 2006
Recent assessment research joining cognitive psychology and psychometric theory has introduced a new technology, item generation. In algorithmic item generation, items are systematically created based on specific combinations of features that underlie the processing required to correctly solve a problem. Reading comprehension items have been more…
Descriptors: Difficulty Level, Test Items, Modeling (Psychology), Paragraph Composition
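
A toy sketch of algorithmic item generation under assumptions (feature names and levels here are hypothetical, not taken from the article): each combination of feature levels defines one generated item, so predicted difficulty can be modeled from the feature design.

from itertools import product

# Hypothetical difficulty-driving features for a reading comprehension item family.
features = {
    "passage_length":  ["short", "long"],
    "vocabulary":      ["common", "rare"],
    "inference_depth": ["literal", "bridging"],
}

# Cross all feature levels: every combination is one generated item variant,
# and the feature design matrix supports modeling item difficulty.
items = [dict(zip(features, levels)) for levels in product(*features.values())]
for spec in items:
    print(spec)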