ERIC - Search Results

Showing all 8 results

Performance Assessments with Microworlds and Their Difficulty

Peer reviewed

Kluge, Annette – Applied Psychological Measurement, 2008

The use of microworlds (MWs), or complex dynamic systems, in educational testing and personnel selection is hampered by systematic measurement errors because these new and innovative item formats are not adequately controlled for their difficulty. This empirical study introduces a way to operationalize an MW's difficulty and demonstrates the…

Descriptors: Personnel Selection, Self Efficacy, Educational Testing, Computer Uses in Education

Application of an Automated Item Selection Method to Real Data.

Peer reviewed

Stocking, Martha L.; And Others – Applied Psychological Measurement, 1993

A method of automatically selecting items for inclusion in a test with constraints on item content and statistical properties was applied to real data. Tests constructed manually from the same data and constraints were compared to tests constructed automatically. Results show areas in which automated assembly can improve test construction. (SLD)

Descriptors: Algorithms, Automation, Comparative Testing, Computer Assisted Testing

The Effect of Numbers of Experts and Common Items on Cutting Score Equivalents Based on Expert Judgment.

Peer reviewed

Norcini, John; And Others – Applied Psychological Measurement, 1991

Effects of numbers of experts (NOEs) and common items (CIs) on the scaling of cutting scores from expert judgments were studied for 11,917 physicians taking 2 forms of a medical specialty examination. Increasing NOEs and CIs reduced error; beyond 5 experts and 25 CIs, error differences were small. (SLD)

Descriptors: Comparative Testing, Cutting Scores, Equated Scores, Estimation (Mathematics)

The Relationship of Expert-System Scored Constrained Free-Response Items to Multiple-Choice and Open-Ended Items.

Peer reviewed

Bennett, Randy Elliot; And Others – Applied Psychological Measurement, 1990

The relationship of an expert-system-scored constrained free-response item type to multiple-choice and free-response items was studied using data for 614 students on the College Board's Advanced Placement Computer Science (APCS) Examination. Implications for testing and the APCS test are discussed. (SLD)

Descriptors: College Students, Comparative Testing, Computer Assisted Testing, Computer Science

Equating Scores from Adaptive to Linear Tests

Peer reviewed

van der Linden, Wim J. – Applied Psychological Measurement, 2006

Two local methods for observed-score equating are applied to the problem of equating an adaptive test to a linear test. In an empirical study, the methods were evaluated against a method based on the test characteristic function (TCF) of the linear test and traditional equipercentile equating applied to the ability estimates on the adaptive test…

Descriptors: Adaptive Testing, Computer Assisted Testing, Test Format, Equated Scores

The Nominal Response Model in Computerized Adaptive Testing.

Peer reviewed

De Ayala, R. J. – Applied Psychological Measurement, 1992

A computerized adaptive test (CAT) based on the nominal response model (NR CAT) was implemented, and the performance of the NR CAT and a CAT based on the three-parameter logistic model was compared. The NR CAT produced trait estimates comparable to those of the three-parameter test. (SLD)

Descriptors: Adaptive Testing, Comparative Testing, Computer Assisted Testing, Equations (Mathematics)

A Comparison of Two Area Measures for Detecting Differential Item Functioning.

Peer reviewed

Kim, Seock-Ho; Cohen, Allan S. – Applied Psychological Measurement, 1991

The exact and closed-interval area measures for detecting differential item functioning are compared for actual data from 1,000 African-American and 1,000 white college students taking a vocabulary test with items intentionally constructed to favor 1 set of examinees. No real differences in detection of biased items were found. (SLD)

Descriptors: Black Students, College Students, Comparative Testing, Equations (Mathematics)

Effects of Response Format on Diagnostic Assessment of Scholastic Achievement.

Peer reviewed

Birenbaum, Menucha; And Others – Applied Psychological Measurement, 1992

The effect of multiple-choice (MC) or open-ended (OE) response format on diagnostic assessment of algebra test performance was investigated with 231 eighth and ninth graders in Tel Aviv (Israel) using bug or rule space analysis. Both analyses indicated closer similarity between parallel OE subsets than between stem-equivalent OE and MC subsets.…

Descriptors: Algebra, Comparative Testing, Educational Assessment, Educational Diagnosis
