Showing all 11 results
Peer reviewed
Peter F. Halpin – Society for Research on Educational Effectiveness, 2024
Background: Meta-analyses of educational interventions have consistently documented the importance of methodological factors related to the choice of outcome measures. In particular, when interventions are evaluated using measures developed by researchers involved with the intervention or its evaluation, the effect sizes tend to be larger than…
Descriptors: College Students, College Faculty, STEM Education, Item Response Theory
Peer reviewed
Wim J. van der Linden; Luping Niu; Seung W. Choi – Journal of Educational and Behavioral Statistics, 2024
A test battery with two different levels of adaptation is presented: a within-subtest level for the selection of the items in the subtests and a between-subtest level to move from one subtest to the next. The battery runs on a two-level model consisting of a regular response model for each of the subtests extended with a second level for the joint…
Descriptors: Adaptive Testing, Test Construction, Test Format, Test Reliability
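The "two-level model" described in the van der Linden, Niu, and Choi abstract above pairs an ordinary response model within each subtest with a joint distribution over the subtest abilities at the second level. A minimal LaTeX sketch follows; the 3PL parameterization and the multivariate-normal second level are illustrative assumptions, not details taken from the article.

% Within-subtest level: a standard response model (here a 3PL, for illustration)
% for item i of subtest s answered by examinee p.
\[
  \Pr(U_{psi} = 1 \mid \theta_{ps}) = c_{si} + (1 - c_{si})\,
  \frac{\exp\!\bigl[a_{si}(\theta_{ps} - b_{si})\bigr]}{1 + \exp\!\bigl[a_{si}(\theta_{ps} - b_{si})\bigr]}
\]
% Between-subtest level: the subtest abilities are modeled jointly, which is what
% lets performance on completed subtests inform the move to the next subtest.
\[
  \boldsymbol{\theta}_{p} = (\theta_{p1}, \ldots, \theta_{pS})^{\top} \sim \mathcal{N}(\boldsymbol{\mu}, \boldsymbol{\Sigma})
\]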
Peer reviewed
Taylor, Catherine S.; Lee, Yoonsun – Applied Measurement in Education, 2010
Item response theory (IRT) methods are generally used to create score scales for large-scale tests. Research has shown that IRT scales are stable across groups and over time. Most studies have focused on items that are dichotomously scored. Now Rasch and other IRT models are used to create scales for tests that include polytomously scored items.…
Descriptors: Measures (Individuals), Item Response Theory, Robustness (Statistics), Item Analysis
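As background to the polytomous Rasch-family scaling mentioned in the Taylor and Lee abstract above, one commonly used model is Masters' partial credit model, sketched below in LaTeX. The choice of this particular model is an illustration only; the article itself may rely on a different polytomous model.

% Partial credit model: probability that examinee n scores in category k
% (k = 0, ..., m_i) on item i, with step difficulties delta_{iv}.
\[
  \Pr(X_{ni} = k \mid \theta_n) =
  \frac{\exp\!\left[\sum_{v=1}^{k} (\theta_n - \delta_{iv})\right]}
       {\sum_{h=0}^{m_i} \exp\!\left[\sum_{v=1}^{h} (\theta_n - \delta_{iv})\right]},
  \qquad \text{with } \sum_{v=1}^{0}(\cdot) \equiv 0.
\]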
Peer reviewed
Liow, Jong-Leng – European Journal of Engineering Education, 2008
Peer assessment has been studied in various situations and actively pursued as a means by which students are given more control over their learning and assessment achievement. This study investigated the reliability of staff and student assessments in two oral presentations with limited feedback for a school-based thesis course in engineering…
Descriptors: Feedback (Response), Student Evaluation, Grade Point Average, Peer Evaluation
Peer reviewed
Kong, Xiaojing J.; Wise, Steven L.; Bhola, Dennison S. – Educational and Psychological Measurement, 2007
This study compared four methods for setting item response time thresholds to differentiate rapid-guessing behavior from solution behavior. Thresholds were either (a) common for all test items, (b) based on item surface features such as the amount of reading required, (c) based on visually inspecting response time frequency distributions, or (d)…
Descriptors: Test Items, Reaction Time, Timed Tests, Item Response Theory
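The Kong, Wise, and Bhola abstract above contrasts several ways of choosing a response-time threshold that separates rapid guessing from solution behavior. The Python sketch below illustrates only the simplest of these, a single common threshold applied to every item; the three-second cutoff and the function name are hypothetical, not values taken from the study.

# Illustrative sketch, not the authors' code: flag responses faster than a
# common threshold as probable rapid guesses.
import numpy as np

def flag_rapid_guesses(response_times_sec, threshold_sec=3.0):
    # A common threshold applies the same cutoff to all items; the study also
    # considers item-specific thresholds based on surface features (e.g. amount
    # of reading) or on visual inspection of response-time distributions.
    rt = np.asarray(response_times_sec, dtype=float)
    return rt < threshold_sec

# Example: one examinee's response times (in seconds) on five items.
print(flag_rapid_guesses([1.2, 14.5, 2.8, 40.0, 0.9]))  # [ True False  True False  True]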
Peer reviewed
Drasgow, Fritz; And Others – Applied Psychological Measurement, 1991
Extensions of unidimensional appropriateness indices are developed for multiunidimensional tests (multidimensional tests composed of unidimensional subtests). Simulated and real data (scores of 2,978 students on the Armed Services Vocational Aptitude Battery) were used to evaluate the indices' effectiveness in determining individuals who are…
Descriptors: Comparative Testing, Computer Simulation, Equations (Mathematics), Graphs
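The Drasgow et al. abstract above extends unidimensional appropriateness (person-fit) indices to multiunidimensional tests. One widely used index of this kind is the standardized log-likelihood statistic l_z, sketched below in LaTeX as background; treating it as the index extended in the article is an assumption, not something the abstract states.

% Log-likelihood of a dichotomous response pattern u under probabilities P_i(theta):
\[
  l_0 = \sum_{i=1}^{n} \Bigl[ u_i \ln P_i(\theta) + (1 - u_i) \ln\bigl(1 - P_i(\theta)\bigr) \Bigr]
\]
% Standardization by the conditional mean and variance of l_0; large negative
% values of l_z flag response patterns that are unlikely at the estimated trait level.
\[
  l_z = \frac{l_0 - \mathrm{E}(l_0)}{\sqrt{\mathrm{Var}(l_0)}}, \qquad
  \mathrm{E}(l_0) = \sum_{i=1}^{n} \Bigl[ P_i \ln P_i + (1 - P_i)\ln(1 - P_i) \Bigr], \qquad
  \mathrm{Var}(l_0) = \sum_{i=1}^{n} P_i (1 - P_i) \left[ \ln \frac{P_i}{1 - P_i} \right]^{2}
\]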
DeMars, Christine E. – Online Submission, 2005
Several methods for estimating item response theory scores for multiple subtests were compared. These methods included two multidimensional item response theory models: a bi-factor model where each subtest was a composite score based on the primary trait measured by the set of tests and a secondary trait measured by the individual subtest, and a…
Descriptors: Item Response Theory, Multidimensional Scaling, Correlation, Scoring Rubrics
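The bi-factor structure mentioned in the DeMars abstract above gives every item a loading on a general (primary) trait plus a loading on exactly one subtest-specific trait. A minimal LaTeX sketch of a two-parameter logistic bi-factor item response function follows; the specific parameterization is illustrative rather than quoted from the paper.

% Item j of subtest s(j): a general trait theta^G plus one specific trait theta^{s(j)}.
\[
  \Pr\bigl(X_{pj} = 1 \mid \theta_p^{G}, \theta_p^{s(j)}\bigr) =
  \frac{1}{1 + \exp\!\bigl[-\bigl(a_j^{G}\,\theta_p^{G} + a_j^{S}\,\theta_p^{s(j)} + d_j\bigr)\bigr]}
\]
% The general trait summarizes overall performance, while the specific traits
% absorb what is unique to each subtest.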
Sykes, Robert C.; And Others – 1991
To investigate the psychometric feasibility of replacing a paper-and-pencil licensing examination with a computer-administered test, a validity study was conducted. The computer-administered test (Cadm) was a common set of items for all test takers, distinct from computerized adaptive testing, in which test takers receive items appropriate to…
Descriptors: Adults, Certification, Comparative Testing, Computer Assisted Testing
Lunz, Mary E.; And Others – 1991
This paper explores the effect of reviewing items and altering responses on the efficiency of computer adaptive tests (CATs) and the resultant ability measures of examinees. Subjects included 712 medical students: 220 subjects were randomly assigned to the review condition; 492 were randomly assigned to a review control condition (the usual CAT…
Descriptors: Academic Ability, Adaptive Testing, Certification, Comparative Testing
Bergstrom, Betty A.; Lunz, Mary E. – 1991
The level of confidence in pass/fail decisions obtained with computer adaptive tests (CATs) was compared to decisions based on paper-and-pencil tests. Subjects included 645 medical technology students from 238 educational programs across the country. The tests used in this study constituted part of the subjects' review for the certification…
Descriptors: Adaptive Testing, Certification, Comparative Testing, Computer Assisted Testing
Peer reviewed
Walstad, William B.; Robson, Denise – Journal of Economic Education, 1997
Applies Item Response Theory methods to data from the national norming of the Test of Economic Literacy to identify test questions with large male-female differences. Regression analysis showed a significant decrease in the magnitude of gender difference, although a difference was still present. (MJP)
Descriptors: Academic Aptitude, Comparative Testing, Economics, Economics Education