ERIC - Search Results

Showing all 8 results

Using Automated Procedures to Score Educational Essays Written in Three Languages

Peer reviewed

Tahereh Firoozi; Hamid Mohammadi; Mark J. Gierl – Journal of Educational Measurement, 2025

The purpose of this study is to describe and evaluate a multilingual automated essay scoring (AES) system for grading essays in three languages. Two different sentence embedding models were evaluated within the AES system, multilingual BERT (mBERT) and language-agnostic BERT sentence embedding (LaBSE). German, Italian, and Czech essays were…

Descriptors: College Students, Slavic Languages, German, Italian

Using Patterns of Summed Scores in Paper-and-Pencil Tests and Computer-Adaptive Tests to Detect Misfitting Item Score Patterns

Peer reviewed

Meijer, Rob R. – Journal of Educational Measurement, 2004

Two new methods have been proposed to determine unexpected sum scores on sub-tests (testlets) both for paper-and-pencil tests and computer adaptive tests. A method based on a conservative bound using the hypergeometric distribution, denoted p, was compared with a method where the probability for each score combination was calculated using a…

Descriptors: Probability, Adaptive Testing, Item Response Theory, Scores

Performance Modeling That Integrates Latent Trait and Class Theory.

Peer reviewed

Gitomer, Drew H.; Yamamoto, Kentaro – Journal of Educational Measurement, 1991

A model integrating latent trait and latent class theories in characterizing individual performance on the basis of qualitative understanding is presented. This HYBRID model is illustrated through experiments with 119 Air Force technicians taking a paper-and-pencil test and 136 Air Force technicians taking a computerized test. (SLD)

Descriptors: Comparative Testing, Computer Assisted Testing, Educational Assessment, Item Response Theory

A Comparison of the Performance of Simulated Hierarchical and Linear Testlets.

Peer reviewed

Wainer, Howard; And Others – Journal of Educational Measurement, 1992

Computer simulations were run to measure the relationship between testlet validity and factors of item pool size and testlet length for both adaptive and linearly constructed testlets. Making a testlet adaptive yields only modest increases in aggregate validity because of the peakedness of the typical proficiency distribution. (Author/SLD)

Descriptors: Adaptive Testing, Comparative Testing, Computer Assisted Testing, Computer Simulation

Relationships among Multiple-Choice and Open-Ended Analytical Questions.

Peer reviewed

Bridgeman, Brent; Rock, Donald A. – Journal of Educational Measurement, 1993

Exploratory and confirmatory factor analyses were used to explore relationships among existing item types and three new computer-administered item types for the analytical scale of the Graduate Record Examination General Test. Results with 349 students indicate which constructs the item types are measuring. (SLD)

Descriptors: College Entrance Examinations, College Students, Comparative Testing, Computer Assisted Testing

Effects of Practical Constraints on Item Selection Rules at the Early Stages of Computerized Adaptive Testing

Peer reviewed

Chen, Shu-Ying; Ankenman, Robert D. – Journal of Educational Measurement, 2004

The purpose of this study was to compare the effects of four item selection rules--(1) Fisher information (F), (2) Fisher information with a posterior distribution (FP), (3) Kullback-Leibler information with a posterior distribution (KP), and (4) completely randomized item selection (RN)--with respect to the precision of trait estimation and the…

Descriptors: Test Length, Adaptive Testing, Computer Assisted Testing, Test Selection

A Comparison of Quantitative Questions in Open-Ended and Multiple-Choice Formats.

Peer reviewed

Bridgeman, Brent – Journal of Educational Measurement, 1992

Examinees in a regular administration of the quantitative portion of the Graduate Record Examination responded to particular items in a machine-scannable multiple-choice format. Volunteers (n=364) used a computer to answer open-ended counterparts of these items. Scores for both formats demonstrated similar correlational patterns. (SLD)

Descriptors: Answer Sheets, College Entrance Examinations, College Students, Comparative Testing

A Comparison of Self-Adapted and Computerized Adaptive Tests.

Peer reviewed

Wise, Steven L.; And Others – Journal of Educational Measurement, 1992

Performance of 156 undergraduate and 48 graduate students on a self-adapted test (SFAT), in which students choose the difficulty level of their test items, was compared with performance on a computer-adapted test (CAT). Those taking the SFAT obtained higher ability scores and reported lower posttest state anxiety than did CAT takers. (SLD)

Descriptors: Adaptive Testing, Comparative Testing, Computer Assisted Testing, Difficulty Level