ERIC - Search Results

Publication Date

In 2025	1
Since 2024	1
Since 2021 (last 5 years)	1
Since 2016 (last 10 years)	1
Since 2006 (last 20 years)	1

Descriptor

College Students	11
Computer Assisted Testing	11
Higher Education	10
Test Items	7
Adaptive Testing	4
College Entrance Examinations	3
Comparative Testing	3
Multiple Choice Tests	3
Review (Reexamination)	3
Test Validity	3
Comparative Analysis	2
Computer Simulation	2
Item Response Theory	2
Mathematical Models	2
Psychometrics	2
Vocabulary	2
Ability	1
Academic Ability	1
Admission (School)	1
Adults	1
Answer Sheets	1
Attitudes	1
Black Students	1
Correlation	1
Equated Scores	1
More ▼

Source

Journal of Educational…

Author

Vispoel, Walter P.	5
Bleiler, Timothy	2
Bridgeman, Brent	2
Rock, Donald A.	2
Bennett, Randy Elliot	1
Cohen, Allan S.	1
Enright, Mary K.	1
Hamid Mohammadi	1
Hendrickson, Amy B.	1
Hirsch, Thomas M.	1
Mark J. Gierl	1
Rocklin, Thomas R.	1
Tahereh Firoozi	1
Wang, Tianyou	1
More ▼

Publication Type

Journal Articles	11
Reports - Research	11
Speeches/Meeting Papers	4

Education Level

Higher Education	1
Postsecondary Education	1

Audience

Location

Laws, Policies, & Programs

Assessments and Surveys

Graduate Record Examinations

What Works Clearinghouse Rating

Showing all 11 results Save | Export

Using Automated Procedures to Score Educational Essays Written in Three Languages

Peer reviewed

Direct link

Tahereh Firoozi; Hamid Mohammadi; Mark J. Gierl – Journal of Educational Measurement, 2025

The purpose of this study is to describe and evaluate a multilingual automated essay scoring (AES) system for grading essays in three languages. Two different sentence embedding models were evaluated within the AES system, multilingual BERT (mBERT) and language-agnostic BERT sentence embedding (LaBSE). German, Italian, and Czech essays were…

Descriptors: College Students, Slavic Languages, German, Italian

Can Examinees Use a Review Option To Obtain Positively Biased Ability Estimates on a Computerized Adaptive Test?

Peer reviewed

Vispoel, Walter P.; Rocklin, Thomas R.; Wang, Tianyou; Bleiler, Timothy – Journal of Educational Measurement, 1999

Investigated the effectiveness of H. Wainer's (1993) strategy for obtaining positively biased ability estimates when examinees can review and change answers on computerized adaptive tests. Results, based on simulation and testing data from 87 college students, show that the Wainer strategy sometimes yields inflated ability estimates and sometimes…

Descriptors: Ability, College Students, Computer Assisted Testing, Higher Education

Reviewing and Changing Answers on Computer-Adaptive and Self-Adaptive Vocabulary Tests.

Peer reviewed

Vispoel, Walter P. – Journal of Educational Measurement, 1998

Compared results from computer-adaptive and self-adaptive tests under conditions in which item review was and was not permitted for 379 college students. Results suggest that, when given the opportunity, most examinees will change answers, but usually only to a small portion of items, resulting in some benefit to the test taker. (SLD)

Descriptors: Adaptive Testing, College Students, Computer Assisted Testing, Higher Education

Limiting Answer Review and Change on Computerized Adaptive Vocabulary Tests: Psychometric and Attitudinal Results.

Peer reviewed

Vispoel, Walter P.; Hendrickson, Amy B.; Bleiler, Timothy – Journal of Educational Measurement, 2000

Evaluated the effectiveness of vocabulary computerized adaptive tests (CATs) with restricted review in a live testing setting involving 242 college students in which special efforts were made to increase test efficiency and reduce the possibility of obtaining positively biased proficiency estimates. Results suggest the efficacy of allowing limited…

Descriptors: Adaptive Testing, Attitudes, College Students, Computer Assisted Testing

Psychometric Characteristics of Computer-Adaptive and Self-Adaptive Vocabulary Tests: The Role of Answer Feedback and Test Anxiety.

Peer reviewed

Vispoel, Walter P. – Journal of Educational Measurement, 1998

Studied effects of administration mode [computer adaptive test (CAT) versus self-adaptive test (SAT)], item-by-item answer feedback, and test anxiety on results from computerized vocabulary tests taken by 293 college students. CATs were more reliable than SATs, and administration time was less when feedback was provided. (SLD)

Descriptors: Adaptive Testing, College Students, Computer Assisted Testing, Feedback

Improving Measurement for Graduate Admissions.

Peer reviewed

Enright, Mary K.; Rock, Donald A.; Bennett, Randy Elliot – Journal of Educational Measurement, 1998

Examined alternative-item types and section configurations for improving the discriminant and convergent validity of the Graduate Record Examination (GRE) general test using a computer-based test given to 388 examinees who had taken the GRE previously. Adding new variations of logical meaning appeared to decrease discriminant validity. (SLD)

Descriptors: Admission (School), College Entrance Examinations, College Students, Computer Assisted Testing

Relationships among Multiple-Choice and Open-Ended Analytical Questions.

Peer reviewed

Bridgeman, Brent; Rock, Donald A. – Journal of Educational Measurement, 1993

Exploratory and confirmatory factor analyses were used to explore relationships among existing item types and three new computer-administered item types for the analytical scale of the Graduate Record Examination General Test. Results with 349 students indicate constructs the item types are measuring. (SLD)

Descriptors: College Entrance Examinations, College Students, Comparative Testing, Computer Assisted Testing

Computerized Adaptive and Fixed-Item Testing of Music Listening Skill: A Comparison of Efficiency, Precision, and Concurrent Validity.

Peer reviewed

Vispoel, Walter P.; And Others – Journal of Educational Measurement, 1997

Efficiency, precision, and concurrent validity of results from adaptive and fixed-item music listening tests were studied using: (1) 2,200 simulated examinees; (2) 204 live examinees; and (3) 172 live examinees. Results support the usefulness of adaptive tests for measuring skills that require aurally produced items. (SLD)

Descriptors: Adaptive Testing, Adults, College Students, Comparative Analysis

Multidimensional Equating.

Peer reviewed

Hirsch, Thomas M. – Journal of Educational Measurement, 1989

Equatings were performed on both simulated and real data sets using common-examinee design and two abilities for each examinee. Results indicate that effective equating, as measured by comparability of true scores, is possible with the techniques used in this study. However, the stability of the ability estimates proved unsatisfactory. (TJH)

Descriptors: Academic Ability, College Students, Comparative Analysis, Computer Assisted Testing

A Comparison of Quantitative Questions in Open-Ended and Multiple-Choice Formats.

Peer reviewed

Bridgeman, Brent – Journal of Educational Measurement, 1992

Examinees in a regular administration of the quantitative portion of the Graduate Record Examination responded to particular items in a machine-scannable multiple-choice format. Volunteers (n=364) used a computer to answer open-ended counterparts of these items. Scores for both formats demonstrated similar correlational patterns. (SLD)

Descriptors: Answer Sheets, College Entrance Examinations, College Students, Comparative Testing

Influence of Prior Distributions on Detection of DIF.

Peer reviewed

Cohen, Allan S.; And Others – Journal of Educational Measurement, 1991

Detecting differential item functioning (DIF) on test items constructed to favor 1 group over another was investigated on parameter estimates from 2 item response theory-based computer programs--BILOG and LOGIST--using data for 1,000 White and 1,000 Black college students. Use of prior distributions and marginal-maximum a posteriori estimation is…

Descriptors: Black Students, College Students, Computer Assisted Testing, Equations (Mathematics)