Showing 1 to 15 of 18 results
Peer reviewed
Tahereh Firoozi; Hamid Mohammadi; Mark J. Gierl – Journal of Educational Measurement, 2025
The purpose of this study is to describe and evaluate a multilingual automated essay scoring (AES) system for grading essays in three languages. Two different sentence embedding models were evaluated within the AES system: multilingual BERT (mBERT) and language-agnostic BERT sentence embedding (LaBSE). German, Italian, and Czech essays were…
Descriptors: College Students, Slavic Languages, German, Italian
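As a rough illustration of the approach this entry describes, the sketch below embeds essays with the publicly available LaBSE checkpoint from the sentence-transformers library and fits a plain ridge regressor as the scorer. The essays, scores, and model choice are placeholders, not the authors' actual system.

```python
from sentence_transformers import SentenceTransformer
from sklearn.linear_model import Ridge
import numpy as np

# LaBSE maps text from many languages into one shared vector space, so a
# single regressor can score German, Italian, and Czech essays together.
encoder = SentenceTransformer("sentence-transformers/LaBSE")

essays = [
    "Ein kurzer Beispielaufsatz.",        # German (toy)
    "Un breve saggio di esempio.",        # Italian (toy)
    "Kratka ukazkova esej.",              # Czech (toy)
]
human_scores = np.array([3.0, 4.0, 2.0])  # placeholder rubric scores

X = encoder.encode(essays)                # (n_essays, 768) embedding matrix
scorer = Ridge(alpha=1.0).fit(X, human_scores)

new_essay = encoder.encode(["Dalsi esej k ohodnoceni."])
print(f"predicted score: {scorer.predict(new_essay)[0]:.2f}")
```

In practice such a scorer would be trained per language or jointly on held-out rater scores and evaluated with an agreement index such as quadratic-weighted kappa.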
Razieh Fathi – ProQuest LLC, 2021
This dissertation describes an experiment to investigate how learners with different levels of computer science background learn core concepts of the field, in particular algorithms. We designed a study to focus on cognitive task analysis for eliciting the empirical mental elements of learning two graph algorithms. Cognitive workload…
Descriptors: Undergraduate Students, Computer Science Education, Algorithms, Cognitive Development
Peer reviewed
Murray, Keith B.; Zdravkovic, Srdan – Journal of Education for Business, 2016
Considerable debate continues regarding the efficacy of the website RateMyProfessors.com (RMP). To date, however, virtually no direct experimental research has been reported that bears on questions of sampling adequacy or item adequacy underlying the favorable correlations that have been reported. The authors compare the data…
Descriptors: Computer Assisted Testing, Computer Software Evaluation, Student Evaluation of Teacher Performance, Item Analysis
Peer reviewed
Morrison, Keith – Educational Research and Evaluation, 2013
This paper reviews the literature comparing online and paper course evaluations in higher education and provides a case study of a very large randomised trial on the topic. It presents a mixed but generally optimistic picture of online course evaluations with respect to response rates, what they indicate, and how to increase them. The paper…
Descriptors: Literature Reviews, Course Evaluation, Case Studies, Higher Education
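For a sense of how the response-rate comparison in such a randomised trial might be analysed, here is a minimal two-proportion z-test sketch with invented counts; it is not the analysis reported in the paper.

```python
import numpy as np
from statsmodels.stats.proportion import proportions_ztest

# Hypothetical counts: responses received out of students invited,
# for an online and a paper administration of the same evaluation.
responded = np.array([412, 655])   # online, paper
invited = np.array([900, 900])

z, p = proportions_ztest(responded, invited)
print(f"online rate = {responded[0] / invited[0]:.2f}, "
      f"paper rate = {responded[1] / invited[1]:.2f}, z = {z:.2f}, p = {p:.4f}")
```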
Peer reviewed
Lissitz, Robert W.; Hou, Xiaodong; Slater, Sharon Cadman – Journal of Applied Testing Technology, 2012
This article investigates several questions regarding the impact of different item formats on measurement characteristics. Constructed response (CR) items and multiple choice (MC) items obviously differ in their formats and in the resources needed to score them. As such, they have been the subject of considerable discussion regarding the impact of…
Descriptors: Computer Assisted Testing, Scoring, Evaluation Problems, Psychometrics
Peer reviewed
Kapes, Jerome T.; Vansickle, Timothy R. – Measurement and Evaluation in Counseling and Development, 1992
Examined equivalence of mode of administration of the Career Decision-Making System, comparing paper-and-pencil version and computer-based version. Findings from 61 undergraduate students indicated that the computer-based version was significantly more reliable than paper-and-pencil version and was generally equivalent in other respects.…
Descriptors: Comparative Testing, Computer Assisted Testing, Higher Education, Test Format
Youngjohn, James R.; And Others – 1991
Test-retest reliabilities and practice effect magnitudes were considered for nine computer-simulated tasks of everyday cognition and five traditional neuropsychological tests. The nine simulated everyday memory tests were from the Memory Assessment Clinic battery as follows: (1) simple reaction time while driving; (2) divided attention (driving…
Descriptors: Adults, Comparative Testing, Computer Assisted Testing, Computer Simulation
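The two quantities this entry focuses on, test-retest reliability and practice effect magnitude, can be illustrated with a short sketch on simulated scores (the data below are not from the study):

```python
import numpy as np
from scipy import stats

# Simulated scores from two testing sessions; not data from the study.
rng = np.random.default_rng(0)
session1 = rng.normal(50, 10, size=40)
session2 = session1 + rng.normal(3, 5, size=40)   # correlated, with a practice gain

r, _ = stats.pearsonr(session1, session2)         # test-retest reliability
gain = session2 - session1
practice_d = gain.mean() / gain.std(ddof=1)       # practice effect in SD units
print(f"test-retest r = {r:.2f}, practice effect d = {practice_d:.2f}")
```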
Peer reviewed
Federico, Pat-Anthony – Behavior Research Methods, Instruments, and Computers, 1991
Using a within-subjects design, computer-based and paper-based tests of aircraft silhouette recognition were administered to 83 male naval pilots and flight officers to determine the relative reliabilities and validities of the two measurement modes. Relative reliabilities and validities of the two modes were contingent on the multivariate measurement…
Descriptors: Aircraft Pilots, Comparative Testing, Computer Assisted Testing, Males
Peer reviewed
Kobak, Kenneth A.; And Others – Psychological Assessment, 1993
A newly developed computer-administered form of the Hamilton Anxiety Scale and the clinician-administered form of the instrument were administered to 214 psychiatric outpatients and 78 community adults. Results support the reliability and validity of the computer-administered version as an alternative to the clinician-administered version. (SLD)
Descriptors: Adults, Anxiety, Clinical Diagnosis, Comparative Testing
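A mode-equivalence check of the kind this entry reports might be sketched as follows, using simulated Hamilton Anxiety Scale totals rather than the study's data: the correlation between versions indexes agreement, and a paired t-test probes a mean difference between modes.

```python
import numpy as np
from scipy import stats

# Simulated Hamilton Anxiety Scale totals (0-56) for the same respondents
# under the two administration modes; not the study's data.
rng = np.random.default_rng(1)
clinician = rng.normal(14, 6, size=100).clip(0, 56)
computer = (clinician + rng.normal(0, 3, size=100)).clip(0, 56)

r, _ = stats.pearsonr(clinician, computer)   # cross-mode agreement
t, p = stats.ttest_rel(clinician, computer)  # mean difference between modes
print(f"r = {r:.2f}, paired t = {t:.2f}, p = {p:.3f}")
```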
Anderson, Paul S.; Hyers, Albert D. – 1991
Three descriptive statistics (difficulty, discrimination, and reliability) of multiple-choice (MC) test items were compared to those of a new (1980s) format of machine-scored questions. The new method, answer-bank multi-digit testing (MDT), uses alphabetized lists of up to 1,000 alternatives and approximates the completion style of assessment…
Descriptors: College Students, Comparative Testing, Computer Assisted Testing, Correlation
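The three classical item statistics named in this entry can be computed from a 0/1 response matrix as in the sketch below; the simulated Rasch-style data stand in for real MC or MDT responses.

```python
import numpy as np

# Simulate a 0/1 response matrix (rows = examinees, columns = items)
# from a simple Rasch-style model so the statistics look plausible.
rng = np.random.default_rng(2)
ability = rng.normal(size=(200, 1))
item_difficulty = rng.uniform(-1, 1, size=(1, 20))
p_correct = 1 / (1 + np.exp(-(ability - item_difficulty)))
responses = (rng.random((200, 20)) < p_correct).astype(int)

difficulty = responses.mean(axis=0)          # proportion correct per item
total = responses.sum(axis=1)
rest = total[:, None] - responses            # total score excluding the item itself
discrimination = np.array([
    np.corrcoef(responses[:, j], rest[:, j])[0, 1]   # corrected item-total correlation
    for j in range(responses.shape[1])
])

k = responses.shape[1]
alpha = (k / (k - 1)) * (1 - responses.var(axis=0).sum() / total.var())  # KR-20 / coefficient alpha
print(difficulty.round(2), discrimination.round(2), round(float(alpha), 2))
```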
Peer reviewed
Kimball, James C. – Journal of Employment Counseling, 1988
Developed paper-and-pencil and microcomputer versions of prototype occupational interest inventory for academically disadvantaged or functionally illiterate adults. Compared results obtained from 30 such adults on the United States Employment Service Interest Inventory and both versions of the prototype inventory. Results revealed acceptable…
Descriptors: Adult Literacy, Adults, Comparative Testing, Computer Assisted Testing
Peer reviewed
Stone, Clement A.; Lane, Suzanne – Applied Measurement in Education, 1991
A model-testing approach for evaluating the stability of item response theory item parameter estimates (IPEs) in a pretest-posttest design is illustrated. Nineteen items from the Head Start Measures Battery were used. A moderately high degree of stability in the IPEs for 5,510 children assessed on 2 occasions was found. (TJH)
Descriptors: Comparative Testing, Compensatory Education, Computer Assisted Testing, Early Childhood Education
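A minimal version of the stability check described here correlates item difficulty estimates from the two occasions and summarises their drift; the 19 parameter values below are simulated placeholders, not Head Start Measures Battery estimates.

```python
import numpy as np

# Simulated difficulty estimates for 19 items on two occasions; the
# values are placeholders, not Head Start Measures Battery estimates.
rng = np.random.default_rng(3)
b_time1 = rng.normal(0, 1, size=19)
b_time2 = b_time1 + rng.normal(0, 0.15, size=19)   # small occasion-to-occasion drift

stability_r = np.corrcoef(b_time1, b_time2)[0, 1]  # correlation across occasions
rmsd = np.sqrt(np.mean((b_time1 - b_time2) ** 2))  # average drift in logits
print(f"stability r = {stability_r:.2f}, RMSD = {rmsd:.2f} logits")
```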
Lunz, Mary E.; And Others – 1990
This study explores the test-retest consistency of computer adaptive tests of varying lengths. The testing model used was designed as a mastery model to determine whether an examinee's estimated ability level is above or below a pre-established criterion expressed in the metric (logits) of the calibrated item pool scale. The Rasch model was used…
Descriptors: Ability Identification, Adaptive Testing, College Students, Comparative Testing
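The mastery logic this entry describes can be sketched as a Rasch ability estimate in logits compared, together with its standard error, against a pre-set criterion; the item difficulties, responses, and cut score below are invented for illustration.

```python
import numpy as np

def rasch_ability(responses, difficulties, iters=20):
    """Maximum-likelihood ability estimate (and its SE) under the Rasch model."""
    theta = 0.0
    info = 1.0
    for _ in range(iters):
        p = 1 / (1 + np.exp(-(theta - difficulties)))   # P(correct) for each item
        info = np.sum(p * (1 - p))                      # test information at theta
        theta += (responses.sum() - p.sum()) / info     # Newton-Raphson step
    return theta, 1 / np.sqrt(info)

# Invented calibrated item difficulties, responses, and cut score (logits).
difficulties = np.array([-1.2, -0.5, 0.0, 0.4, 0.9, 1.3])
responses = np.array([1, 1, 1, 0, 1, 0])
cut = 0.0

theta, se = rasch_ability(responses, difficulties)
if theta - 1.65 * se > cut:
    decision = "master"
elif theta + 1.65 * se < cut:
    decision = "non-master"
else:
    decision = "keep testing"
print(f"theta = {theta:.2f} logits (SE {se:.2f}) -> {decision}")
```

In an adaptive mastery test of this kind, items keep being selected until the confidence band around theta falls entirely above or below the cut score, which is how test length interacts with decision consistency.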
Vispoel, Walter P.; And Others – 1992
The effects of review options (the opportunity for examinees to review and change answers) on the magnitude, reliability, efficiency, and concurrent validity of scores obtained from three types of computerized vocabulary tests (fixed item, adaptive, and self-adapted) were studied. Subjects were 97 college students at a large midwestern university…
Descriptors: Adaptive Testing, College Students, Comparative Testing, Computer Assisted Testing
Weiss, David J., Ed. – 1980
This report is the Proceedings of the third conference of its type. Included are 23 of the 25 papers presented at the conference, discussion of these papers by invited discussants, and symposium papers by a group of leaders in adaptive testing and latent trait test theory research and applications. The papers are organized into the following…
Descriptors: Academic Ability, Academic Achievement, Comparative Testing, Computer Assisted Testing