ERIC - Search Results

Showing 1 to 15 of 26 results

Using Automated Procedures to Score Educational Essays Written in Three Languages

Peer reviewed

Tahereh Firoozi; Hamid Mohammadi; Mark J. Gierl – Journal of Educational Measurement, 2025

The purpose of this study is to describe and evaluate a multilingual automated essay scoring (AES) system for grading essays in three languages. Two different sentence embedding models were evaluated within the AES system, multilingual BERT (mBERT) and language-agnostic BERT sentence embedding (LaBSE). German, Italian, and Czech essays were…

Descriptors: College Students, Slavic Languages, German, Italian

Differentiating Functional Connectivity Patterns in ADHD and Autism among the Young People: A Machine Learning Solution

Peer reviewed

Bernis Sütçübasi; Tugçe Balli; Herbert Roeyers; Jan R. Wiersema; Sami Çamkerten; Ozan Cem Öztürk; Baris Metin; Edmund Sonuga-Barke – Journal of Attention Disorders, 2025

Objective: ADHD and autism are complex and frequently co-occurring neurodevelopmental conditions with shared etiological and pathophysiological elements. In this paper, we attempt to differentiate these conditions among the young people in terms of intrinsic patterns of brain connectivity revealed during resting state using machine learning…

Descriptors: Elementary School Students, Secondary School Students, Attention Deficit Hyperactivity Disorder, Autism Spectrum Disorders

The Application of Cognitive Task Analysis and Cognitive Load Methods in the Process of Learning Algorithms

Razieh Fathi – ProQuest LLC, 2021

This dissertation describes an experiment to investigate how learners with different levels of background in computer science learn core concepts of computer science, in particular, algorithms. We designed a study to focus on cognitive task analysis for eliciting the empirical mental elements of learning two graph algorithms. Cognitive workload…

Descriptors: Undergraduate Students, Computer Science Education, Algorithms, Cognitive Development

Does MTV Really Do a Good Job of Evaluating Professors? An Empirical Test of the Internet Site Ratemyprofessors.com

Peer reviewed

Murray, Keith B.; Zdravkovic, Srdan – Journal of Education for Business, 2016

Considerable debate continues regarding the efficacy of the website RateMyProfessors.com (RMP). To date, however, virtually no direct, experimental research has been reported which directly bears on questions relating to sampling adequacy or item adequacy in producing what favorable correlations have been reported. The authors compare the data…

Descriptors: Computer Assisted Testing, Computer Software Evaluation, Student Evaluation of Teacher Performance, Item Analysis

College Math Assessment: SAT Scores vs. College Math Placement Scores

Peer reviewed

Foley-Peres, Kathleen; Poirier, Dawn – Educational Research Quarterly, 2008

Many colleges and universities use SAT math scores or math placement tests to place students in the appropriate math course. This study compares the use of math placement scores and SAT scores for 188 freshman students. The students' grades and faculty observations were analyzed to determine if the SAT scores and/or college math assessment scores…

Descriptors: Educational Indicators, Student Placement, Achievement Tests, Standardized Tests

Computerizing Organizational Attitude Surveys: An Investigation of the Measurement Equivalence of a Multifaceted Job Satisfaction Measure

Peer reviewed

Mueller, Karsten; Liebig, Christian; Hattrup, Keith – Educational and Psychological Measurement, 2007

Two quasi-experimental field studies were conducted to evaluate the psychometric equivalence of computerized and paper-and-pencil job satisfaction measures. The present research extends previous work in the area by providing better control of common threats to validity in quasi-experimental research on test mode effects and by evaluating a more…

Descriptors: Psychometrics, Field Studies, Job Satisfaction, Computer Assisted Testing

Implicit Aspects of Paper and Pencil Mathematics Assessment that Come to Light through the Use of the Computer

Peer reviewed

Threlfall, John; Pool, Peter; Homer, Matthew; Swinnerton, Bronwen – Educational Studies in Mathematics, 2007

This article explores the effect on assessment of "translating" paper and pencil test items into their computer equivalents. Computer versions of a set of mathematics questions derived from the paper-based end of key stage 2 and 3 assessments in England were administered to age appropriate pupil samples, and the outcomes compared.…

Descriptors: Test Items, Student Evaluation, Foreign Countries, Test Validity

A Structural Comparison of Conventional and Adaptive Versions of the ASVAB.

Peer reviewed

Cudeck, Robert – Multivariate Behavioral Research, 1985

Twelve structural models of similarity were fitted to data from conventional and computer adaptive test (CAT) batteries measuring the same aptitude in a double cross-validation design. Three of the 12 models, including a multiplicative structure model, performed well, providing support for using CATs as replacements for conventional tests. (NSF)

Descriptors: Adaptive Testing, Aptitude Tests, Comparative Testing, Computer Assisted Testing

Measuring Recognition Performance Using Computer-Based and Paper-Based Methods.

Peer reviewed

Federico, Pat-Anthony – Behavior Research Methods, Instruments, and Computers, 1991

Using a within-subjects design, computer-based and paper-based tests of aircraft silhouette recognition were administered to 83 male naval pilots and flight officers to determine the relative reliabilities and validities of 2 measurement modes. Relative reliabilities and validities of the two modes were contingent on the multivariate measurement…

Descriptors: Aircraft Pilots, Comparative Testing, Computer Assisted Testing, Males

Development and Validation of a Computer-Administered Version of the Hamilton Anxiety Scale.

Peer reviewed

Kobak, Kenneth A.; And Others – Psychological Assessment, 1993

A computer-administered form of the Hamilton Anxiety Scale was developed, and both it and the clinician form of the instrument were administered to 214 psychiatric outpatients and 78 community adults. Results support the reliability and validity of the computer-administered version as an alternative to the clinician-administered version. (SLD)

Descriptors: Adults, Anxiety, Clinical Diagnosis, Comparative Testing

A Comparison of the Performance of Simulated Hierarchical and Linear Testlets.

Peer reviewed

Wainer, Howard; And Others – Journal of Educational Measurement, 1992

Computer simulations were run to measure the relationship between testlet validity and factors of item pool size and testlet length for both adaptive and linearly constructed testlets. Making a testlet adaptive yields only modest increases in aggregate validity because of the peakedness of the typical proficiency distribution. (Author/SLD)

Descriptors: Adaptive Testing, Comparative Testing, Computer Assisted Testing, Computer Simulation

Examining the Relationship between Student Scores on the National Council Licensing Examination for Registered Nurses (NCLEX-RN) and the Computer Adaptive Test (CAT)

Peer reviewed

Laird, Barbara B. – Inquiry, 2003

Laird studies the relationship between two computerized nursing tests and finds a relationship between the two sets of scores. (Contains 2 tables.)

Descriptors: Nursing Education, Nurses, Computer Assisted Testing, Comparative Testing

Quantitative Comparisons of Difficulty, Discrimination and Reliability of Machine-Scored Completion Items and Tests (in the MDT Un-Cued Answer-Bank Format) in Contrast with Statistics from Comparable Multiple Choice Questions: The First Round of Results.

Anderson, Paul S.; Hyers, Albert D. – 1991

Three descriptive statistics (difficulty, discrimination, and reliability) of multiple-choice (MC) test items were compared to those of a new (1980s) format of machine-scored questions. The new method, answer-bank multi-digit testing (MDT), uses alphabetized lists of up to 1,000 alternatives and approximates the completion style of assessment…

Descriptors: College Students, Comparative Testing, Computer Assisted Testing, Correlation

Assessing the Effects of Computer Administration on Scores and Parameter Estimates Using IRT Models.

Sykes, Robert C.; And Others – 1991

To investigate the psychometric feasibility of replacing a paper-and-pencil licensing examination with a computer-administered test, a validity study was conducted. The computer-administered test (Cadm) was a common set of items for all test takers, distinct from computerized adaptive testing, in which test takers receive items appropriate to…

Descriptors: Adults, Certification, Comparative Testing, Computer Assisted Testing

Preliminary Report on a National Cross-Validation of the Computerized Adaptive Screening Test (CAST).

Knapp, Deirdre J.; Pliske, Rebecca M. – 1986

A study was conducted to validate the Army's Computerized Adaptive Screening Test (CAST), using data from 2,240 applicants from 60 army recruiting stations across the nation. CAST is a computer-assisted adaptive test used to predict performance on the Armed Forces Qualification Test (AFQT). AFQT scores are computed by adding four subtest scores of…

Descriptors: Adaptive Testing, Adults, Aptitude Tests, Comparative Testing
