ERIC - Search Results

Publication Date

In 2025	0
Since 2024	1
Since 2021 (last 5 years)	1
Since 2016 (last 10 years)	2
Since 2006 (last 20 years)	4

Descriptor

Comparative Testing	23
Test Format	23
Test Validity	23
Test Items	10
Higher Education	9
Test Reliability	9
Multiple Choice Tests	7
Test Construction	7
Computer Assisted Testing	6
High School Students	5
High Schools	5
Adults	4
Foreign Countries	4
Psychometrics	4
Undergraduate Students	4
Adaptive Testing	3
Item Response Theory	3
Scores	3
Ability	2
Construct Validity	2
Correlation	2
Grade Point Average	2
Instructional Effectiveness	2
Item Analysis	2
Males	2
More ▼

Source

Educational and Psychological…	3
Psychological Assessment	2
Anatomical Sciences Education	1
Behavior Research Methods,…	1
Educational Studies in…	1
Geographical Education	1
Journal of Economic Education	1
Journal of Educational…	1
Journal of Educational and…	1
Journal of Experimental…	1

Publication Type

Reports - Research	20
Journal Articles	13
Speeches/Meeting Papers	9
Reports - Evaluative	2
Reports - Descriptive	1

Education Level

Higher Education	1
Postsecondary Education	1
Secondary Education	1

Audience

Location

Canada	1
Netherlands	1
Saudi Arabia	1
United Kingdom (England)	1

Laws, Policies, & Programs

Assessments and Surveys

Embedded Figures Test	2
Armed Forces Qualification…	1
Armed Services Vocational…	1
Beck Depression Inventory	1
SAT (College Admission Test)	1
Wechsler Adult Intelligence…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 23 results Save | Export

A Two-Level Adaptive Test Battery

Peer reviewed

Direct link

Wim J. van der Linden; Luping Niu; Seung W. Choi – Journal of Educational and Behavioral Statistics, 2024

A test battery with two different levels of adaptation is presented: a within-subtest level for the selection of the items in the subtests and a between-subtest level to move from one subtest to the next. The battery runs on a two-level model consisting of a regular response model for each of the subtests extended with a second level for the joint…

Descriptors: Adaptive Testing, Test Construction, Test Format, Test Reliability

The Impact of National Examinations on Geography Teachers' Assessment Practices in the Netherlands

Peer reviewed
PDF on ERIC

Download full text

Bijsterbosch, Erik – Geographical Education, 2018

Geography teachers' school-based (internal) examinations in pre-vocational geography education in the Netherlands appear to be in line with the findings in the literature, namely that teachers' assessment practices tend to focus on the recall of knowledge. These practices are strongly influenced by national (external) examinations. This paper…

Descriptors: Foreign Countries, Instructional Effectiveness, National Competency Tests, Geography Instruction

The Validity of Multiple Choice Practical Examinations as an Alternative to Traditional Free Response Examination Formats in Gross Anatomy

Peer reviewed

Direct link

Shaibah, Hassan Sami; van der Vleuten, Cees P. M. – Anatomical Sciences Education, 2013

Traditionally, an anatomy practical examination is conducted using a free response format (FRF). However, this format is resource-intensive, as it requires a relatively large time investment from anatomy course faculty in preparation and grading. Thus, several interventions have been reported where the response format was changed to a selected…

Descriptors: Multiple Choice Tests, Anatomy, Medical Education, Test Validity

Implicit Aspects of Paper and Pencil Mathematics Assessment that Come to Light through the Use of the Computer

Peer reviewed

Direct link

Threlfall, John; Pool, Peter; Homer, Matthew; Swinnerton, Bronwen – Educational Studies in Mathematics, 2007

This article explores the effect on assessment of "translating" paper and pencil test items into their computer equivalents. Computer versions of a set of mathematics questions derived from the paper-based end of key stage 2 and 3 assessments in England were administered to age appropriate pupil samples, and the outcomes compared.…

Descriptors: Test Items, Student Evaluation, Foreign Countries, Test Validity

Multiple Choice and True-False: Reliability and Validity Compared.

Peer reviewed

Green, Kathy – Journal of Experimental Education, 1979

Reliabilities and concurrent validities of teacher-made multiple-choice and true-false tests were compared. No significant differences were found even when multiple-choice reliability was adjusted to equate testing time. (Author/MH)

Descriptors: Comparative Testing, Higher Education, Multiple Choice Tests, Test Format

Measuring Recognition Performance Using Computer-Based and Paper-Based Methods.

Peer reviewed

Federico, Pat-Anthony – Behavior Research Methods, Instruments, and Computers, 1991

Using a within-subjects design, computer-based and paper-based tests of aircraft silhouette recognition were administered to 83 male naval pilots and flight officers to determine the relative reliabilities and validities of 2 measurement modes. Relative reliabilities and validities of the two modes were contingent on the multivariate measurement…

Descriptors: Aircraft Pilots, Comparative Testing, Computer Assisted Testing, Males

A Missing Data Approach to Estimating Distributions of Scores for Optional Test Sections.

Allen, Nancy L.; And Others – 1992

Many testing programs include a section of optional questions in addition to mandatory parts of a test. These optional parts of a test are not often truly parallel to one another, and groups of examinees selecting each optional test section are not equivalent to one another. This paper provides a general method based on missing-data methods for…

Descriptors: Comparative Testing, Estimation (Mathematics), Graphs, Scaling

Development and Validation of a Computer-Administered Version of the Hamilton Anxiety Scale.

Peer reviewed

Kobak, Kenneth A.; And Others – Psychological Assessment, 1993

A developed computer-administered form of the Hamilton Anxiety Scale and the clinician form of the instrument were administered to 214 psychiatric outpatients and 78 community adults. Results support the reliability and validity of the computer-administered version as an alternative to the clinician-administered version. (SLD)

Descriptors: Adults, Anxiety, Clinical Diagnosis, Comparative Testing

Validating the Measurement and Structure of the Beck Depression Inventory across English and French Nonclinical Adolescents.

Download full text

Byrne, Barbara M.; Baron, Pierre – 1991

The aims of the present study were threefold: (1) to test for the equivalency of an hierarchical three-factor structure of the Beck Depression Inventory (BDI) across English and French versions for non-clinical adolescents; (2) given evidence of poor model fit, to validate the factorial structure of the BDI French version across three independent…

Descriptors: Adolescents, Comparative Testing, English, Factor Structure

The Effect of Negation and Polar Opposite Item Reversals on Questionnaire Reliability and Validity: An Experimental Investigation.

Peer reviewed

Schriesheim, Chester A.; And Others – Educational and Psychological Measurement, 1991

Effects of item wording on questionnaire reliability and validity were studied, using 280 undergraduate business students who completed a questionnaire comprising 4 item types: (1) regular; (2) polar opposite; (3) negated polar opposite; and (4) negated regular. Implications of results favoring regular and negated regular items are discussed. (SLD)

Descriptors: Business Education, Comparative Testing, Higher Education, Negative Forms (Language)

Assessing the Effects of Computer Administration on Scores and Parameter Estimates Using IRT Models.

Download full text

Sykes, Robert C.; And Others – 1991

To investigate the psychometric feasibility of replacing a paper-and-pencil licensing examination with a computer-administered test, a validity study was conducted. The computer-administered test (Cadm) was a common set of items for all test takers, distinct from computerized adaptive testing, in which test takers receive items appropriate to…

Descriptors: Adults, Certification, Comparative Testing, Computer Assisted Testing

Preliminary Report on a National Cross-Validation of the Computerized Adaptive Screening Test (CAST).

Download full text

Knapp, Deirdre J.; Pliske, Rebecca M. – 1986

A study was conducted to validate the Army's Computerized Adaptive Screening Test (CAST), using data from 2,240 applicants from 60 army recruiting stations across the nation. CAST is a computer-assisted adaptive test used to predict performance on the Armed Forces Qualification Test (AFQT). AFQT scores are computed by adding four subtest scores of…

Descriptors: Adaptive Testing, Adults, Aptitude Tests, Comparative Testing

Concurrent Validity of Three WAIS-R Short Forms in Psychiatric Inpatients.

Peer reviewed

Benedict, Ralph H. B.; And Others – Psychological Assessment, 1992

The concurrent validities of 3 short forms of the Wechsler Adult Intelligence Scale (WAIS) were compared for their prediction of full-scale IQ for 145 male and 159 female psychiatric inpatients. Results support previous research showing better predictive accuracy for L. C. Ward's (1990) seven-subtest short form than the others. (SLD)

Descriptors: Adults, Comparative Testing, Concurrent Validity, Cost Effectiveness

Multiple-Choice and Alternate-Choice Questions: Description and Analysis.

Download full text

Dowd, Steven B. – 1992

An alternative to multiple-choice (MC) testing is suggested as it pertains to the field of radiologic technology education. General principles for writing MC questions are given and contrasted with a new type of MC question, the alternate-choice (AC) question, in which the answer choices are embedded in the question in a short form that resembles…

Descriptors: Comparative Testing, Difficulty Level, Evaluation Methods, Higher Education

The Instructional Validity of Computer Administered Tests.

Download full text

Siskind, Theresa G.; And Others – 1992

The instructional validity of computer administered tests was studied with a focus on whether differences in test scores and item behavior are a function of instructional mode (computer versus non-computer). In the first of 3 studies, performance test scores for approximately 400 high school students in 1990-91 for tasks accomplished with the…

Descriptors: Comparative Testing, Comprehension, Computer Assisted Instruction, Computer Assisted Testing

Previous Page | Next Page »

Pages: 1 | 2

Melancon, Janet G.	2
Thompson, Bruce	2
Allen, Nancy L.	1
Baron, Pierre	1
Benedict, Ralph H. B.	1
Bijsterbosch, Erik	1
Byrne, Barbara M.	1
Chang, Lei	1
Dowd, Steven B.	1
Federico, Pat-Anthony	1
Green, Kathy	1
Homer, Matthew	1
Kinicki, Angelo J.	1
Knapp, Deirdre J.	1
Kobak, Kenneth A.	1
Luping Niu	1
Pliske, Rebecca M.	1
Pool, Peter	1
Robson, Denise	1
Schriesheim, Chester A.	1
Seung W. Choi	1
Shaibah, Hassan Sami	1
Siskind, Theresa G.	1
Stricker, Lawrence J.	1
More ▼