Publication Date
In 2025 | 0 |
Since 2024 | 1 |
Since 2021 (last 5 years) | 1 |
Since 2016 (last 10 years) | 2 |
Since 2006 (last 20 years) | 4 |
Descriptor
Source
Author
Melancon, Janet G. | 2 |
Thompson, Bruce | 2 |
Allen, Nancy L. | 1 |
Baron, Pierre | 1 |
Benedict, Ralph H. B. | 1 |
Bijsterbosch, Erik | 1 |
Byrne, Barbara M. | 1 |
Chang, Lei | 1 |
Dowd, Steven B. | 1 |
Federico, Pat-Anthony | 1 |
Green, Kathy | 1 |
More ▼ |
Publication Type
Reports - Research | 20 |
Journal Articles | 13 |
Speeches/Meeting Papers | 9 |
Reports - Evaluative | 2 |
Reports - Descriptive | 1 |
Education Level
Higher Education | 1 |
Postsecondary Education | 1 |
Secondary Education | 1 |
Audience
Laws, Policies, & Programs
Assessments and Surveys
Embedded Figures Test | 2 |
Armed Forces Qualification… | 1 |
Armed Services Vocational… | 1 |
Beck Depression Inventory | 1 |
SAT (College Admission Test) | 1 |
Wechsler Adult Intelligence… | 1 |
What Works Clearinghouse Rating
Wim J. van der Linden; Luping Niu; Seung W. Choi – Journal of Educational and Behavioral Statistics, 2024
A test battery with two different levels of adaptation is presented: a within-subtest level for the selection of the items in the subtests and a between-subtest level to move from one subtest to the next. The battery runs on a two-level model consisting of a regular response model for each of the subtests extended with a second level for the joint…
Descriptors: Adaptive Testing, Test Construction, Test Format, Test Reliability
Bijsterbosch, Erik – Geographical Education, 2018
Geography teachers' school-based (internal) examinations in pre-vocational geography education in the Netherlands appear to be in line with the findings in the literature, namely that teachers' assessment practices tend to focus on the recall of knowledge. These practices are strongly influenced by national (external) examinations. This paper…
Descriptors: Foreign Countries, Instructional Effectiveness, National Competency Tests, Geography Instruction
Shaibah, Hassan Sami; van der Vleuten, Cees P. M. – Anatomical Sciences Education, 2013
Traditionally, an anatomy practical examination is conducted using a free response format (FRF). However, this format is resource-intensive, as it requires a relatively large time investment from anatomy course faculty in preparation and grading. Thus, several interventions have been reported where the response format was changed to a selected…
Descriptors: Multiple Choice Tests, Anatomy, Medical Education, Test Validity
Threlfall, John; Pool, Peter; Homer, Matthew; Swinnerton, Bronwen – Educational Studies in Mathematics, 2007
This article explores the effect on assessment of "translating" paper and pencil test items into their computer equivalents. Computer versions of a set of mathematics questions derived from the paper-based end of key stage 2 and 3 assessments in England were administered to age appropriate pupil samples, and the outcomes compared.…
Descriptors: Test Items, Student Evaluation, Foreign Countries, Test Validity

Green, Kathy – Journal of Experimental Education, 1979
Reliabilities and concurrent validities of teacher-made multiple-choice and true-false tests were compared. No significant differences were found even when multiple-choice reliability was adjusted to equate testing time. (Author/MH)
Descriptors: Comparative Testing, Higher Education, Multiple Choice Tests, Test Format

Federico, Pat-Anthony – Behavior Research Methods, Instruments, and Computers, 1991
Using a within-subjects design, computer-based and paper-based tests of aircraft silhouette recognition were administered to 83 male naval pilots and flight officers to determine the relative reliabilities and validities of 2 measurement modes. Relative reliabilities and validities of the two modes were contingent on the multivariate measurement…
Descriptors: Aircraft Pilots, Comparative Testing, Computer Assisted Testing, Males
Allen, Nancy L.; And Others – 1992
Many testing programs include a section of optional questions in addition to mandatory parts of a test. These optional parts of a test are not often truly parallel to one another, and groups of examinees selecting each optional test section are not equivalent to one another. This paper provides a general method based on missing-data methods for…
Descriptors: Comparative Testing, Estimation (Mathematics), Graphs, Scaling

Kobak, Kenneth A.; And Others – Psychological Assessment, 1993
A developed computer-administered form of the Hamilton Anxiety Scale and the clinician form of the instrument were administered to 214 psychiatric outpatients and 78 community adults. Results support the reliability and validity of the computer-administered version as an alternative to the clinician-administered version. (SLD)
Descriptors: Adults, Anxiety, Clinical Diagnosis, Comparative Testing
Byrne, Barbara M.; Baron, Pierre – 1991
The aims of the present study were threefold: (1) to test for the equivalency of an hierarchical three-factor structure of the Beck Depression Inventory (BDI) across English and French versions for non-clinical adolescents; (2) given evidence of poor model fit, to validate the factorial structure of the BDI French version across three independent…
Descriptors: Adolescents, Comparative Testing, English, Factor Structure

Schriesheim, Chester A.; And Others – Educational and Psychological Measurement, 1991
Effects of item wording on questionnaire reliability and validity were studied, using 280 undergraduate business students who completed a questionnaire comprising 4 item types: (1) regular; (2) polar opposite; (3) negated polar opposite; and (4) negated regular. Implications of results favoring regular and negated regular items are discussed. (SLD)
Descriptors: Business Education, Comparative Testing, Higher Education, Negative Forms (Language)
Assessing the Effects of Computer Administration on Scores and Parameter Estimates Using IRT Models.
Sykes, Robert C.; And Others – 1991
To investigate the psychometric feasibility of replacing a paper-and-pencil licensing examination with a computer-administered test, a validity study was conducted. The computer-administered test (Cadm) was a common set of items for all test takers, distinct from computerized adaptive testing, in which test takers receive items appropriate to…
Descriptors: Adults, Certification, Comparative Testing, Computer Assisted Testing
Knapp, Deirdre J.; Pliske, Rebecca M. – 1986
A study was conducted to validate the Army's Computerized Adaptive Screening Test (CAST), using data from 2,240 applicants from 60 army recruiting stations across the nation. CAST is a computer-assisted adaptive test used to predict performance on the Armed Forces Qualification Test (AFQT). AFQT scores are computed by adding four subtest scores of…
Descriptors: Adaptive Testing, Adults, Aptitude Tests, Comparative Testing

Benedict, Ralph H. B.; And Others – Psychological Assessment, 1992
The concurrent validities of 3 short forms of the Wechsler Adult Intelligence Scale (WAIS) were compared for their prediction of full-scale IQ for 145 male and 159 female psychiatric inpatients. Results support previous research showing better predictive accuracy for L. C. Ward's (1990) seven-subtest short form than the others. (SLD)
Descriptors: Adults, Comparative Testing, Concurrent Validity, Cost Effectiveness
Dowd, Steven B. – 1992
An alternative to multiple-choice (MC) testing is suggested as it pertains to the field of radiologic technology education. General principles for writing MC questions are given and contrasted with a new type of MC question, the alternate-choice (AC) question, in which the answer choices are embedded in the question in a short form that resembles…
Descriptors: Comparative Testing, Difficulty Level, Evaluation Methods, Higher Education
Siskind, Theresa G.; And Others – 1992
The instructional validity of computer administered tests was studied with a focus on whether differences in test scores and item behavior are a function of instructional mode (computer versus non-computer). In the first of 3 studies, performance test scores for approximately 400 high school students in 1990-91 for tasks accomplished with the…
Descriptors: Comparative Testing, Comprehension, Computer Assisted Instruction, Computer Assisted Testing
Previous Page | Next Page ยป
Pages: 1 | 2