Publication Date
| In 2026 | 0 |
| Since 2025 | 215 |
| Since 2022 (last 5 years) | 1084 |
| Since 2017 (last 10 years) | 2594 |
| Since 2007 (last 20 years) | 4955 |
Descriptor
Source
Author
Publication Type
Education Level
Audience
| Practitioners | 653 |
| Teachers | 563 |
| Researchers | 250 |
| Students | 201 |
| Administrators | 81 |
| Policymakers | 22 |
| Parents | 17 |
| Counselors | 8 |
| Community | 7 |
| Support Staff | 3 |
| Media Staff | 1 |
| More ▼ | |
Location
| Turkey | 226 |
| Canada | 223 |
| Australia | 155 |
| Germany | 116 |
| United States | 99 |
| China | 90 |
| Florida | 86 |
| Indonesia | 82 |
| Taiwan | 78 |
| United Kingdom | 73 |
| California | 66 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 4 |
| Meets WWC Standards with or without Reservations | 4 |
| Does not meet standards | 1 |
Tannenbaum, Richard J. – 1994
A job analysis was conducted, focusing on the knowledge and abilities important for beginning French teachers. The results of the job analysis are to be used to define the content domain of the subject assessment in French for the Praxis series of professional assessments for beginning teachers. A domain of 212 knowledge statements and ability…
Descriptors: Administrators, Beginning Teachers, Cutting Scores, Educational Assessment
Powell, Z. Emily – 1992
Little research exists on the psychological impacts of computerized adaptive testing (CAT) and how it may affect test performance. Three CAT procedures were examined, in which items were selected to match students' achievement levels, from the item pool at random, or according to student choice of item difficulty levels. Twenty-four graduate…
Descriptors: Academic Achievement, Adaptive Testing, Comparative Testing, Computer Assisted Testing
Ackerman, Terry A.; Evans, John A. – 1992
The relationship between levels of reliability and the power of two bias and differential item functioning (DIF) detection methods is examined. Both methods, the Mantel-Haenszel (MH) procedure of P. W. Holland and D. T. Thayer (1988) and the Simultaneous Item Bias (SIB) procedure of R. Shealy and W. Stout (1991), use examinees' raw scores as a…
Descriptors: Comparative Analysis, Equations (Mathematics), Error of Measurement, Item Bias
Kim, Haeok; Plake, Barbara S. – 1993
A two-stage testing strategy is one method of adapting the difficulty of a test to an individual's ability level in an effort to achieve more precise measurement. A routing test provides an initial estimate of ability level, and a second-stage measurement test then evaluates the examinee further. The measurement accuracy and efficiency of item…
Descriptors: Ability, Adaptive Testing, Comparative Testing, Computer Assisted Testing
Anderson, Neil J. – 1993
The manual is designed to help Peace Corps language teachers design simple evaluation procedures that will: (1) help them select appropriate classroom activities; (2) encourage students to self-monitor their progress and take responsibility for learning; and (3) give insight into student aptitudes. An introductory chapter examines the relationship…
Descriptors: Adult Education, Formative Evaluation, French, Language Skills
Optimal Assembly of Educational and Psychological Tests, with a Bibliography. Research Report 98-05.
van der Linden, Wim J. – 1998
The advent of computers in educational and psychological measurement has lead to the need for algorithms for optimal assembly of tests from item banks. This paper reviews the literature on optimal test assembly and introduces the contributions to this report on the topic. Four different approaches to computerized test assembly are discussed:…
Descriptors: Algorithms, Computer Assisted Testing, Educational Testing, Equated Scores
Kromrey, Jeffrey D.; Bacon, Tina P. – 1992
A Monte Carlo study was conducted to estimate the small sample standard errors and statistical bias of psychometric statistics commonly used in the analysis of achievement tests. The statistics examined in this research were: (1) the index of item difficulty; (2) the index of item discrimination; (3) the corrected item-total point-biserial…
Descriptors: Achievement Tests, Comparative Analysis, Difficulty Level, Estimation (Mathematics)
Stansfield, Charles W.; And Others – 1992
This report describes the development, construction, and validation of the Preliminary Chinese Proficiency Test (Pre-CPT), a standardized, nationally-normed test of listening and reading comprehension for beginning-level native English-speaking learners of Chinese as a second language. The Pre-CPT was designed as a lower-level version of the…
Descriptors: Chinese, Higher Education, Language Proficiency, Language Tests
Lewis, J. C. – 1994
Whether boys and girls perform differently on mathematics estimation items with a picture format (applied context [AC] items) compared with items with a numbers-only (NC) format was studied when effects of computational skill, conceptual knowledge, and quantitative ability were controlled. Subjects were approximately 80,000 students from grades 4…
Descriptors: Comparative Analysis, Context Effect, Educational Assessment, Elementary Education
Missouri Univ., Columbia. Instructional Materials Lab. – 1993
This module contains a student manual and an instructor's manual for study of vocabulary for vocational education aimed at students with special needs. The student manual consists of quizzes that consist of matching and multiple-choice items that can be used to review the vocabulary of the unit as presented on a videotaped lesson. Answers to the…
Descriptors: Exceptional Persons, Learning Disabilities, Occupational Information, Secondary Education
Stone, Gregory Ethan; Lunz, Mary E. – 1994
This paper explores the comparability of item calibrations for three types of items: (1) text only; (2) text with photographs; and (3) text plus graphics when items are presented on written tests and computerized adaptive tests. Data are from five different medical technology certification examinations administered nationwide in 1993. The Rasch…
Descriptors: Adaptive Testing, Comparative Analysis, Computer Assisted Testing, Diagrams
Evans, John Andrew; Ackerman, Terry – 1994
The strengths of item-response theory (IRT) are used to examine the degree of information individual test items provide, as well as to investigate how the individual item types contribute to the overall measurement accuracy of the Illinois Goal Assessment Program (IGAP) reading test. Using the graded-response model of Samejima (1969), the amount…
Descriptors: Ability, Educational Diagnosis, Elementary Education, Elementary School Students
De Ayala, R. J.; And Others – 1991
The robustness of a partial credit (PC) model-based computerized adaptive test's (CAT's) ability estimation to items that did not fit the PC model was investigated. A CAT program was written based on the PC model. The program used maximum likelihood estimation of ability. Item selection was on the basis of information. The simulation terminated…
Descriptors: Adaptive Testing, Computer Assisted Testing, Equations (Mathematics), Error of Measurement
McKinley, Robert L.; Reckase, Mark D. – 1983
A latent trait model is described that is appropriate for use with tests that measure more than one dimension, and its application to both real and simulated test data is demonstrated. Procedures for estimating the parameters of the model are presented. The research objectives are to determine whether the two-parameter logistic model more…
Descriptors: Comparative Analysis, Data Analysis, Factor Analysis, Feasibility Studies
Wang, Xiaohui; Bradlow, Eric T.; Wainer, Howard – ETS Research Report Series, 2005
SCORIGHT is a very general computer program for scoring tests. It models tests that are made up of dichotomously or polytomously rated items or any kind of combination of the two through the use of a generalized item response theory (IRT) formulation. The items can be presented independently or grouped into clumps of allied items (testlets) or in…
Descriptors: Computer Assisted Testing, Statistical Analysis, Test Items, Bayesian Statistics

Peer reviewed
