Wainer, Howard; Kiely, Gerard L. – 1986
Recent experience with the Computerized Adaptive Test (CAT) has raised a number of concerns about its practical applications. The concerns principally involve having the computer construct the test from a precalibrated item pool and substituting statistical characteristics for the test developer's skills. Problems with…
Descriptors: Adaptive Testing, Algorithms, Computer Assisted Testing, Construct Validity
Roid, Gale H. – 1984
The goal of test item writing is to create measures of the true achievement dimensions desired, not extraneous abilities or related skills. To encourage university instructors to apply newer test item writing technologies to test construction for course examinations, this paper: (1) sets forth basic principles for quality item writing; (2) reviews…
Descriptors: Achievement Tests, Cognitive Measurement, College Faculty, College Instruction

Austin, James T. – Measurement and Evaluation in Counseling and Development, 1994
Describes the development of the Minnesota Multiphasic Personality Inventory (MMPI). Details revisions in the MMPI and then evaluates positive and negative features of these revisions in the light of construct validity and professional practice. Argues that many suggested refinements to the instrument are being actively investigated by its…
Descriptors: Construct Validity, Diagnostic Tests, Measures (Individuals), Personality Assessment
Steinberg, Wendy J. – 1990
The purpose of this study was to examine the nature and degree of differences in expert versus novice knowledge structures, both before and after training, when judging the similarity of multiple-choice test items within a statistics and test theory (STT) domain. Subjects were employees of the Testing Division of the New York State Department of…
Descriptors: Adults, Cognitive Structures, Comparative Testing, Government Employees
Sullivan, Francis J. – 1986
A study examined how pragmatic form influences evaluation of student essays in university placement testing. Specifically, the study documented how patterns in students' use of information (assumed to be either old, inferable, or new for readers) affected the holistic scores for quality given to the essays. Subjects, 99 randomly selected entering…
Descriptors: College Freshmen, Essay Tests, Evaluation Criteria, Evaluation Methods
York Region Board of Education, Aurora (Ontario). – 1986
To determine whether students enrolled in one Ontario region's early French immersion (FI) programs developed English reading skills comparable to their non-FI peers, a monitoring process was begun in the first FI program year (grade 3) in which formal English instruction is given. The FI cohort and a control group matched for mental abilities and…
Descriptors: Comparative Analysis, Elementary Education, English, Foreign Countries
Weiss, David J., Ed. – 1985
This report contains the Proceedings of the 1982 Item Response Theory and Computerized Adaptive Testing Conference. The papers and their discussions are organized into eight sessions: (1) "Developments in Latent Trait Theory," with papers by Fumiko Samejima and Michael V. Levine; (2) "Parameter Estimation," with papers by…
Descriptors: Achievement Tests, Adaptive Testing, Branching, Computer Assisted Testing
Beard, John D., Ed.; McNabb, Scott E., Ed. – 1985
Intended for teachers, this collection of articles on testing in the English language arts contains the following titles: "What Do Test Scores 'Really' Mean in Educational Policy?" by George F. Madaus; "Testing and Literacy: A Contradiction in Terms?" by Marilyn Wilson; "Throwing in the TOWL," by Mary Jane Curry; "Taking the Authority Figure Out…
Descriptors: Bilingual Education, English Instruction, English (Second Language), Language Arts
Bormuth, John R. – 1978
The feasibility of criterion referenced testing is held to be dependent on the tenability of two postulates: (1) that bias can be controlled in a principled manner from one test to the next; and (2) that one mental process measured by such tests may lawfully interact with another. Without the first postulate, criterion scores could not be…
Descriptors: Achievement Tests, Career Development, Criterion Referenced Tests, Cutting Scores
Haladyna, Tom – 1976
The existence of criterion-referenced (CR) measurement is questioned in this paper. Despite beliefs that differences exist between two alternative forms of measurement, CR and Norm Referenced (NR), an analysis of philosophical and psychological descriptions of measurement, as well as a growing number of empirical studies, reveal that the common…
Descriptors: Academic Standards, Achievement Tests, Career Development, Comparative Analysis
Harris, Jimmy Carl; Clemmons, Sandra – 1996
This paper presents the results of a search for an appropriate test of critical thinking to screen college freshmen. The search was initiated in the Fall 1995 semester at an open-admissions comprehensive university, which normally assigns entering freshmen with ACT composite scores of 17 or less to…
Descriptors: College Entrance Examinations, College Freshmen, Compensatory Education, Critical Thinking

Harris, Deborah J. – Journal of Educational Measurement, 1991
Two data collection designs, counterbalanced and spiraling (Angoff's Design I and Angoff's Design II), were compared using item response theory and equipercentile equating methodology in the vertical equating of two mathematics achievement tests using 1,000 eleventh graders and 1,000 twelfth graders. The greater stability of Design II is discussed.…
Descriptors: Achievement Tests, College Entrance Examinations, Comparative Analysis, Data Collection
College Entrance Examination Board, Princeton, NJ. – 1990
This guide is designed to provide essential background material about the College Board's Computerized Placement Tests (CPTs). It is recommended for administrators and staff alike. It contains the theory on which the tests are based, information concerning how to administer them, and discussions of the reports produced and how to interpret the…
Descriptors: Adaptive Testing, Algebra, Arithmetic, College Entrance Examinations
Simic, Marge; Smith, Carl, Comp. – 1990
Originally developed for the Department of Defense Schools (DoDDS) system, this learning package on changing perspective in reading assessment is designed for teachers who wish to upgrade or expand their teaching skills on their own. The package includes a comprehensive search of the ERIC database; a lecture giving an overview on the topic; the…
Descriptors: Distance Education, Elementary Secondary Education, Higher Education, Informal Reading Inventories
Siskind, Teri G.; Rose, Janet S. – 1986
The Charleston County School District (CCSD) has recently begun development of criterion-referenced tests (CRT) in different subject areas and for different grade levels. This paper outlines the process that CCSD followed in the development of math and language arts tests for grades one through eight and area exams for required high school…
Descriptors: Behavioral Objectives, Criterion Referenced Tests, Educational Objectives, Educational Testing