Jacobs, Lucy Cheser; Chase, Clinton I. – 1992
This book offers specific how-to advice to college faculty on every stage of the testing process, including planning the test and classifying objectives to be measured, ensuring the validity and reliability of the test, and grading in such a way as to arrive at fair grades based on relevant data. The book examines the strengths and weaknesses of…
Descriptors: Cheating, College Faculty, Comparative Analysis, Computer Assisted Testing
Garrido, Mariquita; Payne, David A. – 1987
Minimum competency cut-off scores on a statistics exam were estimated under four conditions: the Angoff judging method with item data available (n=20) and without (n=19), and the Modified Angoff method with (n=19) and without (n=19) item data available to judges. The Angoff method required free-response percentage estimates (0-100 percent),…
Descriptors: Academic Standards, Comparative Analysis, Criterion Referenced Tests, Cutting Scores
Laveault, Dany; And Others – 1983
The main purpose of this study was to evaluate the cognitive processes of Montagnais Indians under conditions that would reduce bias and allow for a contextual interpretation of the results. Fifty-eight Montagnais children were compared to French-Canadians of the same age and grade groups whose data had been collected through previous…
Descriptors: Canada Natives, Cognitive Tests, Comparative Testing, Cross Cultural Studies
Marsh, Herbert W. – 1986
The purpose of the present investigation was to develop a construct validity approach for testing whether the separation of positive and negative item subscales is substantively meaningful in self-concept research. Results from three published studies using the Self Description Questionnaire (SDQ) III were reanalyzed. The SDQ III measures 13…
Descriptors: Behavior Rating Scales, Construct Validity, Correlation, Foreign Countries
Wainer, Howard; Kiely, Gerard L. – 1986
Recent experience with the Computerized Adaptive Test (CAT) has raised a number of concerns about its practical applications. The concerns are principally involved with the concept of having the computer construct the test from a precalibrated item pool, and substituting statistical characteristics for the test developer's skills. Problems with…
Descriptors: Adaptive Testing, Algorithms, Computer Assisted Testing, Construct Validity
Roid, Gale H. – 1984
The goal of test item writing is to create measures of the true achievement dimensions desired, not extraneous abilities or related skills. To encourage university instructors to apply newer test item writing technologies to test construction for course examinations, this paper: (1) sets forth basic principles for quality item writing; (2) reviews…
Descriptors: Achievement Tests, Cognitive Measurement, College Faculty, College Instruction
Austin, James T. – Measurement and Evaluation in Counseling and Development, 1994 (peer reviewed)
Describes the development of the Minnesota Multiphasic Personality Inventory (MMPI). Details revisions in the MMPI and then evaluates positive and negative features of these revisions in the light of construct validity and professional practice. Argues that many suggested refinements to the instrument are being actively investigated by its…
Descriptors: Construct Validity, Diagnostic Tests, Measures (Individuals), Personality Assessment
Steinberg, Wendy J. – 1990
The purpose of this study was to examine the nature and degree of differences in expert versus novice knowledge structures, both before and after training, when judging the similarity of multiple-choice test items within a statistics and test theory (STT) domain. Subjects were employees of the Testing Division of the New York State Department of…
Descriptors: Adults, Cognitive Structures, Comparative Testing, Government Employees
Sullivan, Francis J. – 1986
A study examined how pragmatic form influences evaluation of student essays in university placement testing. Specifically, the study documented how patterns in students' use of information (assumed to be either old, inferable, or new for readers) affected the holistic scores for quality given to the essays. Subjects, 99 randomly selected entering…
Descriptors: College Freshmen, Essay Tests, Evaluation Criteria, Evaluation Methods
York Region Board of Education, Aurora (Ontario). – 1986
To determine whether students enrolled in one Ontario region's early French immersion (FI) programs developed English reading skills comparable to their non-FI peers, a monitoring process was begun in the first FI program year (grade 3) in which formal English instruction is given. The FI cohort and a control group matched for mental abilities and…
Descriptors: Comparative Analysis, Elementary Education, English, Foreign Countries
Weiss, David J., Ed. – 1985
This report contains the Proceedings of the 1982 Item Response Theory and Computerized Adaptive Testing Conference. The papers and their discussions are organized into eight sessions: (1) "Developments in Latent Trait Theory," with papers by Fumiko Samejima and Michael V. Levine; (2) "Parameter Estimation," with papers by…
Descriptors: Achievement Tests, Adaptive Testing, Branching, Computer Assisted Testing
Beard, John D., Ed.; McNabb, Scott E., Ed. – 1985
Intended for teachers, this collection of articles on testing in the English language arts contains the following titles: "What Do Test Scores 'Really' Mean in Educational Policy?" by George F. Madaus; "Testing and Literacy: A Contradiction in Terms?" by Marilyn Wilson; "Throwing in the TOWL," by Mary Jane Curry; "Taking the Authority Figure Out…
Descriptors: Bilingual Education, English Instruction, English (Second Language), Language Arts
Bormuth, John R. – 1978
The feasibility of criterion referenced testing is held to be dependent on the tenability of two postulates: (1) that bias can be controlled in a principled manner from one test to the next; and (2) that one mental process measured by such tests may lawfully interact with another. Without the first postulate, criterion scores could not be…
Descriptors: Achievement Tests, Career Development, Criterion Referenced Tests, Cutting Scores
Haladyna, Tom – 1976
The existence of criterion-referenced (CR) measurement is questioned in this paper. Despite beliefs that differences exist between two alternative forms of measurement, CR and Norm Referenced (NR), an analysis of philosophical and psychological descriptions of measurement, as well as a growing number of empirical studies, reveal that the common…
Descriptors: Academic Standards, Achievement Tests, Career Development, Comparative Analysis
Harris, Jimmy Carl; Clemmons, Sandra – 1996
This paper presents the results of a search for an appropriate test of critical thinking to screen college freshmen. The search was initiated in the Fall 1995 semester at an open-admissions comprehensive university, which normally assigns entering freshmen with ACT composite scores of 17 or less to…
Descriptors: College Entrance Examinations, College Freshmen, Compensatory Education, Critical Thinking


