Publication Date
In 2025 | 3 |
Since 2024 | 12 |
Since 2021 (last 5 years) | 41 |
Since 2016 (last 10 years) | 126 |
Since 2006 (last 20 years) | 395 |
Descriptor
Test Theory | 1161 |
Test Items | 261 |
Test Reliability | 252 |
Test Construction | 245 |
Test Validity | 245 |
Psychometrics | 181 |
Scores | 176 |
Item Response Theory | 165 |
Foreign Countries | 159 |
Item Analysis | 141 |
Statistical Analysis | 134 |
More ▼ |
Source
Author
Publication Type
Education Level
Location
United States | 17 |
United Kingdom (England) | 15 |
Canada | 14 |
Australia | 13 |
Turkey | 12 |
Sweden | 8 |
United Kingdom | 8 |
Netherlands | 7 |
Texas | 7 |
New York | 6 |
Taiwan | 6 |
More ▼ |
Laws, Policies, & Programs
No Child Left Behind Act 2001 | 4 |
Elementary and Secondary… | 3 |
Individuals with Disabilities… | 3 |
Assessments and Surveys
What Works Clearinghouse Rating
Bell, Richard C. – Psychological Test Bulletin, 1991
A survey of 54 teachers of undergraduate and graduate psychological testing courses illustrates the teaching of testing in Australia. Tests covered in a course vary extensively. Intelligence testing is the most commonly taught (covered in 89 percent of the courses); most time on testing is spent in first-year postgraduate courses. (SLD)
Descriptors: College Curriculum, Course Content, Foreign Countries, Graduate Study
Martin, Janice E.; Janosik, Steven M. – NASPA Journal, 2004
A content analysis of 20 student conduct codes identified through stratified random sampling was performed to determine the extent to which legal terminology had been eliminated, as suggested by judicial affairs experts. The results showed that 80% of the codes selected in this study still contained some legal terms. These words and phrases are…
Descriptors: Ethics, Content Analysis, Sampling, Private Schools
Cotton, Sue M.; Crewther, David P.; Crewther, Sheila G. – Dyslexia, 2005
The diagnosis of developmental dyslexia (DD) is reliant on a discrepancy between intellectual functioning and reading achievement. Discrepancy-based formulae have frequently been employed to establish the significance of the difference between "intelligence" and "actual" reading achievement. These formulae, however, often fail to take into…
Descriptors: Intelligence, Dyslexia, Reading Achievement, Test Reliability
Slomp, David H.; Fuite, Jim – Assessing Writing, 2004
Specialists in the field of large-scale, high-stakes writing assessment have, over the last forty years alternately discussed the issue of maximizing either reliability or validity in test design. Factors complicating the debate--such as Messick's (1989) expanded definition of validity, and the ethical implications of testing--are explored. An…
Descriptors: Information Theory, Writing Evaluation, Writing Tests, Test Validity
Mislevy, Robert J.; And Others – 1991
The view of learning that underlies standard test theory is inconsistent with the view rapidly emerging from cognitive and educational psychology. Learners become more competent not simply by learning more facts and skills, but by reconfiguring their knowledge; by "chunking" information to reduce memory loads; and by developing…
Descriptors: Cognitive Psychology, Comprehension, Constructivism (Learning), Educational Assessment
Kehoe, Jerard – 1995
This digest describes some basics of the construction of multiple-choice tests. As a rule, the test maker should strive for test item stems (introductory questions or incomplete statements at the beginning of each item that are followed by the options) that are clear and parsimonious, answers that are unequivocal and chosen by the students who do…
Descriptors: Culture Fair Tests, Distractors (Tests), Educational Assessment, Item Bias
Espelage, Dorothy L.; Quittner, Alexandra L.; Kamps, Jodi – 1998
Generalizability theory (g-theory) was used, as an alternative to classical test theory, to evaluate measurement error in a behaviorally anchored role-play measure, highlighting the usefulness of this theory in instrument development. G-theory partitions an observed score into the universe score and error scores associated with separate sources of…
Descriptors: Behavior Patterns, Eating Disorders, Error of Measurement, Females
Baker, Eva L. – 1989
The renewed attention to assessments that attempt to capture complex aspects of educational attainments of students is explored. The definition and impetus for attention of the measurement community to higher order thinking skills are examined. Through a detailed description, a model assessment development process is presented. The process relies…
Descriptors: Cognitive Measurement, Educational Assessment, Educational Indicators, Elementary Secondary Education
North Carolina State Dept. of Public Instruction, Raleigh. Div. of Accountability Services/Research. – 1990
To facilitate the proper technical use of the test scores obtained from the administration of the tests, the curricular and psychometric characteristics of the tests are described in a series of technical manuals. This manual, the seventh in the series, contains a description of the characteristics of the North Carolina Test of Chemistry. The test…
Descriptors: Chemistry, Curriculum Evaluation, Science Education, Secondary Education
Seong, Tae-Je; Subkoviak, Michael J. – 1987
The purpose of this research was to reinvestigate the accuracy of three item bias detection procedures: (1) Linn and Harnisch's pseudo-IRT(Z) method; (2) Camilli's chi-square technique; and (3) Angoff's revised transformed item difficulty method. These methods are applied when the minority group sample size is too small to obtain stable estimates…
Descriptors: Blacks, Difficulty Level, Higher Education, Item Analysis
Thompson, Bruce; Borrello, Gloria M. – 1987
Attitude measures frequently produce distributions of item scores that attenuate interitem correlations and thus also distort findings regarding the factor structure underlying the items. An actual data set involving 260 adult subjects' responses to 55 items on the Love Relationships Scale is employed to illustrate empirical methods for…
Descriptors: Adults, Analysis of Covariance, Attitude Measures, Correlation
Jannarone, Robert J. – 1986
A variety of locally dependent models are introduced having individual difference parameters that may be interpreted as reflecting effective learning abilities. One version is a univariate extension of the Rasch model with a Markov property: the probability that a given individual will pass an item depends on previous items only through the…
Descriptors: Academic Aptitude, Bayesian Statistics, Cognitive Ability, Estimation (Mathematics)
Warren, Thomas S. – 1985
Although informal reading inventories are widely used, they are not without their shortcomings, regardless of whether they have been published commercially or have been constructed by the teacher. There are at least two significant weaknesses in inventories developed by the teacher: (1) passages selected randomly from the graded basal readers that…
Descriptors: Elementary Secondary Education, Informal Assessment, Informal Reading Inventories, Readability
Livingston, Samuel A. – 1986
This paper deals with test fairness regarding a test consisting of two parts: (1) a "common" section, taken by all students; and (2) a "variable" section, in which some students may answer a different set of questions from other students. For example, a test taken by several thousand students each year contains a common multiple-choice portion and…
Descriptors: Difficulty Level, Error of Measurement, Essay Tests, Mathematical Models
Yarnold, Paul R.; And Others – 1985
This paper reports on a short version of the Student Jenkins Activity Survey (JAS), a multiple choice questionnaire that measures Type A "coronary-prone" behavior in assessing subjects' A/B types. The primary objective was to determine if the short and long forms of the student JAS represent similar measurement instruments. A secondary…
Descriptors: Behavior Rating Scales, College Students, Comparative Testing, Factor Analysis