Publication Date
In 2025 | 3 |
Since 2024 | 12 |
Since 2021 (last 5 years) | 41 |
Since 2016 (last 10 years) | 126 |
Since 2006 (last 20 years) | 395 |
Descriptor
Test Theory | 1161 |
Test Items | 261 |
Test Reliability | 252 |
Test Construction | 245 |
Test Validity | 245 |
Psychometrics | 181 |
Scores | 176 |
Item Response Theory | 165 |
Foreign Countries | 159 |
Item Analysis | 141 |
Statistical Analysis | 134 |
More ▼ |
Source
Author
Publication Type
Education Level
Location
United States | 17 |
United Kingdom (England) | 15 |
Canada | 14 |
Australia | 13 |
Turkey | 12 |
Sweden | 8 |
United Kingdom | 8 |
Netherlands | 7 |
Texas | 7 |
New York | 6 |
Taiwan | 6 |
More ▼ |
Laws, Policies, & Programs
No Child Left Behind Act 2001 | 4 |
Elementary and Secondary… | 3 |
Individuals with Disabilities… | 3 |
Assessments and Surveys
What Works Clearinghouse Rating
Wheeler, Patricia H. – 1993
A person's obtained score on a test provides an estimate of the individual's "true" score on that test. The obtained score is considered to have two parts, the true component and the error component. Classical test theory assumes that obtained scores for an individual over multiple administrations of the same test will lie symmetrically…
Descriptors: Cutting Scores, Error of Measurement, Scores, Statistical Distributions
Kenney, Patricia Ann – 1995
The purpose of this investigation was to develop a general framework for qualitatively analyzing the 1992 National Assessment of Educational Progress (NAEP) extended constructed-response questions. The framework dimensions were based on information about the NAEP extended questions and linked to important ideas in mathematics education and…
Descriptors: Constructed Response, Elementary School Students, Grade 4, Intermediate Grades
Vos, Hans J. – 1994
Some applications of Bayesian decision theory to intelligent tutoring systems are considered. How the problem of adapting the appropriate amount of instruction to the changing nature of a student's capabilities during the learning process can be situated in the general framework of Bayesian decision theory is discussed in the context of the…
Descriptors: Bayesian Statistics, Decision Making, Foreign Countries, Intelligent Tutoring Systems

Runco, Mark A.; And Others – 1987
This study examined four measures of creativity as predictors of mathematics and science performance in a program for talented high school students (N=29). Correlational analyses indicated that the How Do You Think Test (HDYT) and ratings on the Teachers' Evaluation of Students' Creativity (TESC) were predictive of the students' performance in the…
Descriptors: Creativity Tests, Educational Research, High Schools, Prediction
Mislevy, Robert J. – 1988
Large-scale educational assessments differ from familiar educational measurements by attempting to provide information about the levels and natures of skills in populations rather than in individuals. That the distinct purposes of assessment require different methodologies than individual measurement was recognized by the development of…
Descriptors: Educational Assessment, Evaluation Methods, Item Analysis, Latent Trait Theory

Lindstrom, Berner – 1983
The aim of this paper is to explore the Rasch model as a criterion of test homogeneity. Two empirical studies are presented to demonstrate this usage. From these studies it is argued that statistical tests of item characteristic curve (ICC) slopes are not sufficient in testing for heterogeneity. Tests of equality of ICC's over groups of subject…
Descriptors: Elementary Secondary Education, Latent Trait Theory, Mathematical Models, Multidimensional Scaling
Angoff, William H. – 1985
This paper points out that there are certain generalizations about directions for guessing and methods of scoring that require that data be derived from random groups design. It supports the viewpoint that it is neither sufficient nor appropriate to make such generalizations on the basis of an analysis of scores obtained from the answer sheets of…
Descriptors: Correlation, Guessing (Tests), Research Design, Scoring Formulas
Jones, Douglas H. – 1982
New ability estimators have been proposed by Wainer and Wright (1980) and Mislevy and Bock (1981) that are resistant against guessing and careless behaviors exhibited by some examinees. This paper presents another class of ability estimators that are resistant to departures from the underlying assumptions concerning guessing and carelessness. In…
Descriptors: Estimation (Mathematics), Guessing (Tests), Latent Trait Theory, Mathematical Models
Tatsuoka, Kikumi K.; Tatsuoka, Maurice M. – 1982
Several extended caution indices (ECIs) have been introduced earlier as a link between two distinctly different approaches: one based on standard statistics and the other, a model-based approach, utilizing item response theory (IRT). Expected values and variance of some ECIs are derived and their statistical properties are compared and discussed.…
Descriptors: Error Patterns, Higher Education, Latent Trait Theory, Models
Moy, Raymond – 1982
Score equating requires that the forms to be equated are functionally parallel. That is, the two test forms should rank order examinees in a similar fashion. In language proficiency testing situations, this assumption is often put into doubt because of the numerous tests that have been proposed as measures of language proficiency and the…
Descriptors: Equated Scores, Language Proficiency, Language Tests, Latent Trait Theory
Rogers, Bruce G. – 1984
The purpose of this study was to determine how cognitive style instructions to respondents of a survey instrument affected the resulting psychometric properties of the scale. It was hypothesized that a group which is instructed to carefully respond will have a mean score further from the neutral point, a larger standard deviation, and a higher…
Descriptors: Affective Measures, Educational Research, Higher Education, Psychometrics
Johanningmeier, E. V. – 1982
The career of Stuart Appleton Courtis in the growth of testing and educational measurement parallels the development of progressive education in the first half of the twentieth century. In 1909 he developed the standardized Courtis Arithmetic Test, Series A, the first objective test used in any city public schools. Continuing his work in testing,…
Descriptors: Biographies, Educational Researchers, Educational Testing, Elementary Secondary Education

Huynh, Huynh; Mandeville, Garrett K. – 1979
Assuming that the density p of the true ability theta in the binomial test score model is continuous in the closed interval (0, 1), a Bernstein polynomial can be used to uniformly approximate p. Then via quadratic programming techniques, least-square estimates may be obtained for the coefficients defining the polynomial. The approximation, in turn…
Descriptors: Cutting Scores, Error of Measurement, Least Squares Statistics, Mastery Tests

Rowley, Glenn – Journal of Educational Measurement, 1978
The reliabilities of various observational measures were determined, and the influence of both the number and the length of the observation periods on reliability was examined, both separately and jointly. A single simplifying assumption leads to a variant of the Spearman-Brown formula, which may have wider application. (Author/CTM)
Descriptors: Career Development, Classroom Observation Techniques, Observation, Reliability

Weiss, David J., Ed. – Applied Psychological Measurement, 1987
Issues concerning equating test scores are discussed in an introduction, four papers, and two commentaries. Equating methods research, sampling errors, linear equating, population differences, sources of equating errors, and a circular equating paradigm are considered. (SLD)
Descriptors: Equated Scores, Latent Trait Theory, Maximum Likelihood Statistics, Statistical Analysis