Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 1 |
Since 2016 (last 10 years) | 1 |
Since 2006 (last 20 years) | 6 |
Descriptor
Measurement Techniques | 39 |
Test Construction | 39 |
Test Theory | 39 |
Test Validity | 14 |
Psychometrics | 13 |
Educational Assessment | 10 |
Test Interpretation | 10 |
Test Reliability | 10 |
Testing Problems | 10 |
Test Items | 8 |
Test Use | 8 |
More ▼ |
Source
Author
Mislevy, Robert J. | 2 |
Airaisian, Peter W. | 1 |
Algina, James | 1 |
Bentler, P. M. | 1 |
Boothroyd, Roger A. | 1 |
Carroll, John B. | 1 |
Cheng, Britte H. | 1 |
Cliff, Norman | 1 |
Colker, Alexis M. | 1 |
Collins, Linda M. | 1 |
Crocker, Linda | 1 |
More ▼ |
Publication Type
Education Level
Elementary Secondary Education | 5 |
Higher Education | 2 |
Postsecondary Education | 1 |
Audience
Researchers | 4 |
Practitioners | 1 |
Students | 1 |
Teachers | 1 |
Laws, Policies, & Programs
Individuals with Disabilities… | 1 |
No Child Left Behind Act 2001 | 1 |
Assessments and Surveys
General Educational… | 1 |
Peabody Picture Vocabulary… | 1 |
SAT (College Admission Test) | 1 |
Test of English as a Foreign… | 1 |
What Works Clearinghouse Rating
Ibrahim Kasujja; Hugo Melgar-Quinonez; Joweria Nambooze – SAGE Open, 2023
Background: School feeding programs' evaluation requires the measurement of food insecurity, a more objective indicator, within school in low-income countries. The Global Child Nutrition Foundation (GCNF) uses subjective indicators to report school feeding coverage rates across many countries that participate in the global survey of school meal…
Descriptors: Hunger, Food, Program Effectiveness, Psychometrics
Sharkness, Jessica; DeAngelo, Linda – Research in Higher Education, 2011
This study compares the psychometric utility of Classical Test Theory (CTT) and Item Response Theory (IRT) for scale construction with data from higher education student surveys. Using 2008 Your First College Year (YFCY) survey data from the Cooperative Institutional Research Program at the Higher Education Research Institute at UCLA, two scales…
Descriptors: Student Surveys, Measures (Individuals), Psychometrics, Item Response Theory
Mislevy, Robert J.; Haertel, Geneva; Cheng, Britte H.; Ructtinger, Liliana; DeBarger, Angela; Murray, Elizabeth; Rose, David; Gravel, Jenna; Colker, Alexis M.; Rutstein, Daisy; Vendlinski, Terry – Educational Research and Evaluation, 2013
Standardizing aspects of assessments has long been recognized as a tactic to help make evaluations of examinees fair. It reduces variation in irrelevant aspects of testing procedures that could advantage some examinees and disadvantage others. However, recent attention to making assessment accessible to a more diverse population of students…
Descriptors: Testing Accommodations, Access to Education, Testing, Psychometrics
Elosua, Paula; Iliescu, Dragos – International Journal of Testing, 2012
Psychometric practice does not always converge with the advances of psychometric theory. In order to investigate this gap, the authors focus on the 10 most used psychological tests in Europe, as identified by recent surveys. The article analyzes test manuals published in 6 different European countries for these 10 most used tests. A total of 32…
Descriptors: Psychological Testing, Personality Measures, Error of Measurement, Foreign Countries
von Davier, Alina A. – Measurement: Interdisciplinary Research and Perspectives, 2010
The article "Thinking About Linking" by Newton (2010) presents a novel philosophical perspective on the way that educational assessments should be linked. Newton starts by describing the linking framework as it was characterized in various publications and identifies a cross-cultural dimension in the definitions and uses of test…
Descriptors: Foreign Countries, Educational Assessment, Student Evaluation, Evaluation Criteria

Collins, Linda M.; Cliff, Norman – Psychometrika, 1985
The axioms of a three-set Guttman simplex model are presented and the effects of relaxing the axioms for one of the three sets are examined. This model can be used to define longitudinal developmental scales. (NSF)
Descriptors: Mathematical Models, Measurement Techniques, Scaling, Test Construction

Bentler, P. M.; Woodward, Arthur J. – Psychometrika, 1980
A chain of lower bound inequalities leading to the greatest lower bound to reliability is established for the internal consistency of a composite of unit-weighted scores (such as a test). Algorithms for obtaining various reliability coefficients are presented. (Author/JKS)
Descriptors: Factor Analysis, Item Analysis, Measurement Techniques, Test Construction
Hayford, Paul D.; Salter, Ruth – 1978
Reading comprehension involves a number of distinctly different intellectual skills that can be assessed if the proper techniques are employed. As part of a reading assessment system, two measures of literal comprehension were developed: the Literal Comprehension Details Test (LCDT) and the Paraphrase Reading Test (PRT). Both the LCDT and the PRT…
Descriptors: Measurement Techniques, Reading Comprehension, Reading Tests, Test Construction

Loyd, Brenda H. – Applied Measurement in Education, 1988
The impact of item response theory (IRT) on the measurement practitioner is discussed, with a review of potential benefits. The complexity of IRT theory and procedures and the lack of robustness of IRT procedures to violation of assumptions must be recognized for the measurement practitioner to realize its advantages. (SLD)
Descriptors: Educational Researchers, Evaluation Methods, Evaluators, Latent Trait Theory

Stenner, A. Jackson; And Others – Journal of Educational Measurement, 1983
In an attempt to restore the symmetry and balance between the study of person and item variation, this paper presents a novel methodology construct specification equations, which allows one to ascertain from the lawful behavior of items what an instrument is measuring. (Author/PN)
Descriptors: Measurement Objectives, Measurement Techniques, Research Methodology, Test Construction
van den Brink, Wulfert – Evaluation in Education: International Progress, 1982
Binomial models for domain-referenced testing are compared, emphasizing the assumptions underlying the beta-binomial model. Advantages and disadvantages are discussed. A proposed item sampling model is presented which takes the effect of guessing into account. (Author/CM)
Descriptors: Comparative Analysis, Criterion Referenced Tests, Item Sampling, Measurement Techniques
Shaycoft, Marion F. – 1979
Focusing on the use of "paper and pencil" criterion-referenced tests in educational measurement, and to correct misconceptions, the definitions of basic terms and historical antecedents are discussed. Classifications of the tests are compared with other achievement tests. The phases in developing criterion-referenced tests are presented with the…
Descriptors: Achievement Tests, Criterion Referenced Tests, Educational Testing, Evaluation Methods
Mislevy, Robert J. – 1994
Test theory encompasses models and methods for drawing inferences about what students know and can do, cast in a framework of ideas from measurement, education, and psychology. The emerging paradigm of cognitive psychology prompts new considerations about collecting and interpreting evidence, suggesting alternative models for the nature,…
Descriptors: Alternative Assessment, Cognitive Psychology, Educational Assessment, Inferences

Fricke, Reiner; Luhmann, Reinhold – Studies in Educational Evaluation, 1983
On the basis of the characteristics of criterion-referenced tests, the contribution of German research to the development and application of criterion-referenced tests is discussed. (PN)
Descriptors: Criterion Referenced Tests, Item Analysis, Measurement Techniques, Models

Speer, David C.; Greenbaum, Paul E. – Journal of Consulting and Clinical Psychology, 1995
Currently there are at least four pretreatment-posttreatment (pre-post) difference score methods for determining client change. A fifth model, based on a random effects model and multiwave data, represents a growth curve approach and was hypothesized to be more sensitive to detecting significant (p<.05) change than pre-post models. Compares…
Descriptors: Behavior Change, Change, Counseling, Evaluation