Chakrabartty, Satyendra Nath – International Journal of Psychology and Educational Studies, 2021
The paper proposes new measures of the difficulty and discriminating values of binary items and of tests consisting of such items, and finds their relationships, including estimation of test error variance and thereby test reliability, as per its definition, using cosine similarities. The measures use the entire data. Difficulty value of test and item is defined…
Descriptors: Test Items, Difficulty Level, Scores, Test Reliability
Kaya Uyanik, Gulden; Demirtas Tolaman, Tugba; Gur Erdogan, Duygu – International Journal of Assessment Tools in Education, 2021
This paper aims to examine and assess the questions included in the "Turkish Common Exam" for sixth graders held in the first semester of 2018, one of the common exams carried out by the Measurement and Evaluation Centers, in terms of question structure, quality, and taxonomic value. To this end, the test questions were examined…
Descriptors: Foreign Countries, Grade 6, Standardized Tests, Test Items
Ibrahim Kasujja; Hugo Melgar-Quinonez; Joweria Nambooze – SAGE Open, 2023
Background: Evaluating school feeding programs requires measuring food insecurity, a more objective indicator, within schools in low-income countries. The Global Child Nutrition Foundation (GCNF) uses subjective indicators to report school feeding coverage rates across many countries that participate in the global survey of school meal…
Descriptors: Hunger, Food, Program Effectiveness, Psychometrics
Longford, Nicholas T. – Journal of Educational and Behavioral Statistics, 2014
A method for medical screening is adapted to differential item functioning (DIF). Its essential elements are explicit declarations of the level of DIF that is acceptable and of the loss function that quantifies the consequences of the two kinds of inappropriate classification of an item. Instead of a single level and a single function, sets of…
Descriptors: Test Items, Test Bias, Simulation, Hypothesis Testing
Engelhard, George, Jr.; Wind, Stefanie A. – College Board, 2013
The major purpose of this study is to examine the quality of ratings assigned to CR (constructed-response) questions in large-scale assessments from the perspective of Rasch Measurement Theory. Rasch Measurement Theory provides a framework for the examination of rating scale category structure that can yield useful information for interpreting the…
Descriptors: Measurement Techniques, Rating Scales, Test Theory, Scores
Almehrizi, Rashid S. – Applied Psychological Measurement, 2013
The majority of large-scale assessments develop various score scales that are either linear or nonlinear transformations of raw scores for better interpretations and uses of assessment results. The current formula for coefficient alpha (α; the commonly used reliability coefficient) only provides internal consistency reliability estimates of raw…
Descriptors: Raw Scores, Scaling, Reliability, Computation
van der Linden, Wim J. – Journal of Educational Measurement, 2009
Two different traditions of response-time (RT) modeling are reviewed: the tradition of distinct models for RTs and responses, and the tradition of model integration in which RTs are incorporated in response models or the other way around. Several conceptual issues underlying both traditions are made explicit and analyzed for their consequences. We…
Descriptors: Test Items, Models, Reaction Time, Measurement
von Davier, Matthias – Measurement: Interdisciplinary Research and Perspectives, 2009
In this commentary, the author points out a few issues, one being that there are models mislabeled as diagnostic, which deal with linear decompositions of item difficulties rather than estimating multidimensional skill variables. The author discusses the issue that there are many new names for essentially well-known models for multiple simultaneous…
Descriptors: Test Items, Probability, Models, Diagnostic Tests
Hancock, Gregory R. – Measurement: Interdisciplinary Research and Perspectives, 2009
As Rupp and Templin (2008) stated directly, diagnostic classification methods "are confirmatory in nature." Methods, though, are neither inherently confirmatory nor exploratory. Diagnostic classification modeling, with its analytical and computational obstacles eventually yielding as a comprehensive and potent discipline emerges, will…
Descriptors: Structural Equation Models, Test Items, Models, Diagnostic Tests
Have Cognitive Diagnostic Models Delivered Their Goods? Some Substantial and Methodological Concerns
Wilhelm, Oliver; Robitzsch, Alexander – Measurement: Interdisciplinary Research and Perspectives, 2009
The paper by Rupp and Templin (2008) is an excellent work on the characteristics and features of cognitive diagnostic models (CDM). In this article, the authors comment on some substantial and methodological aspects of this focus paper. They organize their comments by going through issues associated with the terms "cognitive,"…
Descriptors: Research Methodology, Test Items, Models, Diagnostic Tests
Jiao, Hong – Measurement: Interdisciplinary Research and Perspectives, 2009
Diagnostic assessment is currently an active research area in educational measurement. Literature related to diagnostic modeling has been in existence for several decades, but a great deal of research has been conducted within the last decade or so, especially within the last five years. The author summarizes the key components in the application…
Descriptors: Educational Assessment, Literature Reviews, Test Items, Probability
Hwang, Dae-Yeop – 2002
This study compared classical test theory (CTT) and item response theory (IRT). The behavior of the item and person statistics derived from these two measurement frameworks was examined analytically and empirically using a data set obtained from BILOG (R. Mislevy and D. Bock, 1997). The example was a 15-item test with a sample size of 600…
Descriptors: Comparative Analysis, Measurement Techniques, Scores, Statistical Distributions

Engelhard, George, Jr. – Educational and Psychological Measurement, 1992
A historical perspective is provided of the concept of invariance in measurement theory, describing sample-invariant item calibration and item-invariant measurement of individuals. Invariance as a key measurement concept is illustrated through the measurement theories of E. L. Thorndike, L. L. Thurstone, and G. Rasch. (SLD)
Descriptors: Behavioral Sciences, Educational History, Measurement Techniques, Psychometrics

Secolsky, Charles – Journal of Educational Measurement, 1987
For measuring the face validity of a test, Nevo suggested that test takers and nonprofessional users rate items on a five-point scale. This article questions the ability of those raters and the credibility of the aggregated judgment as evidence of the validity of the test. (JAZ)
Descriptors: Content Validity, Measurement Techniques, Rating Scales, Test Items
Linacre, John M.; Wright, Benjamin D. – 1987
The Mantel-Haenszel (MH) procedure attempts to identify and quantify differential item performance (item bias). This paper summarizes the MH statistics, and identifies the parameters they estimate. An equivalent procedure based on the Rasch model is described. The theoretical properties of the two approaches are compared and shown to require the…
Descriptors: Algorithms, Estimation (Mathematics), Item Analysis, Measurement Techniques