van der Linden, Wim J. – Journal of Educational Measurement, 2009
Two different traditions of response-time (RT) modeling are reviewed: the tradition of distinct models for RTs and responses, and the tradition of model integration in which RTs are incorporated in response models or the other way around. Several conceptual issues underlying both traditions are made explicit and analyzed for their consequences. We…
Descriptors: Test Items, Models, Reaction Time, Measurement
Mislevy, Robert J.; Haertel, Geneva; Cheng, Britte H.; Ructtinger, Liliana; DeBarger, Angela; Murray, Elizabeth; Rose, David; Gravel, Jenna; Colker, Alexis M.; Rutstein, Daisy; Vendlinski, Terry – Educational Research and Evaluation, 2013
Standardizing aspects of assessments has long been recognized as a tactic to help make evaluations of examinees fair. It reduces variation in irrelevant aspects of testing procedures that could advantage some examinees and disadvantage others. However, recent attention to making assessment accessible to a more diverse population of students…
Descriptors: Testing Accommodations, Access to Education, Testing, Psychometrics
Hubley, Anita M.; Zumbo, Bruno D. – Social Indicators Research, 2011
The vast majority of measures have, at their core, a purpose of personal and social change. If test developers and users want measures to have personal and social consequences and impact, then it is critical to consider the consequences and side effects of measurement in the validation process itself. The consequential basis of test interpretation…
Descriptors: Construct Validity, Social Change, Measurement, Test Interpretation
von Davier, Matthias – Measurement: Interdisciplinary Research and Perspectives, 2009
In this commentary, the author points out a few issues, one being that there are models mislabeled as diagnostic, which deal with linear decompositions of item difficulties rather than estimating multidimensional skill variables. The author discusses the issue that there are many new names for essentially well-known models for multiple simultaneous…
Descriptors: Test Items, Probability, Models, Diagnostic Tests
Hancock, Gregory R. – Measurement: Interdisciplinary Research and Perspectives, 2009
As Rupp and Templin (2008) stated directly, diagnostic classification methods "are confirmatory in nature." Methods, though, are neither inherently confirmatory nor exploratory. Diagnostic classification modeling, with its analytical and computational obstacles eventually yielding as a comprehensive and potent discipline emerges, will…
Descriptors: Structural Equation Models, Test Items, Models, Diagnostic Tests
Have Cognitive Diagnostic Models Delivered Their Goods? Some Substantial and Methodological Concerns
Wilhelm, Oliver; Robitzsch, Alexander – Measurement: Interdisciplinary Research and Perspectives, 2009
The paper by Rupp and Templin (2008) is an excellent work on the characteristics and features of cognitive diagnostic models (CDM). In this article, the authors comment on some substantial and methodological aspects of this focus paper. They organize their comments by going through issues associated with the terms "cognitive,"…
Descriptors: Research Methodology, Test Items, Models, Diagnostic Tests
Jiao, Hong – Measurement: Interdisciplinary Research and Perspectives, 2009
Diagnostic assessment is currently an active research area in educational measurement. Literature related to diagnostic modeling has been in existence for several decades, but a great deal of research has been conducted within the last decade or so, especially within the last five years. The author summarizes the key components in the application…
Descriptors: Educational Assessment, Literature Reviews, Test Items, Probability

Spencer, Bruce D. – Journal of Educational Measurement, 1983
Because test scores are ordinal, not cardinal, attributes, the average test score often is a misleading way to summarize the scores of a group of individuals. Similarly, correlation coefficients may be misleading summary measures of association between test scores. Proper, readily interpretable, summary statistics are developed from a theory of…
Descriptors: Correlation, Measurement Techniques, Scores, Statistical Analysis
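Illustrative note on the Spencer (1983) abstract: the point that averages can mislead for ordinal scores can be seen in a small sketch. The two score lists, the square-root rescaling, and the helper name `higher_group` are all hypothetical choices made for illustration; this is not the summary statistic developed in the article. The sketch only shows that an order-preserving rescaling can flip which group has the higher mean, while the comparison of medians stays the same.

```python
import math
from statistics import mean, median

# Hypothetical scores for two groups (illustrative values only).
group_a = [2, 2, 9]
group_b = [4, 4, 4]


def higher_group(summary, a, b):
    """Return which group has the larger summary value: 'A', 'B', or 'tie'."""
    value_a, value_b = summary(a), summary(b)
    if value_a > value_b:
        return "A"
    if value_b > value_a:
        return "B"
    return "tie"


# An order-preserving (monotone) rescaling: it keeps every ranking of
# individual scores intact, which is all an ordinal scale guarantees.
rescaled_a = [math.sqrt(x) for x in group_a]
rescaled_b = [math.sqrt(x) for x in group_b]

raw_mean = higher_group(mean, group_a, group_b)
rescaled_mean = higher_group(mean, rescaled_a, rescaled_b)
raw_median = higher_group(median, group_a, group_b)
rescaled_median = higher_group(median, rescaled_a, rescaled_b)

print("higher mean, raw scores:       ", raw_mean)
print("higher mean, rescaled scores:  ", rescaled_mean)
print("higher median, raw scores:     ", raw_median)
print("higher median, rescaled scores:", rescaled_median)
```

With these values the mean favors one group on the raw scale and the other after rescaling, whereas the median comparison does not change.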
Houston, Robert – Freshman English News, 1981
Provides information on statistical data and jargon so that English department members can more confidently and responsibly identify the inevitable weaknesses and limitations of both tests of writing ability and the research on them. (RL)
Descriptors: Higher Education, Measurement Techniques, Standardized Tests, Test Reliability
Mislevy, Robert J. – 1994
Test theory encompasses models and methods for drawing inferences about what students know and can do, cast in a framework of ideas from measurement, education, and psychology. The emerging paradigm of cognitive psychology prompts new considerations about collecting and interpreting evidence, suggesting alternative models for the nature,…
Descriptors: Alternative Assessment, Cognitive Psychology, Educational Assessment, Inferences

Feldt, Leonard S.; Spray, Judith A. – Research Quarterly for Exercise and Sport, 1983
The reliabilities of two types of measurement plans were compared across six hypothetical distributions of true scores or abilities. The measurement plans were: (1) fixed-length, where the number of trials for all examinees is set in advance; and (2) trials-to-criterion, where examinees must keep trying until they complete a given number of trials…
Descriptors: Criterion Referenced Tests, Evaluation Methods, Higher Education, Measurement Techniques
Mislevy, Robert J. – 1994
Recent developments in cognitive and educational psychology, such as increased appreciation of the situated nature of learning and understanding, call for broader ranges of student models and types of data than those standard in testing today. We must specify how what we observe on the test is related to competence as we conceptualize it, and…
Descriptors: Evaluation Criteria, Inferences, Information Needs, Language Aptitude
Warfel, Katherine Ann – 1984
The goal of test design is to devise an instrument that will provide a stable and accurate assessment of student ability in some area. One means of reaching this goal is through the use of latent trait models, which determine the relationship between the unobservable trait or ability and the observable test performance. Three common latent trait…
Descriptors: Educational Research, Item Analysis, Latent Trait Theory, Measurement Techniques
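Illustrative note on the Warfel (1984) abstract: the idea of a latent trait model relating an unobservable ability to observable test performance can be sketched with a logistic item response function. The abstract is truncated and does not name its three models here, so the logistic form below, the function name `item_response_probability`, and the parameter names are assumptions chosen for illustration, not the paper's own formulation.

```python
import math


def item_response_probability(theta, difficulty, discrimination=1.0, guessing=0.0):
    """Probability of a correct response under a logistic item response function.

    theta          -- latent ability of the examinee (unobservable trait)
    difficulty     -- item location on the same scale as theta
    discrimination -- slope of the curve at the item location (assumed default 1.0)
    guessing       -- lower asymptote, the success chance at very low ability
    """
    logistic = 1.0 / (1.0 + math.exp(-discrimination * (theta - difficulty)))
    return guessing + (1.0 - guessing) * logistic


# Hypothetical item: average difficulty, unit slope, no guessing.
for ability in (-2.0, 0.0, 2.0):
    p = item_response_probability(ability, difficulty=0.0)
    print(f"ability {ability:+.1f}: P(correct) = {p:.3f}")
```

Setting `discrimination` to 1 and `guessing` to 0 leaves a one-parameter curve governed by difficulty alone; freeing those parameters gives more flexible curves at the cost of more quantities to estimate.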

Bigras, Marc; Dessen, Maria Auxiliadora – Early Education and Development, 2002
Tested validity of the Portuguese version of the Social Competence and Behavior Evaluation questionnaire. Found results similar to original French-Canadian instrument in stability, internal consistency, and factorial structure. Found associations between teacher's description of social competence and behavioral difficulties in Brazilian…
Descriptors: Behavior Problems, Factor Analysis, Foreign Countries, Interpersonal Competence
van Weeren, J., Ed. – 1983
Presented in this symposium reader are nine papers, four of which deal with the theory and impact of the Rasch model on language testing and five of which discuss final examinations in secondary schools in both general and specific terms. The papers are: "Introduction to Rasch Measurement: Some Implications for Language Testing" (J. J.…
Descriptors: Adolescents, Comparative Analysis, Comparative Education, Difficulty Level