Publication Date
In 2025 | 1 |
Since 2024 | 1 |
Since 2021 (last 5 years) | 2 |
Since 2016 (last 10 years) | 2 |
Since 2006 (last 20 years) | 11 |
Descriptor
Test Items | 26 |
Test Theory | 26 |
Testing | 26 |
Latent Trait Theory | 6 |
Measurement Techniques | 6 |
Models | 6 |
Psychometrics | 6 |
Test Reliability | 6 |
Definitions | 5 |
Evaluation Methods | 5 |
Language Tests | 5 |
More ▼ |
Source
Author
Altepeter, Tom | 1 |
Angoff, William H. | 1 |
Bogan, Evelyn Doody | 1 |
Brown, James Dean | 1 |
Bruno D. Zumbo | 1 |
Cook, Linda L. | 1 |
Davidson, Fred | 1 |
Davis, John L. | 1 |
Dorans, Neil J. | 1 |
Dudley, Albert | 1 |
Hambleton, Ronald K. | 1 |
More ▼ |
Publication Type
Journal Articles | 18 |
Reports - Research | 12 |
Reports - Evaluative | 6 |
Speeches/Meeting Papers | 5 |
Opinion Papers | 4 |
Information Analyses | 2 |
Reports - Descriptive | 2 |
Reference Materials -… | 1 |
Tests/Questionnaires | 1 |
Education Level
Elementary Secondary Education | 1 |
Audience
Practitioners | 2 |
Researchers | 2 |
Teachers | 1 |
Location
Canada | 1 |
Laws, Policies, & Programs
Assessments and Surveys
ACT Assessment | 1 |
Expressive One Word Picture… | 1 |
Graduate Record Examinations | 1 |
Preliminary Scholastic… | 1 |
SAT (College Admission Test) | 1 |
Test of English as a Foreign… | 1 |
What Works Clearinghouse Rating
Kylie Gorney; Sandip Sinharay – Educational and Psychological Measurement, 2025
Test-takers, policymakers, teachers, and institutions are increasingly demanding that testing programs provide more detailed feedback regarding test performance. As a result, there has been a growing interest in the reporting of subscores that potentially provide such detailed feedback. Haberman developed a method based on classical test theory…
Descriptors: Scores, Test Theory, Test Items, Testing
Bruno D. Zumbo – International Journal of Assessment Tools in Education, 2023
In line with the journal volume's theme, this essay considers lessons from the past and visions for the future of test validity. In the first part of the essay, a description of historical trends in test validity since the early 1900s leads to the natural question of whether the discipline has progressed in its definition and description of test…
Descriptors: Test Theory, Test Validity, True Scores, Definitions
Mitchell, Alison M.; Truckenmiller, Adrea; Petscher, Yaacov – Communique, 2015
As part of the Race to the Top initiative, the United States Department of Education made nearly 1 billion dollars available in State Educational Technology grants with the goal of ramping up school technology. One result of this effort is that states, districts, and schools across the country are using computerized assessments to measure their…
Descriptors: Computer Assisted Testing, Educational Technology, Testing, Efficiency
Dorans, Neil J. – Educational Measurement: Issues and Practice, 2012
Views on testing--its purpose and uses and how its data are analyzed--are related to one's perspective on test takers. Test takers can be viewed as learners, examinees, or contestants. I briefly discuss the perspective of test takers as learners. I maintain that much of psychometrics views test takers as examinees. I discuss test takers as a…
Descriptors: Testing, Test Theory, Item Response Theory, Test Reliability
van der Linden, Wim J. – Journal of Educational Measurement, 2009
Two different traditions of response-time (RT) modeling are reviewed: the tradition of distinct models for RTs and responses, and the tradition of model integration in which RTs are incorporated in response models or the other way around. Several conceptual issues underlying both traditions are made explicit and analyzed for their consequences. We…
Descriptors: Test Items, Models, Reaction Time, Measurement
von Davier, Matthias – Measurement: Interdisciplinary Research and Perspectives, 2009
In this commentary, the author points out few issues, one being that there are models mislabeled as diagnostic, which deal with linear decompositions of item difficulties rather than estimating multidimensional skill variables. The author discusses the issue that there are many new names for essentially well-known models for multiple simultaneous…
Descriptors: Test Items, Probability, Models, Diagnostic Tests
Vannest, Kimberly J.; Parker, Richard I.; Davis, John L.; Soares, Denise A.; Smith, Stacey L. – Behavioral Disorders, 2012
More and more, schools are considering the use of progress monitoring data for high-stakes decisions such as special education eligibility, program changes to more restrictive environments, and major changes in educational goals. Those high-stakes types of data-based decisions will need methodological defensibility. Current practice for…
Descriptors: Decision Making, Educational Change, Regression (Statistics), Field Tests
Hancock, Gregory R. – Measurement: Interdisciplinary Research and Perspectives, 2009
As Rupp and Templin (2008) stated directly, diagnostic classification methods "are confirmatory in nature." Methods, though, are neither inherently confirmatory nor exploratory. Diagnostic classification modeling, with its analytical and computational obstacles eventually yielding as a comprehensive and potent discipline emerges, will…
Descriptors: Structural Equation Models, Test Items, Models, Diagnostic Tests
Have Cognitive Diagnostic Models Delivered Their Goods? Some Substantial and Methodological Concerns
Wilhelm, Oliver; Robitzsch, Alexander – Measurement: Interdisciplinary Research and Perspectives, 2009
The paper by Rupp and Templin (2008) is an excellent work on the characteristics and features of cognitive diagnostic models (CDM). In this article, the authors comment on some substantial and methodological aspects of this focus paper. They organize their comments by going through issues associated with the terms "cognitive,"…
Descriptors: Research Methodology, Test Items, Models, Diagnostic Tests
Jiao, Hong – Measurement: Interdisciplinary Research and Perspectives, 2009
Diagnostic assessment is currently an active research area in educational measurement. Literature related to diagnostic modeling has been in existence for several decades, but a great deal of research has been conducted within the last decade or so, especially within the last five years. The author summarizes the key components in the application…
Descriptors: Educational Assessment, Literature Reviews, Test Items, Probability

Wilcox, Rand R. – Educational and Psychological Measurement, 1983
This article provides unbiased estimates of the proportion of items in an item domain that an examinee would answer correctly if every item were attempted, when a closed sequential testing procedure is used. (Author)
Descriptors: Estimation (Mathematics), Psychometrics, Scores, Sequential Approach

Woodruff, David – Journal of Educational Statistics, 1986
The purpose of the present paper is to derive linear equating methods for the common item nonequivalent populations design from explicitly stated congeneric type test score models. The equating methods developed are compared with previously developed methods and applied to five professionally constructed examinations administered to approximately…
Descriptors: Equated Scores, Equations (Mathematics), Mathematical Models, Scores
Wainer, Howard – 1982
This paper is the transcript of a talk given to those who use test information but who have little technical background in test theory. The concepts of modern test theory are compared with traditional test theory, as well as a probable future test theory. The explanations given are couched within an extended metaphor that allows a full description…
Descriptors: Difficulty Level, Latent Trait Theory, Metaphors, Test Items

Shohamy, Elana – Annual Review of Applied Linguistics, 1990
Reviews studies and tests that show how discourse analysis has contributed to the theory, research, and development of language testing, covering the relations among discourse analysis and competence and testing theory; research on language tests and tasks; and task development. A 60-citation unannotated bibliography is included. (CB)
Descriptors: Communicative Competence (Languages), Discourse Analysis, Language Research, Language Tests

Brown, James Dean – Language Testing, 1999
Explored the relative contributions to Test of English as a Foreign Language (TOEFL) score dependability of various numbers of persons, items, subtests, languages, and their various interactions. Sampled 15,000 test takers, 1000 each from 15 different language backgrounds. (Author/VWL)
Descriptors: English (Second Language), Language Tests, Second Language Learning, Student Characteristics
Previous Page | Next Page ยป
Pages: 1 | 2