Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 1 |
Since 2016 (last 10 years) | 3 |
Since 2006 (last 20 years) | 8 |
Descriptor
Educational Testing | 4 |
Scores | 4 |
Measurement Techniques | 3 |
Testing | 3 |
Validity | 3 |
Accountability | 2 |
Construct Validity | 2 |
Definitions | 2 |
Item Response Theory | 2 |
Measurement | 2 |
Psychological Testing | 2 |
More ▼ |
Source
Measurement:… | 9 |
Author
Publication Type
Journal Articles | 9 |
Reports - Descriptive | 9 |
Opinion Papers | 2 |
Education Level
Elementary Secondary Education | 1 |
Higher Education | 1 |
Postsecondary Education | 1 |
Audience
Researchers | 1 |
Location
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Schumacker, Randall E.; Wind, Stefanie A.; Holmes, Lauren F. – Measurement: Interdisciplinary Research and Perspectives, 2021
A variety of resources are available from which researchers can identify measurement instruments, including peer-reviewed journal articles, collections of technical information about published instruments, and electronic databases that are sponsored by universities, testing organizations, and other groups. Although these resources are widespread,…
Descriptors: Measurement Techniques, Journal Articles, Databases, Testing
Raykov, Tenko; Marcoulides, George A.; Huber, Chuck – Measurement: Interdisciplinary Research and Perspectives, 2020
It is demonstrated that the popular three-parameter logistic model can lead to markedly inaccurate individual ability level estimates for mixture populations. A theoretically and empirically important setting is initially considered where (a) in one of two subpopulations (latent classes) the two-parameter logistic model holds for each item in a…
Descriptors: Item Response Theory, Models, Measurement Techniques, Item Analysis
Choi, Youn-Jeng; Asilkalkan, Abdullah – Measurement: Interdisciplinary Research and Perspectives, 2019
About 45 R packages to analyze data using item response theory (IRT) have been developed over the last decade. This article introduces these 45 R packages with their descriptions and features. It also describes possible advanced IRT models using R packages, as well as dichotomous and polytomous IRT models, and R packages that contain applications…
Descriptors: Item Response Theory, Data Analysis, Computer Software, Test Bias
Cramer, Angelique O. J. – Measurement: Interdisciplinary Research and Perspectives, 2012
What is validity? A simple question but apparently one with many answers, as Paul Newton highlights in his review of the history of validity. The current definition of validity, as entertained in the 1999 "Standards for Educational and Psychological Testing" is indeed a consensus, one between the classical notion of attributes, and measures…
Descriptors: Validity, Educational Testing, Depression (Psychology), Psychology
Newton, Paul E. – Measurement: Interdisciplinary Research and Perspectives, 2012
The 1999 "Standards for Educational and Psychological Testing" defines validity as the degree to which evidence and theory support the interpretations of test scores entailed by proposed uses of tests. Although quite explicit, there are ways in which this definition lacks precision, consistency, and clarity. The history of validity has taught us…
Descriptors: Evidence, Validity, Educational Testing, Risk
Haig, Brian D. – Measurement: Interdisciplinary Research and Perspectives, 2012
Lee Cronbach once expressed the view that all roads lead to construct validity. In looking to clarify the consensus definition of validity, and its place in assessment, Newton is also led to the troublesome idea of construct validity. To be sure, he addresses other validity issues, but in this commentary, I will restrict my attention to construct…
Descriptors: Validity, Educational Assessment, Construct Validity, Definitions
Martineau, Joseph A.; Wyse, Adam E. – Measurement: Interdisciplinary Research and Perspectives, 2015
This article is a commentary of a paper by Derek C. Briggs and Frederick A. Peck, "Using Learning Progressions to Design Vertical Scales That Support Coherent Inferences about Student Growth," which describes an elegant potential framework for at least beginning to address three priorities in large-scale assessment that have not been…
Descriptors: Performance Factors, Barriers, Program Implementation, Group Testing
Huff, Kristen; Plake, Barbara S. – Measurement: Interdisciplinary Research and Perspectives, 2010
Standard setting is a systematic process that uses a combination of judgmental and empirical procedures to make recommendations about where on the score continuum "cut scores" should be placed. Cut scores divide the score scale into categories consistent with the descriptions of student performance associated with multiple levels of achievement.…
Descriptors: Accountability, Educational Testing, Elementary Secondary Education, Standard Setting (Scoring)
Embretson, Susan E. – Measurement: Interdisciplinary Research and Perspectives, 2004
The last century was marked by dazzling changes in many areas, such as technology and communications. Predictions into the second century of testing are seemingly difficult in such a context. Yet, looking back to the turn of the last century, Kirkpatrick (1900), in his American Psychological Association presidential address, presented fundamental…
Descriptors: Ability, Testing, Futures (of Society), Psychometrics