Publication Date
In 2025 | 0 |
Since 2024 | 1 |
Since 2021 (last 5 years) | 1 |
Since 2016 (last 10 years) | 3 |
Since 2006 (last 20 years) | 4 |
Descriptor
Sampling | 7 |
Scoring | 7 |
Statistical Analysis | 7 |
Scores | 4 |
Probability | 3 |
Testing | 3 |
Computation | 2 |
Computer Programs | 2 |
Educational Research | 2 |
Evaluation Methods | 2 |
Generalization | 2 |
More ▼ |
Source
Applied Measurement in… | 1 |
International Journal of… | 1 |
Journal of Research on… | 1 |
National Center for Education… | 1 |
Author
Bayless, D. L. | 1 |
Carol Eckerly | 1 |
Cohen, Allan S., Comp. | 1 |
Deke, John | 1 |
Green, Donald P. | 1 |
Hill, Jennifer | 1 |
John R. Donoghue | 1 |
Kern, Holger L. | 1 |
Lord, Frederic M. | 1 |
Oliveri, María Elena | 1 |
Puma, Mike | 1 |
More ▼ |
Publication Type
Journal Articles | 3 |
Reports - Research | 3 |
Information Analyses | 1 |
Reference Materials -… | 1 |
Reports - Descriptive | 1 |
Reports - Evaluative | 1 |
Education Level
Elementary Secondary Education | 1 |
Audience
Researchers | 1 |
Location
Laws, Policies, & Programs
Elementary and Secondary… | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
John R. Donoghue; Carol Eckerly – Applied Measurement in Education, 2024
Trend scoring constructed response items (i.e. rescoring Time A responses at Time B) gives rise to two-way data that follow a product multinomial distribution rather than the multinomial distribution that is usually assumed. Recent work has shown that the difference in sampling model can have profound negative effects on statistics usually used to…
Descriptors: Scoring, Error of Measurement, Reliability, Scoring Rubrics
Kern, Holger L.; Stuart, Elizabeth A.; Hill, Jennifer; Green, Donald P. – Journal of Research on Educational Effectiveness, 2016
Randomized experiments are considered the gold standard for causal inference because they can provide unbiased estimates of treatment effects for the experimental participants. However, researchers and policymakers are often interested in using a specific experiment to inform decisions about other target populations. In education research,…
Descriptors: Educational Research, Generalization, Sampling, Participant Characteristics
Oliveri, María Elena; von Davier, Alina A. – International Journal of Testing, 2016
In this study, we propose that the unique needs and characteristics of linguistic minorities should be considered throughout the test development process. Unlike most measurement invariance investigations in the assessment of linguistic minorities, which typically are conducted after test administration, we propose strategies that focus on the…
Descriptors: Psychometrics, Linguistics, Test Construction, Testing
Schochet, Peter Z.; Puma, Mike; Deke, John – National Center for Education Evaluation and Regional Assistance, 2014
This report summarizes the complex research literature on quantitative methods for assessing how impacts of educational interventions on instructional practices and student learning differ across students, educators, and schools. It also provides technical guidance about the use and interpretation of these methods. The research topics addressed…
Descriptors: Statistical Analysis, Evaluation Methods, Educational Research, Intervention
Bayless, D. L.; And Others – 1974
Raw scores on most standardized educational and psychological assessment instruments acquire meaning only when referenced to a set of norms. Test publishers should clearly describe their norming procedures, including the target population and the sample on which the norms are based. The primary purpose of this report is to illustrate some of the…
Descriptors: Career Education, Comparative Testing, Measurement Techniques, National Norms
Lord, Frederic M. – 1971
Some stochastic approximation procedures are considered in relation to the problem of choosing a sequence of test questions to accurately estimate a given examinee's standing on a psychological dimension. Illustrations are given evaluating certain procedures in a specific context. (Author/CK)
Descriptors: Academic Ability, Adaptive Testing, Computer Programs, Difficulty Level
Cohen, Allan S., Comp. – 1979
This partially annotated bibliography of journal articles, dissertations, convention papers, research reports, and a few books and unpublished manuscripts provides a comprehensive coverage of work on latent trait theory and practice. Documents are arranged alphabetically by author. The period covered ranges from the early 1950's to the present.…
Descriptors: Attitude Measures, Career Development, Computer Assisted Testing, Computer Programs