Kara, Hakan; Cetin, Sevda – International Journal of Assessment Tools in Education, 2020
This study examined and compared the efficiency of various random sampling methods for reducing the number of items rated by judges in an Angoff standard-setting study. First, the full-length test was formed by combining Placement Test 2012 and 2013 mathematics subsets. Then, simple random sampling…
Descriptors: Cutting Scores, Standard Setting (Scoring), Sampling, Error of Measurement
Michaelides, Michalis P.; Haertel, Edward H. – Applied Measurement in Education, 2014
The standard error of equating quantifies the variability in the estimation of an equating function. Because common items for deriving equated scores are treated as fixed, the only source of variability typically considered arises from the estimation of common-item parameters from responses of samples of examinees. Use of alternative, equally…
Descriptors: Equated Scores, Test Items, Sampling, Statistical Inference
DeStefano, Lizanne; Johnson, Jeremiah – American Institutes for Research, 2013
This paper describes one of the first efforts by the National Assessment of Educational Progress (NAEP) to improve measurement at the lower end of the distribution, including measurement for students with disabilities (SD) and English language learners (ELLs). One way to improve measurement at the lower end is to introduce one or more…
Descriptors: National Competency Tests, Measures (Individuals), Disabilities, English Language Learners
Irvin, P. Shawn; Alonzo, Julie; Lai, Cheng-Fei; Park, Bitnara Jasmine; Tindal, Gerald – Behavioral Research and Teaching, 2012
In this technical report, we present the results of a reliability study of the seventh-grade multiple choice reading comprehension measures available on the easyCBM learning system conducted in the spring of 2011. Analyses include split-half reliability, alternate form reliability, person and item reliability as derived from Rasch analysis,…
Descriptors: Reading Comprehension, Testing Programs, Statistical Analysis, Grade 7
Park, Bitnara Jasmine; Alonzo, Julie; Tindal, Gerald – Behavioral Research and Teaching, 2011
This technical report describes the process of development and piloting of reading comprehension measures that are appropriate for seventh-grade students as part of an online progress screening and monitoring assessment system, http://easycbm.com. Each measure consists of an original fictional story of approximately 1,600 to 1,900 words with 20…
Descriptors: Reading Comprehension, Reading Tests, Grade 7, Test Construction