Metsämuuronen, Jari – International Journal of Educational Methodology, 2020
Pearson product-moment correlation coefficient between item g and test score X, known as item-test or item-total correlation ("Rit"), and item-rest correlation ("Rir") are two of the most used classical estimators for item discrimination power (IDP). Both "Rit" and "Rir" underestimate IDP caused by the…
Descriptors: Correlation, Test Items, Scores, Difficulty Level
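As a rough illustration of the two estimators named in this abstract, the sketch below computes the item-total correlation (Rit) and the item-rest correlation (Rir) for one item in a small, made-up 0/1 response matrix. The data and the function names (`pearson`, `item_total_and_rest`) are hypothetical and are not taken from the article.

```python
from math import sqrt

def pearson(x, y):
    """Pearson product-moment correlation of two equal-length sequences."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)

def item_total_and_rest(responses, g):
    """Return (Rit, Rir) for item g.

    Rit correlates the item with the total score X (item included);
    Rir correlates the item with the score on the remaining items.
    """
    item = [row[g] for row in responses]
    total = [sum(row) for row in responses]
    rest = [t - i for t, i in zip(total, item)]
    return pearson(item, total), pearson(item, rest)

# Hypothetical data: rows are examinees, columns are items scored 0/1.
rit_responses = [
    [1, 1, 1, 1],
    [1, 1, 1, 0],
    [1, 1, 0, 0],
    [1, 0, 0, 0],
    [0, 0, 0, 0],
    [0, 1, 0, 0],
]

rit, rir = item_total_and_rest(rit_responses, 0)
print(f"Rit = {rit:.3f}, Rir = {rir:.3f}")
```

Because the total score contains the item itself, Rit is typically larger than Rir on the same data, which is why the two are usually reported separately.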
Quaigrain, Kennedy; Arhin, Ato Kwamina – Cogent Education, 2017
Item analysis is essential in improving items which will be used again in later tests; it can also be used to eliminate misleading items in a test. The study focused on item and test quality and explored the relationship between difficulty index (p-value) and discrimination index (DI) with distractor efficiency (DE). The study was conducted among…
Descriptors: Item Analysis, Teacher Developed Materials, Test Reliability, Educational Assessment
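The two indices mentioned in this abstract can be sketched as follows. The difficulty index is taken here as the proportion of examinees answering the item correctly, and the discrimination index as the difference in that proportion between upper and lower scoring groups. The 27% group size is a common convention and an assumption here; the abstract does not say which grouping the study used, and the data are made up. Distractor efficiency is not covered.

```python
def difficulty_index(item_scores):
    """Proportion of examinees who answered the item correctly (p-value)."""
    return sum(item_scores) / len(item_scores)

def discrimination_index(responses, g, fraction=0.27):
    """Upper-group minus lower-group proportion correct on item g.

    Groups are the top and bottom `fraction` of examinees by total score
    (assumed convention, not confirmed by the abstract).
    """
    ranked = sorted(responses, key=sum, reverse=True)
    k = max(1, round(len(ranked) * fraction))
    upper = ranked[:k]
    lower = ranked[-k:]
    p_upper = sum(row[g] for row in upper) / k
    p_lower = sum(row[g] for row in lower) / k
    return p_upper - p_lower

# Hypothetical data: rows are examinees, columns are items scored 0/1.
di_responses = [
    [1, 1, 1, 1, 1],
    [1, 1, 1, 1, 0],
    [1, 1, 1, 0, 0],
    [1, 1, 0, 0, 0],
    [0, 1, 0, 0, 0],
    [0, 0, 0, 0, 0],
]

p_item = difficulty_index([row[1] for row in di_responses])
di_item = discrimination_index(di_responses, 1)
print(f"p = {p_item:.3f}, DI = {di_item:.3f}")
```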
Lai, Cheng-Fei; Irvin, P. Shawn; Park, Bitnara Jasmine; Alonzo, Julie; Tindal, Gerald – Behavioral Research and Teaching, 2012
In this technical report, we present the results of a reliability study of the third-grade multiple choice reading comprehension measures available on the easyCBM learning system conducted in the spring of 2011. Analyses include split-half reliability, alternate form reliability, person and item reliability as derived from Rasch analysis,…
Descriptors: Grade 3, Curriculum Based Assessment, Educational Testing, Testing Programs
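For readers unfamiliar with split-half reliability, one of the analyses listed in this abstract, a minimal sketch follows. It assumes an odd-even item split and the Spearman-Brown correction to full test length; the abstract does not state which split or correction the report used, and the response data below are invented.

```python
from math import sqrt

def pearson_r(x, y):
    """Pearson product-moment correlation of two equal-length sequences."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)

def split_half_reliability(responses):
    """Odd-even split-half reliability with the Spearman-Brown correction."""
    half_a = [sum(row[0::2]) for row in responses]  # items 1, 3, 5, ...
    half_b = [sum(row[1::2]) for row in responses]  # items 2, 4, 6, ...
    r_halves = pearson_r(half_a, half_b)
    # Spearman-Brown: project the half-test correlation to full length.
    return 2 * r_halves / (1 + r_halves)

# Hypothetical data: rows are examinees, columns are items scored 0/1.
sh_responses = [
    [1, 1, 1, 1],
    [1, 1, 1, 0],
    [1, 1, 0, 0],
    [1, 0, 0, 0],
    [0, 0, 0, 0],
    [0, 1, 0, 0],
]

sh_reliability = split_half_reliability(sh_responses)
print(f"split-half reliability = {sh_reliability:.3f}")
```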
Park, Bitnara Jasmine; Irvin, P. Shawn; Lai, Cheng-Fei; Alonzo, Julie; Tindal, Gerald – Behavioral Research and Teaching, 2012
In this technical report, we present the results of a reliability study of the fifth-grade multiple choice reading comprehension measures available on the easyCBM learning system conducted in the spring of 2011. Analyses include split-half reliability, alternate form reliability, person and item reliability as derived from Rasch analysis,…
Descriptors: Grade 5, Curriculum Based Assessment, Educational Testing, Testing Programs
Park, Bitnara Jasmine; Irvin, P. Shawn; Alonzo, Julie; Lai, Cheng-Fei; Tindal, Gerald – Behavioral Research and Teaching, 2012
In this technical report, we present the results of a reliability study of the fourth-grade multiple choice reading comprehension measures available on the easyCBM learning system conducted in the spring of 2011. Analyses include split-half reliability, alternate form reliability, person and item reliability as derived from Rasch analysis,…
Descriptors: Grade 4, Curriculum Based Assessment, Educational Testing, Testing Programs
Irvin, P. Shawn; Alonzo, Julie; Park, Bitnara Jasmine; Lai, Cheng-Fei; Tindal, Gerald – Behavioral Research and Teaching, 2012
In this technical report, we present the results of a reliability study of the sixth-grade multiple choice reading comprehension measures available on the easyCBM learning system conducted in the spring of 2011. Analyses include split-half reliability, alternate form reliability, person and item reliability as derived from Rasch analysis,…
Descriptors: Grade 6, Grade 3, Curriculum Based Assessment, Educational Testing
Gill, Brian; Bruch, Julie; Booker, Kevin – Regional Educational Laboratory Mid-Atlantic, 2013
States are increasingly interested in including measures of student achievement growth, or "value-added," in evaluating teachers. Annual state assessments, however, which are the typical measure of student growth, usually cover only reading and math teachers and only in grades 4-8. These state assessments thus cannot…
Descriptors: Teacher Evaluation, Teacher Competencies, Evaluation Methods, Educational Testing
Gill, Brian; Bruch, Julie; Booker, Kevin – Regional Educational Laboratory Mid-Atlantic, 2013
States and school districts are exploring alternatives to state tests for measuring teachers' contributions to student learning. One approach applies statistical value-added methods to alternative student assessments such as commercially available tests and end-of-course tests. The evidence suggests that these methods can reliably distinguish…
Descriptors: Teacher Evaluation, Teacher Competencies, Evaluation Methods, Educational Testing
Joseph, Dane Christian – ProQuest LLC, 2010
Multiple-choice item-writing guideline research is in its infancy. Haladyna (2004) calls for a science of item-writing guideline research. This study responds to that call by examining the impact of student ability and method for varying the location of correct answers in classroom multiple-choice…
Descriptors: Evidence, Test Format, Guessing (Tests), Program Effectiveness
Harris, Chester W. – 1972
The efficiency of fixed-length mastery tests that sort students into two categories is analyzed. For sorting the students, an index suggested by Fisher's linear discriminant function for two groups is provided. (DB)
Descriptors: Educational Testing, Models, Statistical Analysis, Student Distribution
Fortna, Richard O. – 1981
Measurement terms used in Title I evaluation are contained in this glossary. Several types of measurement techniques are identified and defined. Other measurement terms which are defined include those relating to validity, reliability, statistical analysis, test interpretation, and program effectiveness. (DWH)
Descriptors: Educational Testing, Evaluation Methods, Glossaries, Program Evaluation

Klinger, Don A.; Rogers, W. Todd – Alberta Journal of Educational Research, 2003
The estimation accuracy of procedures based on classical test score theory and item response theory (generalized partial credit model) was compared for examinations consisting of multiple-choice and extended-response items. Analysis of British Columbia Scholarship Examination results found an error rate of about 10 percent for both methods, with…
Descriptors: Academic Achievement, Educational Testing, Foreign Countries, High Stakes Tests
Houser, Ronald L.; And Others – 1983
This report describes a procedure that promises to improve the stability, accuracy, and efficiency of the employment of latent trait models and an application of the procedure to the Rasch model. Data were collected from the Portland Public Schools Level Tests administered to 25,740 students. Since each of the 173 items (chosen from the total…
Descriptors: Academic Achievement, Educational Testing, Item Banks, Latent Trait Theory
Michigan State Dept. of Education, Lansing. – 1971
This report describes the development of the 1969-70 Michigan Educational Assessment measures used in assessing the levels and distribution of educational performance for Michigan's districts, schools, and pupils. The report has four sections. The first section contains a brief description of the 1969-70 assessment program, including a statement…
Descriptors: Achievement Tests, Attitude Measures, Educational Testing, Measurement Instruments

Gilmer, Jerry S.; Feldt, Leonard S. – 1982
The Feldt-Gilmer congeneric reliability coefficients make it possible to estimate the reliability of a test composed of parts of unequal, unknown length. The approximate standard errors of the Feldt-Gilmer coefficients are derived via a method using the multivariate Taylor's expansion. Monte Carlo simulation is employed to corroborate the…
Descriptors: Educational Testing, Error of Measurement, Mathematical Formulas, Mathematical Models