ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	1
Since 2016 (last 10 years)	5
Since 2006 (last 20 years)	14

Descriptor

Generalizability Theory	15
Grade 3	12
Elementary School Students	7
Reading Tests	7
Error of Measurement	6
Grade 4	4
Scores	4
Test Reliability	4
Curriculum Based Assessment	3
Evaluation Methods	3
Grade 5	3
Mathematics Tests	3
Reading Fluency	3
Reliability	3
Standardized Tests	3
Statistical Analysis	3
Writing (Composition)	3
Writing Evaluation	3
Achievement Tests	2
Childrens Writing	2
Cutting Scores	2
Educational Policy	2
Item Response Theory	2
Language Arts	2
Oral Reading	2
More ▼

Source

Journal of Psychoeducational…	2
Assessment for Effective…	1
Behavioral Research and…	1
Canadian Journal of…	1
ETS Research Report Series	1
Educational and Psychological…	1
Grantee Submission	1
International Journal of…	1
Journal of Educational and…	1
Learning Disability Quarterly	1
National Education Policy…	1
ProQuest LLC	1
Reading and Writing: An…	1
Society for Research on…	1
More ▼

Publication Type

Journal Articles	11
Reports - Research	8
Reports - Evaluative	6
Dissertations/Theses -…	1
Numerical/Quantitative Data	1
Opinion Papers	1

Education Level

Grade 3	15
Elementary Education	10
Early Childhood Education	6
Grade 5	6
Primary Education	6
Grade 4	5
Intermediate Grades	4
Elementary Secondary Education	3
Middle Schools	3
Grade 6	2
Grade 7	2
Grade 8	2
Grade 10	1
Grade 2	1
Grade 9	1
High Schools	1
Junior High Schools	1
Secondary Education	1
More ▼

Audience

Location

California	1
Canada	1
District of Columbia	1
Florida	1
Iowa	1
New York	1

Laws, Policies, & Programs

Individuals with Disabilities…

Assessments and Surveys

Cognitive Abilities Test	1
Dynamic Indicators of Basic…	1
Florida Comprehensive…	1

What Works Clearinghouse Rating

Showing all 15 results Save | Export

(In)Stability of Test Scores

Peer reviewed
PDF on ERIC

Download full text

Merchant, Stefan; Rich, Jessica; Klinger, Don A. – Canadian Journal of Educational Administration and Policy, 2022

Both school and district administrators use the results of standardized, large-scale tests to inform decisions about the need for, or success of, educational programs and interventions. However, test results at the school level are subject to random fluctuations due to changes in cohort, test items, and other factors outside of the school's…

Descriptors: Standardized Tests, Foreign Countries, Generalizability Theory, Scores

Writing Evaluation: Rater and Task Effects on the Reliability of Writing Scores for Children in Grades 3 and 4

Peer reviewed

Direct link

Kim, Young-Suk Grace; Schatschneider, Christopher; Wanzek, Jeanne; Gatlin, Brandy; Al Otaiba, Stephanie – Reading and Writing: An Interdisciplinary Journal, 2017

We examined how raters and tasks influence measurement error in writing evaluation and how many raters and tasks are needed to reach a desirable level of 0.90 and 0.80 reliabilities for children in Grades 3 and 4. A total of 211 children (102 boys) were administered three tasks in narrative and expository genres, respectively, and their written…

Descriptors: Writing Evaluation, Elementary School Students, Grade 3, Grade 4

Writing Evaluation: Rater and Task Effects on the Reliability of Writing Scores for Children in Grades 3 and 4

Peer reviewed
PDF on ERIC

Download full text

Direct link

Kim, Young-Suk Grace; Schatschneider, Christopher; Wanzek, Jeanne; Gatlin, Brandy; Al Otaiba, Stephanie – Grantee Submission, 2017

Descriptors: Writing Evaluation, Elementary School Students, Grade 3, Grade 4

An Information-Correction Method for Testlet-Based Test Analysis: From the Perspectives of Item Response Theory and Generalizability Theory. Research Report. ETS RR-17-27

Peer reviewed
PDF on ERIC

Download full text

Li, Feifei – ETS Research Report Series, 2017

An information-correction method for testlet-based tests is introduced. This method takes advantage of both generalizability theory (GT) and item response theory (IRT). The measurement error for the examinee proficiency parameter is often underestimated when a unidimensional conditional-independence IRT model is specified for a testlet dataset. By…

Descriptors: Item Response Theory, Generalizability Theory, Tests, Error of Measurement

Assessing the Writing Achievement of Young Struggling Writers: Application of Generalizability Theory

Peer reviewed

Direct link

Graham, Steve; Hebert, Michael; Paige Sandbank, Michael; Harris, Karen R. – Learning Disability Quarterly, 2016

This study examined the number of writing samples needed to obtain a reliable estimate of young struggling writers' capabilities. It further assessed if performance in one genre was reflective of performance in other genres for these children. Second- and third-grade students (81 boys, 56 girls), who were identified as struggling writers in need…

Descriptors: Writing Achievement, Writing Difficulties, Writing (Composition), Norm Referenced Tests

An Examination of Test-Retest, Alternate Form Reliability, and Generalizability Theory Study of the easyCBM Word and Passage Reading Fluency Assessments: Grade 3. Technical Report #1218

Download full text

Park, Bitnara Jasmine; Anderson, Daniel; Alonzo, Julie; Lai, Cheng-Fei; Tindal, Gerald – Behavioral Research and Teaching, 2012

This technical report is one in a series of five describing the reliability (test/retest and alternate form) and G-Theory/D-Study research on the easyCBM reading measures, grades 1-5. Data were gathered in the spring of 2011 from a convenience sample of students nested within classrooms at a medium-sized school district in the Pacific Northwest.…

Descriptors: Test Reliability, Generalizability Theory, Curriculum Based Assessment, Reading Tests

Assessing the Generalizability of Estimates of Causal Effects from Regression Discontinuity Designs

Download full text

Bloom, Howard S.; Porter, Kristin E. – Society for Research on Educational Effectiveness, 2012

In recent years, the regression discontinuity design (RDD) has gained widespread recognition as a quasi-experimental method that when used correctly, can produce internally valid estimates of causal effects of a treatment, a program or an intervention (hereafter referred to as treatment effects). In an RDD study, subjects or groups of subjects…

Descriptors: Regression (Statistics), Research Design, Computation, Generalizability Theory

Review of "The Effects of Test-Based Retention on Student Outcomes over Time: Regression Discontinuity Evidence from Florida"

Peer reviewed
PDF on ERIC

Download full text

Robinson-Cimpian, Joseph P. – National Education Policy Center, 2015

A recent NBER [National Bureau of Economic Research] working paper examines Florida's policy to retain many low-scoring third graders. The report concludes that third-grade retention has immediate positive effects on the following year's test results, but these effects fade over the next six years, with no effect on graduation. The regression…

Descriptors: Achievement Tests, Standardized Tests, State Standards, Regression (Statistics)

Applying Rasch Model and Generalizability Theory to Study Modified-Angoff Cut Scores

Peer reviewed

Direct link

Arce, Alvaro J.; Wang, Ze – International Journal of Testing, 2012

The traditional approach to scale modified-Angoff cut scores transfers the raw cuts to an existing raw-to-scale score conversion table. Under the traditional approach, cut scores and conversion table raw scores are not only seen as interchangeable but also as originating from a common scaling process. In this article, we propose an alternative…

Descriptors: Generalizability Theory, Item Response Theory, Cutting Scores, Scaling

Generalizability Theory Analysis of CBM Maze Reliability in Third- through Fifth-Grade Students

Peer reviewed

Direct link

Mercer, Sterett H.; Dufrene, Brad A.; Zoder-Martell, Kimberly; Harpole, Lauren Lestremau; Mitchell, Rachel R.; Blaze, John T. – Assessment for Effective Intervention, 2012

Despite growing use of CBM Maze in universal screening and research, little information is available regarding the number of CBM Maze probes needed for reliable decisions. The current study extends existing research on the technical adequacy of CBM Maze by investigating the number of probes and assessment durations (1-3 min) needed for reliable…

Descriptors: Generalizability Theory, Curriculum Based Assessment, Reading Tests, Cloze Procedure

Passage Equivalency and Predictive Validity of Oral Reading Fluency Measures

Direct link

Checca, Christopher Jason – ProQuest LLC, 2012

The use of oral reading fluency (ORF) passages within a Response to Intervention (RTI) framework is examined. Significant limitations within the current ORF research are discussed. The passage equivalency and readability scores for DIBELS Next, AIMSweb, and a school district's curriculum's ORF passages are evaluated using Generalizability Theory…

Descriptors: Reading Fluency, Oral Reading, Reading Tests, Predictive Validity

Measuring Test Measurement Error: A General Approach

Peer reviewed

Direct link

Boyd, Donald; Lankford, Hamilton; Loeb, Susanna; Wyckoff, James – Journal of Educational and Behavioral Statistics, 2013

Test-based accountability as well as value-added asessments and much experimental and quasi-experimental research in education rely on achievement tests to measure student skills and knowledge. Yet, we know little regarding fundamental properties of these tests, an important example being the extent of measurement error and its implications for…

Descriptors: Accountability, Educational Research, Educational Testing, Error of Measurement

Multigroup Generalizability Analysis of Verbal, Quantitative, and Nonverbal Ability Tests for Culturally and Linguistically Diverse Students

Peer reviewed

Direct link

Lakin, Joni M.; Lai, Emily R. – Educational and Psychological Measurement, 2012

For educators seeking to differentiate instruction, cognitive ability tests sampling multiple content domains, including verbal, quantitative, and nonverbal reasoning, provide superior information about student strengths and weaknesses compared with unidimensional reasoning measures. However, these ability tests have not been fully evaluated with…

Descriptors: Aptitude Tests, Nonverbal Ability, Cognitive Ability, Verbal Ability

Rating Performance Assessments of Students with Disabilities: A Study of Reliability and Bias

Peer reviewed

Direct link

Mastergeorge, Ann M.; Martinez, Jose Felipe – Journal of Psychoeducational Assessment, 2010

Inclusion of students with disabilities in district-wide and state assessments is mandated by federal regulations, and teachers sometimes play an important role in rating these students' work. In this study, trained teachers rated student proficiency in performance assessments in language arts and mathematics in third, fifth, and ninth grades. The…

Descriptors: Play, Inclusion, Disabilities, Program Effectiveness

An Investigation of the Reliability and Standard Error of Measurement of Words Read Correctly Per Minute Using Curriculum-Based Measurement

Peer reviewed

Direct link

Poncy, Brian C.; Skinner, Christopher H.; Axtell, Philip K. – Journal of Psychoeducational Assessment, 2005

Generalizability (G) theory was used with a sample of 37 third-grade students to assess the variability in words correct per minute (WCPM) scores caused by student skill and passage variability. Reliability-like coefficients and the SEM based on a specific number of assessments using different combinations of passages demonstrated how manipulating…

Descriptors: Generalizability Theory, Curriculum Based Assessment, Error of Measurement, Reliability

Al Otaiba, Stephanie	2
Gatlin, Brandy	2
Kim, Young-Suk Grace	2
Schatschneider, Christopher	2
Wanzek, Jeanne	2
Alonzo, Julie	1
Anderson, Daniel	1
Arce, Alvaro J.	1
Axtell, Philip K.	1
Blaze, John T.	1
Bloom, Howard S.	1
Boyd, Donald	1
Checca, Christopher Jason	1
Dufrene, Brad A.	1
Graham, Steve	1
Harpole, Lauren Lestremau	1
Harris, Karen R.	1
Hebert, Michael	1
Klinger, Don A.	1
Lai, Cheng-Fei	1
Lai, Emily R.	1
Lakin, Joni M.	1
Lankford, Hamilton	1
Li, Feifei	1
Loeb, Susanna	1
More ▼