Mandviwalla, Munir; Schuff, David; Miller, Laurel; Chacko, Manoj – IEEE Transactions on Learning Technologies, 2023
In this article, we develop and evaluate a novel system and computing platform to structure, measure, and improve student development using points. We define student development broadly as the achievement of learning to do, know, live together, and be. The system leverages individual agency, social influences, content generation and sharing,…
Descriptors: Student Development, Academic Achievement, Systems Approach, Design
Daniel McNeish – Grantee Submission, 2023
Factor analysis is often used to model scales created to measure latent constructs, and internal structure validity evidence is commonly assessed with indices like SRMR, RMSEA, and CFI. These indices are essentially effect size measures and definitive benchmarks regarding which values connote reasonable fit have been elusive. Simulations from the…
Descriptors: Models, Testing, Indexes, Factor Analysis
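The fit indices named in this abstract are functions of a fitted model's chi-square, its degrees of freedom, and the sample size. As a minimal illustration of one of them (using a common textbook formulation of RMSEA, not anything taken from McNeish's article):

```python
import math

def rmsea(chi2: float, df: int, n: int) -> float:
    """Root mean square error of approximation, computed from a model's
    chi-square statistic, degrees of freedom, and sample size.
    One common formulation divides by N - 1."""
    if df <= 0 or n <= 1:
        raise ValueError("df must be > 0 and n > 1")
    return math.sqrt(max(chi2 - df, 0.0) / (df * (n - 1)))

# A model whose chi-square equals its df yields 0 by this index.
print(rmsea(chi2=50.0, df=50, n=500))            # 0.0
print(round(rmsea(chi2=120.0, df=50, n=500), 4))  # 0.053
```

The article's point is precisely that fixed cutoffs for such values (e.g., the oft-cited 0.06) behave like context-dependent effect sizes, so the numbers above should not be read against a universal benchmark.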
Bonifay, Wes – Grantee Submission, 2022
Traditional statistical model evaluation typically relies on goodness-of-fit testing and quantifying model complexity by counting parameters. Both of these practices may result in overfitting and have thereby contributed to the generalizability crisis. The information-theoretic principle of minimum description length addresses both of these…
Descriptors: Statistical Analysis, Models, Goodness of Fit, Evaluation Methods
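The minimum description length principle mentioned here scores a model by a single code length that combines misfit with complexity, rather than counting parameters separately from fit. A hedged sketch using the common large-sample two-part approximation (equivalent to BIC/2 in nats), with invented numbers:

```python
import math

def mdl_two_part(neg_log_lik: float, k: int, n: int) -> float:
    """Large-sample two-part MDL approximation: the cost of encoding the
    data given the model (-log-likelihood, in nats) plus the cost of
    encoding k parameters, (k/2) * log(n). Lower is better."""
    return neg_log_lik + 0.5 * k * math.log(n)

# Hypothetical fits: model B fits slightly better but spends many more
# parameters; under MDL the simpler model wins here.
model_a = mdl_two_part(neg_log_lik=1200.0, k=10, n=500)
model_b = mdl_two_part(neg_log_lik=1195.0, k=40, n=500)
print(model_a < model_b)  # True
```

The full MDL framework the article draws on penalizes functional flexibility beyond parameter counts; this two-part version is only its simplest approximation.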
Comparison of the Results of the Generalizability Theory with the Inter-Rater Agreement Coefficients
Eser, Mehmet Taha; Aksu, Gökhan – International Journal of Curriculum and Instruction, 2022
The agreement between raters is examined within the scope of the concept of "inter-rater reliability". Although there are clear definitions of the concepts of agreement between raters and reliability between raters, there is no clear information about the conditions under which agreement and reliability level methods are appropriate to…
Descriptors: Generalizability Theory, Interrater Reliability, Evaluation Methods, Test Theory
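As a point of reference for the agreement coefficients compared in this article, the standard chance-corrected agreement index for two raters is Cohen's kappa, (po - pe) / (1 - pe). A minimal sketch with hypothetical ratings (not data from the study):

```python
from collections import Counter

def cohens_kappa(r1, r2):
    """Cohen's kappa for two raters rating the same items:
    observed agreement po, corrected for the agreement pe
    expected by chance from each rater's marginal frequencies."""
    n = len(r1)
    po = sum(a == b for a, b in zip(r1, r2)) / n
    c1, c2 = Counter(r1), Counter(r2)
    pe = sum(c1[c] / n * c2[c] / n for c in set(r1) | set(r2))
    return (po - pe) / (1 - pe)

# Hypothetical pass/fail judgments on six items.
rater1 = ["pass", "pass", "fail", "pass", "fail", "pass"]
rater2 = ["pass", "fail", "fail", "pass", "fail", "pass"]
print(round(cohens_kappa(rater1, rater2), 3))  # 0.667
```

Agreement indices like kappa treat the two raters as fixed; the G-theory coefficients the article compares them against instead partition variance so that raters can be treated as a sampled facet.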
Sturgis, Paul W.; Marchand, Leslie; Miller, M. David; Xu, Wei; Castiglioni, Analia – Association for Institutional Research, 2022
This article introduces generalizability theory (G-theory) to institutional research and assessment practitioners, and explains how it can be utilized to evaluate the reliability of assessment procedures in order to improve student learning outcomes. The fundamental concepts associated with G-theory are briefly discussed, followed by a discussion…
Descriptors: Generalizability Theory, Institutional Research, Reliability, Computer Software
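The fundamental G-theory computation the article introduces can be illustrated with the simplest design: persons crossed with raters. A sketch assuming the standard one-facet variance-component estimators from ANOVA mean squares, with hypothetical scores:

```python
def g_study_p_by_r(scores):
    """One-facet crossed (persons x raters) G study.
    Estimates the person and residual variance components from mean
    squares, then the generalizability coefficient for the observed
    number of raters: var_p / (var_p + var_res / n_raters)."""
    n_p, n_r = len(scores), len(scores[0])
    grand = sum(map(sum, scores)) / (n_p * n_r)
    pmeans = [sum(row) / n_r for row in scores]
    rmeans = [sum(scores[p][r] for p in range(n_p)) / n_p for r in range(n_r)]
    ms_p = n_r * sum((m - grand) ** 2 for m in pmeans) / (n_p - 1)
    ms_res = sum(
        (scores[p][r] - pmeans[p] - rmeans[r] + grand) ** 2
        for p in range(n_p) for r in range(n_r)
    ) / ((n_p - 1) * (n_r - 1))
    var_p = max((ms_p - ms_res) / n_r, 0.0)
    var_res = ms_res
    g_coef = var_p / (var_p + var_res / n_r)
    return var_p, var_res, g_coef

# Hypothetical rubric scores: 4 students each scored by the same 3 raters.
scores = [[7, 8, 7], [5, 5, 6], [9, 9, 8], [4, 5, 4]]
var_p, var_res, g = g_study_p_by_r(scores)
print(round(g, 3))  # 0.971
```

Here most variance lies between persons rather than in the rater-by-person residual, so the coefficient is high; real assessment data with more error variance would motivate the decision studies the article goes on to discuss.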
Zhiwen Tang – ProQuest LLC, 2021
Artificial intelligence (AI) aims to build intelligent systems that can interact with and assist humans. During the interaction, a system learns the requirements from the human user and adapts to the needs to complete tasks. A popular type of interactive system is retrieval-based, where the system uses a retrieval function to retrieve relevant…
Descriptors: Artificial Intelligence, Intelligent Tutoring Systems, Objectives, Reinforcement
Akaeze, Hope O.; Wu, Jamie Heng-Chieh; Lawrence, Frank R.; Weber, Everett P. – Journal of Psychoeducational Assessment, 2023
This paper reports an investigation into the psychometric properties of the COR-Advantage1.5 (COR-Adv1.5) assessment tool, a criterion-referenced observation-based instrument designed to assess the developmental abilities of children from birth through kindergarten. Using data from 8534 children participating in a state-funded preschool program…
Descriptors: Criterion Referenced Tests, Evaluation Methods, Measures (Individuals), Measurement Techniques
May, Henry; Blackman, Horatio; Van Horne, Sam; Tilley, Katherine; Farley-Ripple, Elizabeth N.; Shewchuk, Samantha; Agboh, Darren; Micklos, Deborah Amsden – Center for Research Use in Education, 2022
In this technical report, the Center for Research Use in Education (CRUE) presents the methodological design of a large-scale quantitative investigation of research use by school-based practitioners through the "Survey of Evidence in Education for Schools (SEE-S)." It documents the major technical aspects of the development of SEE-S,…
Descriptors: Surveys, Schools, Educational Research, Research Utilization
Tipton, Elizabeth; Fellers, Lauren; Caverly, Sarah; Vaden-Kiernan, Michael; Borman, Geoffrey; Sullivan, Kate; Ruiz de Castilla, Veronica – Journal of Research on Educational Effectiveness, 2016
Recently, statisticians have begun developing methods to improve the generalizability of results from large-scale experiments in education. This work has included the development of methods for improved site selection when random sampling is infeasible, including the use of stratification and targeted recruitment strategies. This article provides…
Descriptors: Generalizability Theory, Site Selection, Experiments, Comparative Analysis
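One building block of the stratified recruitment strategies mentioned here is allocating a recruitment target across strata in proportion to stratum size. A toy sketch (the district counts are invented):

```python
def proportional_allocation(strata_sizes, n_sample):
    """Allocate a recruitment target across strata in proportion to
    stratum size, using largest-remainder rounding so the
    allocations sum exactly to n_sample."""
    total = sum(strata_sizes.values())
    raw = {s: n_sample * size / total for s, size in strata_sizes.items()}
    alloc = {s: int(r) for s, r in raw.items()}
    # Hand leftover slots to the largest fractional remainders.
    leftover = n_sample - sum(alloc.values())
    for s in sorted(raw, key=lambda s: raw[s] - alloc[s], reverse=True)[:leftover]:
        alloc[s] += 1
    return alloc

districts = {"urban": 120, "suburban": 60, "rural": 20}
print(proportional_allocation(districts, 10))
# {'urban': 6, 'suburban': 3, 'rural': 1}
```

The article's methods go well beyond this, covering targeted recruitment when sites can refuse, but proportional allocation is the baseline such designs improve on.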
Gresham, Frank M.; Dart, Evan H.; Collins, Tai A. – School Psychology Review, 2017
The concept of treatment integrity is an essential component of data-based decision making within a response-to-intervention model. Although treatment integrity is a topic receiving increased attention in the school-based intervention literature, relatively few studies have been conducted regarding the technical adequacy of treatment integrity…
Descriptors: Fidelity, Generalizability Theory, Observation, Measurement Techniques
McLaughlin, Tara W.; Snyder, Patricia A.; Algina, James – Grantee Submission, 2017
The Learning Target Rating Scale (LTRS) is a measure designed to evaluate the quality of teacher-developed learning targets for embedded instruction for early learning. In the present study, we examined the measurement dependability of LTRS scores by conducting a generalizability study (G-study). We used a partially nested, three-facet model to…
Descriptors: Generalizability Theory, Scores, Rating Scales, Evaluation Methods
Lin, Chih-Kai – Language Testing, 2017
Sparse-rated data are common in operational performance-based language tests, as an inevitable result of assigning examinee responses to a fraction of available raters. The current study investigates the precision of two generalizability-theory methods (i.e., the rating method and the subdividing method) specifically designed to accommodate the…
Descriptors: Data Analysis, Language Tests, Generalizability Theory, Accuracy
Kim, Young-Suk Grace; Schatschneider, Christopher; Wanzek, Jeanne; Gatlin, Brandy; Al Otaiba, Stephanie – Reading and Writing: An Interdisciplinary Journal, 2017
We examined how raters and tasks influence measurement error in writing evaluation and how many raters and tasks are needed to reach desirable reliability levels of 0.90 and 0.80 for children in Grades 3 and 4. A total of 211 children (102 boys) were administered three tasks in narrative and expository genres, respectively, and their written…
Descriptors: Writing Evaluation, Elementary School Students, Grade 3, Grade 4
Kim, Young-Suk Grace; Schatschneider, Christopher; Wanzek, Jeanne; Gatlin, Brandy; Al Otaiba, Stephanie – Grantee Submission, 2017
We examined how raters and tasks influence measurement error in writing evaluation and how many raters and tasks are needed to reach desirable reliability levels of 0.90 and 0.80 for children in Grades 3 and 4. A total of 211 children (102 boys) were administered three tasks in narrative and expository genres, respectively, and their written…
Descriptors: Writing Evaluation, Elementary School Students, Grade 3, Grade 4
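The "how many raters and tasks" question posed in the two abstracts above is answered in G-theory by a decision (D) study: projecting the generalizability coefficient for alternative numbers of tasks and raters from estimated variance components. A sketch for a persons x tasks x raters design, using hypothetical variance components rather than the articles' estimates:

```python
def projected_g(var_p, var_pt, var_pr, var_e, n_tasks, n_raters):
    """D-study projection for a crossed persons x tasks x raters design.
    Relative error variance shrinks as scores are averaged over more
    tasks and raters: var_pt/nt + var_pr/nr + var_e/(nt*nr)."""
    rel_error = var_pt / n_tasks + var_pr / n_raters + var_e / (n_tasks * n_raters)
    return var_p / (var_p + rel_error)

# Hypothetical variance components (person, person x task,
# person x rater, residual) -- not estimates from the studies.
vc = dict(var_p=0.50, var_pt=0.20, var_pr=0.10, var_e=0.30)

for n_t in (1, 2, 3):
    for n_r in (1, 2, 3):
        g = projected_g(**vc, n_tasks=n_t, n_raters=n_r)
        print(f"tasks={n_t} raters={n_r} G={g:.3f}")
```

With components like these, adding tasks buys more reliability than adding raters because the person-by-task component is the larger error source; which facet dominates is exactly what such studies estimate from data.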
Zhang, Bo; Xiao, Yunnan; Luo, Juan – Language Testing in Asia, 2015
Previous studies comparing holistic scoring to analytic scoring of second language writing have given mixed results. Some of them suffer from methodological drawbacks, such as limited writing sample size, limited number of raters, and lack of direct comparison of the two methods. Based on 300 writing samples graded by 14 raters, this research…
Descriptors: Evaluators, Reliability, Scores, Holistic Approach