Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 1 |
Since 2016 (last 10 years) | 4 |
Since 2006 (last 20 years) | 17 |
Descriptor
Generalizability Theory | 29 |
Reliability | 29 |
Validity | 29 |
Scores | 10 |
Error of Measurement | 6 |
Psychometrics | 6 |
Correlation | 5 |
English (Second Language) | 4 |
Foreign Countries | 4 |
Higher Education | 4 |
Academic Achievement | 3 |
More ▼ |
Source
Author
Lee, Yong-Won | 2 |
Allen, Joseph P. | 1 |
Arthurs, Leilani | 1 |
Brandt, Lorilynn | 1 |
Briesch, Amy M. | 1 |
Carey, Jill | 1 |
Chafouleas, Sandra M. | 1 |
Chang, Kuo-En | 1 |
Chang, Tzyy-Hua | 1 |
Christ, Theodore J. | 1 |
Daly III, Edward J. | 1 |
More ▼ |
Publication Type
Journal Articles | 21 |
Reports - Research | 21 |
Speeches/Meeting Papers | 6 |
Reports - Evaluative | 5 |
Dissertations/Theses -… | 2 |
Numerical/Quantitative Data | 2 |
Opinion Papers | 1 |
Reports - Descriptive | 1 |
Tests/Questionnaires | 1 |
Education Level
Higher Education | 8 |
Grade 8 | 3 |
Postsecondary Education | 3 |
Secondary Education | 3 |
Early Childhood Education | 2 |
Elementary Education | 2 |
Grade 7 | 2 |
High Schools | 2 |
Junior High Schools | 2 |
Middle Schools | 2 |
Grade 10 | 1 |
More ▼ |
Audience
Researchers | 2 |
Laws, Policies, & Programs
Assessments and Surveys
Motivated Strategies for… | 1 |
Test of English as a Foreign… | 1 |
What Works Clearinghouse Rating
Martínez, José Felipe; Kloser, Matt; Srinivasan, Jayashri; Stecher, Brian; Edelman, Amanda – Educational and Psychological Measurement, 2022
Adoption of new instructional standards in science demands high-quality information about classroom practice. Teacher portfolios can be used to assess instructional practice and support teacher self-reflection anchored in authentic evidence from classrooms. This study investigated a new type of electronic portfolio tool that allows efficient…
Descriptors: Science Instruction, Academic Standards, Instructional Innovation, Electronic Publishing
Wickerd, Garry; Hulac, David – Journal of Applied School Psychology, 2017
Accurate and rapid identification of students displaying behavioral problems requires instrumentation that is user friendly and reliable. The purpose of the study was to evaluate a multi-item direct behavior rating scale called the Direct Behavior Rating-Multiple Item Scale (DBR-MIS) for disruptive behavior to determine the number of…
Descriptors: Behavior Rating Scales, Kindergarten, Behavior Problems, Young Children
Schmidgall, Jonathan – Applied Measurement in Education, 2017
This study utilizes an argument-based approach to validation to examine the implications of reliability in order to further differentiate the concepts of score and decision consistency. In a methodological example, the framework of generalizability theory was used to estimate appropriate indices of score consistency and evaluations of the…
Descriptors: Scores, Reliability, Validity, Generalizability Theory
Nielsen, Sara E.; Yezierski, Ellen – Journal of Chemical Education, 2015
Though the Chemistry Self-Concept Inventory (CSCI) was developed to study one aspect of the affective domain in college chemistry students, the instrument on which it was based, the Self-Description Questionnaire III, was developed for use with late adolescents. As such, we explored data generated from administering the CSCI to high school…
Descriptors: High School Students, Secondary School Science, Chemistry, Self Concept Measures
Arthurs, Leilani; Hsia, Jennifer F.; Schweinle, William – Journal of Geoscience Education, 2015
We developed and evaluated an Oceanography Concept Inventory (OCI), which used a mixed-methods approach to test student achievement of 11 learning goals for an introductory-level oceanography course. The OCI was designed with expert input, grounded in research on student (mis)conceptions, written with minimal jargon, tested on 464 students, and…
Descriptors: Oceanography, Mixed Methods Research, Academic Achievement, Introductory Courses
Han, Chao – Language Assessment Quarterly, 2016
As a property of test scores, reliability/dependability constitutes an important psychometric consideration, and it underpins the validity of measurement results. A review of interpreter certification performance tests (ICPTs) reveals that (a) although reliability/dependability checking has been recognized as an important concern, its theoretical…
Descriptors: Foreign Countries, Scores, English, Chinese
Briesch, Amy M.; Kilgus, Stephen P.; Chafouleas, Sandra M.; Riley-Tillman, T. Chris; Christ, Theodore J. – Assessment for Effective Intervention, 2013
The current study served to extend previous research on scaling construction of Direct Behavior Rating (DBR) in order to explore the potential flexibility of DBR to fit various intervention contexts. One hundred ninety-eight undergraduate students viewed the same classroom footage but rated student behavior using one of eight randomly assigned…
Descriptors: Validity, Intervention, Measures (Individuals), Student Behavior
Haertel, Edward H. – Educational Testing Service, 2013
Policymakers and school administrators have embraced value-added models of teacher effectiveness as tools for educational improvement. Teacher value-added estimates may be viewed as complicated scores of a certain kind. This suggests using a test validation model to examine their reliability and validity. Validation begins with an interpretive…
Descriptors: Reliability, Validity, Inferences, Teacher Effectiveness
Meijer, Joost; Sleegers, Peter; Elshout-Mohr, Marianne; van Daalen-Kapteijns, Maartje; Meeus, Wil; Tempelaar, Dirk – Educational Research, 2013
Background: Interest in the role of metacognition has been steadily rising in most forms of education. This study focuses on the construction of a questionnaire for measuring metacognitive knowledge, metacognitive regulation and metacognitive responsiveness among students in higher education and the subsequent process of testing to determine its…
Descriptors: Factor Analysis, Higher Education, Independent Study, Questionnaires
Orem, Chris D. – ProQuest LLC, 2012
Meta-assessment, or the assessment of assessment, can provide meaningful information about the trustworthiness of an academic program's assessment results (Bresciani, Gardner, & Hickmott, 2009; Palomba & Banta, 1999; Suskie, 2009). Many institutions conduct meta-assessments for their academic programs (Fulcher, Swain, & Orem, 2012),…
Descriptors: Validity, Evidence, Evaluation Methods, Meta Analysis
Mashburn, Andrew J.; Meyer, J. Patrick; Allen, Joseph P.; Pianta, Robert C. – Educational and Psychological Measurement, 2014
Observational methods are increasingly being used in classrooms to evaluate the quality of teaching. Operational procedures for observing teachers are somewhat arbitrary in existing measures and vary across different instruments. To study the effect of different observation procedures on score reliability and validity, we conducted an experimental…
Descriptors: Observation, Teacher Evaluation, Reliability, Validity
Brandt, Lorilynn – ProQuest LLC, 2010
Phonics was identified as one of the critical components in reading development by the National Reading Panel. Over time, research has repeatedly identified phonics as important to early reading development. Given the compelling evidence supporting the teaching of phonics in early reading, it is critical to make sure that instructional decisions…
Descriptors: Generalizability Theory, Phonics, Early Reading, Validity
Yin, Yue; Shavelson, Richard J. – Applied Measurement in Education, 2008
In the first part of this article, the use of Generalizability (G) theory in examining the dependability of concept map assessment scores and designing a concept map assessment for a particular practical application is discussed. In the second part, the application of G theory is demonstrated by comparing the technical qualities of two frequently…
Descriptors: Generalizability Theory, Concept Mapping, Validity, Reliability
Zhang, Bo; Johnston, Lucy; Kilic, Gulsen Bagci – Assessment & Evaluation in Higher Education, 2008
Peer and self-ratings have been strongly recommended as the means to adjust individual contributions to group work. To evaluate the quality of student ratings, previous research has primarily explored the validity of these ratings, as indicated by the degree of agreement between student and teacher ratings. This research describes a…
Descriptors: Generalizability Theory, Teaching Methods, Reliability, Validity
Sung, Yao-Ting; Chang, Kuo-En; Chang, Tzyy-Hua; Yu, Wen-Cheng – Journal of Adolescence, 2010
Self- and peer assessments are becoming more popular in classrooms, but there are few data on the reliability and validity of such assessments performed by school children. Because these factors are greatly affected by the number of raters, we conducted two studies to determine the rating behaviours of teenagers in self- and peer assessments, and…
Descriptors: Generalizability Theory, Peer Evaluation, Validity, Reliability
Previous Page | Next Page »
Pages: 1 | 2