Practices in Instrument Use and Development in "Chemistry Education Research and Practice" 2010-2021
Lazenby, Katherine; Tenney, Kristin; Marcroft, Tina A.; Komperda, Regis – Chemistry Education Research and Practice, 2023
Assessment instruments that generate quantitative data on attributes (cognitive, affective, behavioral, "etc.") of participants are commonly used in the chemistry education community to draw conclusions in research studies or inform practice. Recently, articles and editorials have stressed the importance of providing evidence for the…
Descriptors: Chemistry, Periodicals, Journal Articles, Science Education
Raczynski, Kevin; Cohen, Allan – Applied Measurement in Education, 2018
The literature on Automated Essay Scoring (AES) systems has provided useful validation frameworks for any assessment that includes AES scoring. Furthermore, evidence for the scoring fidelity of AES systems is accumulating. Yet questions remain when appraising the scoring performance of AES systems. These questions include: (a) which essays are…
Descriptors: Essay Tests, Test Scoring Machines, Test Validity, Evaluators
Cohen, Yoav; Levi, Effi; Ben-Simon, Anat – Applied Measurement in Education, 2018
In the current study, two pools of 250 essays, all written as a response to the same prompt, were rated by two groups of raters (14 or 15 raters per group), thereby providing an approximation to the essay's true score. An automated essay scoring (AES) system was trained on the datasets and then scored the essays using a cross-validation scheme. By…
Descriptors: Test Validity, Automation, Scoring, Computer Assisted Testing
Thawabieh, Ahmad M. – Journal of Curriculum and Teaching, 2017
This study aimed to compare students' self-assessment with teachers' assessment. The study sample consisted of 71 students at Tafila Technical University studying an Introduction to Psychology course. The researcher used 2 student self-assessment tools and 2 tests. The results indicated that students can assess themselves accurately if…
Descriptors: Comparative Analysis, Self Evaluation (Individuals), Student Evaluation, Psychology
Charalambous, Charalambos Y.; Kyriakides, Ermis; Tsangaridou, Niki; Kyriakides, Leonidas – School Effectiveness and School Improvement, 2017
Heightened accountability pressures and an increased emphasis on teaching quality have directed scholarly attention to scrutinizing instruction, particularly with respect to issues of validity and reliability. However, these attempts have largely been directed toward "core" content areas and investigated generic or content-specific…
Descriptors: Physical Education, Instructional Effectiveness, Lesson Plans, Interrater Reliability
Tanner, Nicholas; Eklund, Katie; Kilgus, Stephen P.; Johnson, Austin H. – School Psychology Review, 2018
Data derived from universal screening procedures are increasingly utilized by schools to identify and provide additional support to students at risk for behavioral and emotional concerns. As screening has the potential to be resource intensive, effort has been placed on the development of efficient screening procedures, including brief behavior…
Descriptors: Screening Tests, At Risk Students, Behavior Problems, Emotional Problems
Rios, Joseph A.; Sparks, Jesse R.; Zhang, Mo; Liu, Ou Lydia – ETS Research Report Series, 2017
Proficiency with written communication (WC) is critical for success in college and careers. As a result, institutions face a growing challenge to accurately evaluate their students' writing skills to obtain data that can support demands of accreditation, accountability, or curricular improvement. Many current standardized measures, however, lack…
Descriptors: Test Construction, Test Validity, Writing Tests, College Outcomes Assessment
Weaver, R. Glenn; Webster, Collin A.; Erwin, Heather; Beighle, Aaron; Beets, Michael W.; Choukroun, Hadrien; Kaysing, Nicole – Measurement in Physical Education and Exercise Science, 2016
The System for Observing Fitness Instruction Time (SOFIT) is commonly used to measure variables related to physical activity during physical education (PE). However, SOFIT does not yield detailed information about teacher practices related to children's moderate-to-vigorous physical activity (MVPA). This study describes the modification of SOFIT…
Descriptors: Physical Education, Observation, Physical Activity Level, Teaching Methods
Polignano, Joy C.; Hojnoski, Robin L. – Assessment for Effective Intervention, 2012
There has been increased attention to the development of assessment measures for evaluating mathematical skills in young children in order to inform instruction and intervention. However, existing tools have focused primarily on number sense with little attention to other areas of mathematical thinking such as geometry and algebra. The purpose of…
Descriptors: Numeracy, Curriculum Based Assessment, Test Reliability, Test Validity
Scharf, Davida – ProQuest LLC, 2013
Purpose: The goal of the study was to test an intervention using a brief essay as an instrument for evaluating higher-order information literacy skills in college students, while accounting for prior conditions such as socioeconomic status and prior academic achievement, and identify other predictors of information literacy through an evaluation…
Descriptors: Information Literacy, Intervention, Student Evaluation, College Students
Steed, Elizabeth A.; Webb, Mi-young L. – Journal of Positive Behavior Interventions, 2013
This report documents the reliability and validity of scores on the Preschool-Wide Evaluation Tool (PreSET), an assessment used to measure program-wide implementation of the universal level of positive behavior interventions and support (PBIS) in early childhood settings. Initial analyses of descriptive statistics, item, subscale, and total…
Descriptors: Psychometrics, Preschool Evaluation, Student Behavior, Intervention
Pae, Holly; Freeman, Greta G.; Wash, Pamela D. – AILACTE Journal, 2014
Teacher preparation programs face great challenges in ensuring their graduates are prepared for the demands of today's classrooms. The authors explore how teacher accountability has evolved based upon federal legislation leading to adoption of the Common Core State Standards (CCSS). Recognizing that future teachers will be held accountable for…
Descriptors: Preservice Teacher Education, Preservice Teachers, Knowledge Base for Teaching, State Standards
Sayed, Osama H. – English Language Teaching, 2010
The present study attempted to investigate the effect of using blog-based peer feedback on the persuasive writing of EFL business management students at the community college in Bisha, King Khalid University, Saudi Arabia. The study used a pre-test/post-test experimental and control group design. An experimental group and a control group were…
Descriptors: Foreign Countries, Business Administration Education, Electronic Publishing, Web Sites
Muyskens, Paul; Marston, Doug; Reschly, Amy L. – California School Psychologist, 2007
Behavioral difficulties of school-aged students are typically dealt with in a reactive, rather than preventative, manner. This article examines a proactive approach, consistent with the Response-to-Intervention model, using a screening measure designed to identify students at risk for behavior difficulties and targeting these students for early…
Descriptors: Early Intervention, At Risk Students, Teacher Attitudes, Academic Achievement
Murdock, Linda C.; Cost, Hollie C.; Tieso, Carol – Focus on Autism and Other Developmental Disabilities, 2007
The "Social-Communication Assessment Tool" (S-CAT) was created as a direct observation instrument to quantify specific social and communication deficits of children with autism spectrum disorders (ASD) within educational settings. In this pilot study, the instrument's content validity and interrater reliability were investigated to determine the…
Descriptors: Nonverbal Communication, Autism, Content Validity, Test Validity