Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 0 |
Since 2016 (last 10 years) | 9 |
Since 2006 (last 20 years) | 14 |
Descriptor
Elementary Secondary Education | 80 |
Test Reliability | 80 |
Scoring | 54 |
Test Validity | 47 |
Test Construction | 29 |
Educational Assessment | 21 |
Performance Based Assessment | 15 |
Standardized Tests | 15 |
Evaluation Methods | 14 |
Test Interpretation | 13 |
Testing Problems | 13 |
More ▼ |
Source
Author
Koretz, Daniel | 3 |
Bergin, Christi | 2 |
Burton, Nancy W. | 2 |
Johnson, Eugene G. | 2 |
Abedi, Jamal | 1 |
Afolabi, Comfort Y. | 1 |
Allison, Howard K., II | 1 |
Anderson, David O. | 1 |
Andrews, Jac | 1 |
Benoit, Joyce | 1 |
Bohning, Gerry | 1 |
More ▼ |
Publication Type
Education Level
Elementary Secondary Education | 14 |
Elementary Education | 2 |
Secondary Education | 2 |
Grade 4 | 1 |
High Schools | 1 |
Higher Education | 1 |
Intermediate Grades | 1 |
Postsecondary Education | 1 |
Location
Vermont | 4 |
Nebraska | 3 |
Canada | 2 |
Missouri | 2 |
New York | 2 |
Texas | 2 |
Alabama | 1 |
Arizona | 1 |
California | 1 |
Colorado (Denver) | 1 |
Florida | 1 |
More ▼ |
Laws, Policies, & Programs
Education Consolidation… | 1 |
Elementary and Secondary… | 1 |
Elementary and Secondary… | 1 |
Individuals with Disabilities… | 1 |
No Child Left Behind Act 2001 | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
Nebraska Department of Education, 2018
The 2018 Nebraska Student-Centered Assessment System (NSCAS) Summative technical report documents the processes and procedures implemented to support the Spring 2018 NSCAS Summative English Language Arts (ELA), Mathematics, and Science assessments by NWEA under the supervision of the Nebraska Department of Education (NDE). The technical report…
Descriptors: Summative Evaluation, Language Tests, English, Mathematics Tests
Nweke, Winifred C.; Perkins, Tasha P.; Afolabi, Comfort Y. – Georgia Educational Researcher, 2019
Assessing the dispositions of teacher candidates remains a challenge for many Educator Preparation Providers (EPPs). This article details the process and results of establishing the reliability of two complementary instruments, the "Candidate Beliefs Self-Assessment Survey" (SAS) and the "Candidate Dispositions Performance…
Descriptors: Preservice Teachers, Student Characteristics, Personality Traits, Test Reliability
Collier, Jo-Kate; Huang, Becky – Language Assessment Quarterly, 2020
This article presents a critical review of the Texas English Language Proficiency Assessment System (TELPAS), a large scale standardized English language proficiency (ELP) assessment developed by the Texas Education Agency (TEA) and administered since 2004. TELPAS is used as an annual summative assessment for all English Learners (ELs) in grades…
Descriptors: English (Second Language), Language Proficiency, Language Tests, Standardized Tests
Jones, Eli; Bergin, Christi – Educational Assessment, 2019
In most U.S. schools, teachers are evaluated using observation of teaching practice (OTP). This study investigates rater effects on OTP ratings among 421 principals in an authentic teacher evaluation system. Many-facet Rasch analysis (MFR) using a block of shared ratings revealed that principals generally (a) differentiated between more and less…
Descriptors: Teacher Effectiveness, Classroom Observation Techniques, Item Response Theory, School Districts
Marbach, Joshua – Journal of Psychoeducational Assessment, 2017
The Mathematics Fluency and Calculation Tests (MFaCTs) are a series of measures designed to assess for arithmetic calculation skills and calculation fluency in children ages 6 through 18. There are five main purposes of the MFaCTs: (1) identifying students who are behind in basic math fact automaticity; (2) evaluating possible delays in arithmetic…
Descriptors: Mathematics Tests, Computation, Mathematics Skills, Arithmetic
Wind, Stefanie A.; Tsai, Chia-Lin; Grajeda, Sara B.; Bergin, Christi – School Effectiveness and School Improvement, 2018
Teacher evaluation systems commonly rely on observation of teaching practice (OTP) by school principals. However, the value of OTP as evidence of teacher effectiveness depends on its psychometric quality. In this study, we address a key aspect of the psychometric quality of principals' OTP ratings. Specifically, we investigate the degree to which…
Descriptors: Classroom Observation Techniques, Teacher Evaluation, Principals, Rating Scales
Pinder, Patrice Juliet – Online Submission, 2020
States are establishing high stakes assessments to serve as measurement tools of students' academic abilities. This study essentially compares Maryland's and Florida's mathematics and science assessments for similarities and differences. Building from 5-10 years of student level quantitative data (secondary data) and critical analyses of the…
Descriptors: Standardized Tests, Achievement Tests, State Standards, High Stakes Tests
Wagemaker, Hans, Ed. – International Association for the Evaluation of Educational Achievement, 2020
Although International Association for the Evaluation of Educational Achievement-pioneered international large-scale assessment (ILSA) of education is now a well-established science, non-practitioners and many users often substantially misunderstand how large-scale assessments are conducted, what questions and challenges they are designed to…
Descriptors: International Assessment, Achievement Tests, Educational Assessment, Comparative Analysis
Thacker, Arthur A.; Dickinson, Emily R.; Bynum, Bethany H.; Wen, Yao; Smith, Erin; Sinclair, Andrea L.; Deatz, Richard C.; Wise, Lauress L. – Partnership for Assessment of Readiness for College and Careers, 2015
The Partnership for Assessment of Readiness for College and Careers (PARCC) field tests during the spring of 2014 provided an opportunity to investigate the quality of the items, tasks, and associated stimuli. HumRRO conducted several research studies summarized in this report. Quality of test items is integral to the "Theory of Action"…
Descriptors: Achievement Tests, Test Items, Common Core State Standards, Difficulty Level
Reed, Deborah K.; Vaughn, Sharon – Scientific Studies of Reading, 2012
The purpose of this narrative synthesis is to determine the reliability and validity of retell protocols for assessing reading comprehension of students in grades K-12. Fifty-four studies were systematically coded for data related to the administration protocol, scoring procedures, and technical adequacy of the retell component. Retell was…
Descriptors: Reading Comprehension, Reading Difficulties, Elementary Secondary Education, Learning Disabilities
Gregory, Jess L.; Noto, Lori A. – Online Submission, 2012
The general education teacher has the greatest influence on a student's success in school, and a teacher's attitude towards inclusion is a major factor in determining whether inclusion will be successful. The ATTAS-mm is a 9-item scale with strong reliability and validity. The three subscales: believing all students can succeed in general…
Descriptors: Academic Achievement, General Education, Teaching (Occupation), Teacher Influence
Frame, Laura B.; Vidrine, Stephanie M.; Hinojosa, Ryan – Journal of Psychoeducational Assessment, 2016
The Kaufman Test of Educational Achievement, Third Edition (KTEA-3) is a revised and updated comprehensive academic achievement test (Kaufman & Kaufman, 2014). Authored by Drs. Alan and Nadeen Kaufman and published by Pearson, the KTEA-3 remains an individual achievement test normed for individuals of ages 4 through 25 years, or for those in…
Descriptors: Achievement Tests, Elementary Secondary Education, Test Validity, Test Reliability
Bill & Melinda Gates Foundation, 2012
No one has a bigger stake in teaching effectiveness than students. Nor are there any better experts on how teaching is experienced by its intended beneficiaries. Only recently have many policymakers and practitioners come to recognize that--when asked the right questions, in the right ways--students can be an important source of information on the…
Descriptors: Student Surveys, Student Attitudes, Feedback (Response), Test Validity

Burton, Nancy W. – Educational and Psychological Measurement, 1981
This study was concerned with selecting a measure of scorer agreement for use with the National Assessment of Educational Progress. The simple percent of agreement and Cohen's kappa were compared. It was concluded that Cohen's kappa does not add sufficient information to make its calculation worthwhile. (Author/BW)
Descriptors: Educational Assessment, Elementary Secondary Education, Quality Control, Scoring

Stephens, M. Irene; Montgomery, Allen A. – Topics in Language Disorders, 1985
Six recently published language tests are examined in terms of theoretical models, choice of subtests, test format, reliability, norming population, and reporting scores. The tests are the Screening Test of Adolescent Language, the Word Test, the Test of Language Development-Intermediate, the Fullerton Language Tests for Adolescents, Clinical…
Descriptors: Elementary Secondary Education, Language Handicaps, Scoring, Standardized Tests