Showing 1 to 15 of 25 results
Peer reviewed
Susan K. Johnsen – Gifted Child Today, 2025
The author provides information about reliability and areas that educators should examine in determining if an assessment is consistent and trustworthy for use, and how it should be interpreted in making decisions about students. Reliability areas that are discussed in the column include internal consistency, test-retest or stability, inter-scorer…
Descriptors: Test Reliability, Academically Gifted, Student Evaluation, Error of Measurement
Peer reviewed
Robert Meyer; Sara Hu; Michael Christian – Society for Research on Educational Effectiveness, 2023
Background: This paper develops a new method to estimate quasi-experimental evaluation models when it is necessary to control for measurement error in predictors and individual assignment to the treatment group is based on these same fallible variables. A major methodological finding of the study is that standard methods of estimating models that…
Descriptors: Error of Measurement, Measurement Techniques, Elementary Secondary Education, Report Cards
Wenjing Guo – ProQuest LLC, 2021
Constructed response (CR) items are widely used in large-scale testing programs, including the National Assessment of Educational Progress (NAEP) and many district and state-level assessments in the United States. One unique feature of CR items is that they depend on human raters to assess the quality of examinees' work. The judgment of human…
Descriptors: National Competency Tests, Responses, Interrater Reliability, Error of Measurement
Patrick C. Kyllonen; Amit Sevak; Teresa Ober; Ikkyu Choi; Jesse Sparks; Daniel Fishtein – ETS Research Institute, 2024
Assessment refers to a broad array of approaches for measuring or evaluating a person's (or group of persons') skills, behaviors, dispositions, or other attributes. Assessments range from standardized tests used in admissions, employee selection, licensure examinations, and domestic and international large-scale assessments of cognitive and…
Descriptors: Performance Based Assessment, Evaluation Criteria, Evaluation Methods, Test Bias
Peer reviewed
Gadke, Daniel L.; Drevon, Daniel D. – School Psychology Review, 2020
Brief experimental analysis (BEA) is frequently used to drive intervention selection decisions for students in need of intensive reading fluency intervention. Researchers have demonstrated that most BEA results for students with reading fluency difficulties are undifferentiated when considering the standard error of measurement (SEM) of…
Descriptors: Data Use, Decision Making, Efficiency, Intervention
Peer reviewed
Gottlieb, Derek; Moroye, Christy M. – Journal of Curriculum and Pedagogy, 2016
We examine the reliance on rubrics for educational evaluation and explore whether such tools fulfill their promise. Following Wittgensteinian critical strategies, we explore what "the application of the [rubric] picture looks like" and then evaluate (a) whether those benefits are attributable to rubric use at all, and (b) whether any of…
Descriptors: Scoring Rubrics, Educational Assessment, Student Evaluation, Educational Benefits
Peer reviewed
In'nami, Yo; Koizumi, Rie – International Journal of Testing, 2013
The importance of sample size, although widely discussed in the literature on structural equation modeling (SEM), has not been widely recognized among applied SEM researchers. To narrow this gap, we focus on second language testing and learning studies and examine the following: (a) Is the sample size sufficient in terms of precision and power of…
Descriptors: Structural Equation Models, Sample Size, Second Language Instruction, Monte Carlo Methods
Diakow, Ronli Phyllis – ProQuest LLC, 2013
This dissertation comprises three papers that propose, discuss, and illustrate models to make improved inferences about research questions regarding student achievement in education. Addressing the types of questions common in educational research today requires three different "extensions" to traditional educational assessment: (1)…
Descriptors: Inferences, Educational Assessment, Academic Achievement, Educational Research
Peer reviewed
Turner, Gill; Gibbs, Graham – Assessment & Evaluation in Higher Education, 2010
There is considerable variation between male and female Bachelor degree performance at Oxford and Cambridge (Oxbridge) where male students attain more First and Third Class degrees and female students attain more Second Class degrees. Various hypotheses have been put forward to explain this phenomenon including the possibility that the distinctive…
Descriptors: Gender Differences, Questionnaires, Evaluation Methods, Evaluation Research
Isenberg, Eric; Hock, Heinrich – Mathematica Policy Research, Inc., 2011
This report presents the value-added models that will be used to measure school and teacher effectiveness in the District of Columbia Public Schools (DCPS) in the 2010-2011 school year. It updates the earlier technical report, "Measuring Value Added for IMPACT and TEAM in DC Public Schools." The earlier report described the methods used…
Descriptors: Public Schools, Teacher Effectiveness, School Effectiveness, Models
Peer reviewed
Vannest, Kimberly J.; Parker, Richard I. – Journal of Special Education, 2010
Instructional time use is an intervention without equal. The measure of such has clear and important implications for special education practice and research. Although exhortations to maximize instruction and thereby student engagement exist throughout the literature, few studies discuss how special education teachers use their time, and none…
Descriptors: School Schedules, Error of Measurement, Sampling, Special Education Teachers
Peer reviewed
Birenbaum, Menucha – Studies in Educational Evaluation, 2007
High quality assessment practice is expected to yield valid and useful score-based interpretations about what the examinees know and are able to do with respect to a defined target domain. Given this assertion, the article presents a framework based on the "unified view of validity," advanced by Cronbach and Messick over two decades ago, to assist…
Descriptors: Quality Control, Student Evaluation, Validity, Evaluation Methods
Peer reviewed
Ferrao, Maria – Assessment & Evaluation in Higher Education, 2010
The Bologna Declaration brought reforms into higher education that imply changes in teaching methods, didactic materials and textbooks, infrastructures and laboratories, etc. Statistics and mathematics are disciplines that traditionally have the worst success rates, particularly in non-mathematics core curricula courses. This research project,…
Descriptors: Foreign Countries, Computer Assisted Testing, Educational Technology, Educational Assessment
Peer reviewed
Sanders, Steven G. – Journal of College Science Teaching, 1975
Several techniques to use in evaluation and grading are presented. Some grading problems are discussed briefly. (PEB)
Descriptors: Error of Measurement, Evaluation, Evaluation Methods, Grading
Peer reviewed
Leark, Robert A.; Wallace, Denise R.; Fitzgerald, Robert – Assessment, 2004
Test-retest reliability of the Test of Variables of Attention (T.O.V.A.) was investigated in two studies using two different time intervals: 90 min and 1 week (plus or minus 2 days). To investigate the 90-min reliability, 31 school-age children (M = 10 years, SD = 2.66) were administered the T.O.V.A. then readministered the test 90 min afterward.…
Descriptors: Intervals, Reaction Time, Error of Measurement, Test Reliability