Showing 1 to 15 of 38 results
Peer reviewed
Parker, David C.; Stewart, Lisa H.; Thomson, Susan; Kaminski, Ruth A. – Assessment for Effective Intervention, 2021
Vocabulary skills are important for overall reading competence, but vocabulary assessment approaches that inform instructional decision-making and are sensitive to improvement are limited. This article describes a process for developing vocabulary measures designed to facilitate data-driven decision-making for kindergarten and first-grade students…
Descriptors: Vocabulary, Kindergarten, Grade 1, Elementary School Students
Peer reviewed
Li, Zijia; Gooden, Caroline; Toland, Michael D. – Journal of Early Intervention, 2019
This study provides preliminary evidence for reliability and validity of the Hawaii Early Learning Profile Strands 0-3 (HELP Strands 0-3), an assessment instrument for young children. First, the degree of interobserver agreement for a sample of representative HELP items was examined; results indicated that HELP scoring was dependable and…
Descriptors: Measures (Individuals), Psychometrics, Early Childhood Education, Test Reliability
Peer reviewed
Hoefnagel, Laura; Espin, Christine A.; Rippe, Ralph – International Journal for Research in Learning Disabilities, 2021
Students with and without learning disabilities often struggle to learn a foreign language (FL). Teachers could benefit from a measure designed to screen and identify students at risk for FL learning difficulties. In this study, we examined the reliability and validity of scores from four curriculum-based measures (CBM) as potential indicators of…
Descriptors: Curriculum Based Assessment, Language Tests, Second Language Learning, Screening Tests
Peer reviewed
Mooney, Paul; McCarter, Kevin S.; Russo, Robert J., Jr.; Blackwood, Danielle L. – Assessment for Effective Intervention, 2013
The purpose of this study was to evaluate technical adequacy features of an online adaptation of vocabulary matching known as critical content monitoring. Validity and reliability studies were conducted with a sample of 106 students from one school in fifth-grade science content. Participants were administered 20 parallel forms of the general…
Descriptors: Elementary School Students, Grade 5, Elementary School Science, Vocabulary
Peer reviewed
Polikoff, Morgan S. – American Journal of Education, 2015
Responding to federal policy and recent research, states and districts have developed and begun implementing multiple-measure teacher evaluation systems. These systems generally include observational and/or student survey measures of instructional quality alongside measures of teachers' contributions to student learning (e.g., value-added models…
Descriptors: Teacher Effectiveness, Student Surveys, Student Evaluation of Teacher Performance, Test Reliability
Stevens, Olinger; Leigh, Erika – ProQuest LLC, 2012
Scope and Method of Study: The purpose of the study is to use an empirical approach to identify a simple, economical, efficient, and technically adequate performance measure that teachers can use to assess student growth in mathematics. The current study has been designed to expand the body of research for math CBM to further examine technical…
Descriptors: Mathematics Instruction, Evaluation Methods, Student Evaluation, Measurement Techniques
Peer reviewed
Sawaki, Yasuyo; Sinharay, Sandip – ETS Research Report Series, 2013
This study investigates the value of reporting the reading, listening, speaking, and writing section scores for the TOEFL iBT® test, focusing on 4 related aspects of the psychometric quality of the TOEFL iBT section scores: reliability of the section scores, dimensionality of the test, presence of distinct score profiles, and the…
Descriptors: Scores, Computer Assisted Testing, Factor Analysis, Correlation
Peer reviewed
Fox, Janna; Fairbairn, Shelley – Language Testing, 2011
This article reviews Assessing Comprehension and Communication in English State-to-State for English Language Learners (ACCESS for ELLs®), which is a large-scale, high-stakes, standards-based, and criterion-referenced English language proficiency test administered in the USA annually to more than 840,000 English Language Learners (ELLs), in…
Descriptors: Test Preparation, Feedback (Response), Instructional Design, Testing Accommodations
Setzer, J. Carl – GED Testing Service, 2009
The GED® English as a Second Language (GED ESL) Test was designed to serve as an adjunct to the GED test battery when an examinee takes either the Spanish- or French-language version of the tests. The GED ESL Test is a criterion-referenced, multiple-choice instrument that assesses the functional, English reading skills of adults whose first…
Descriptors: Language Tests, High School Equivalency Programs, Psychometrics, Reading Skills
Robertson, Gary J. – 1981
Some fundamental concepts of criterion referenced test (CRT) reliability are highlighted. Emphasis is given to the procedures for determining reliability of scores for individual pupils because this is an area requiring increased awareness by classroom teachers and practitioners. Reliability issues encountered in the evaluation of instructional…
Descriptors: Criterion Referenced Tests, Reading Tests, Scores, Test Reliability
Subkoviak, Michael J. – 1976
A number of different definitions and indices of reliability for mastery tests have recently been proposed in an attempt to cope with possible lack of score variability that attenuates traditional coefficients. One promising index that has been suggested is the proportion of students in a group that are consistently assigned to the same mastery…
Descriptors: Criterion Referenced Tests, Mastery Tests, Mathematical Models, Scores
Nitko, Anthony J. – New Directions for Testing and Measurement, 1980
Criterion-referencing is a way to enhance the interpretation of test scores by referencing them to well-defined behavior domains. Behavior domains may be ordered or unordered; several varieties of criterion-referenced tests within each of these types are discussed. (Author/RL)
Descriptors: Classification, Criterion Referenced Tests, Scaling, Scores
Peer reviewed
Kane, Michael T. – Journal of Educational Measurement, 1986
These analyses suggest that if a criterion-referenced test had a reliability (defined in terms of internal consistency) below 0.5, a simple a priori procedure would provide better estimates of students' universe scores than would individual observed scores. (Author/LMO)
Descriptors: Criterion Referenced Tests, Educational Research, Error of Measurement, Generalizability Theory
Peer reviewed
Millman, Jason; Popham, W. James – Journal of Educational Measurement, 1974
The use of the regression equation derived from the Anglo-American sample to predict grades of Mexican-American students resulted in overprediction. An examination of the standardized regression weights revealed a significant difference in the weight given to the Scholastic Aptitude Test Mathematics Score. (Author/BB)
Descriptors: Criterion Referenced Tests, Item Analysis, Predictive Validity, Scores
Dimitrov, Dimiter M. – 1996
A Monte Carlo approach is proposed, using the Statistical Analysis System (SAS) programming language, for estimating reliability coefficients in generalizability theory studies. Test scores are generated by a probabilistic model that considers the probability for a person with a given ability score to answer an item with a given difficulty…
Descriptors: Classification, Criterion Referenced Tests, Cutting Scores, Estimation (Mathematics)