ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	2
Since 2016 (last 10 years)	3
Since 2006 (last 20 years)	9

Descriptor

Criterion Referenced Tests	38
Scores	38
Test Reliability	38
Test Validity	22
Test Construction	17
Test Interpretation	11
Norm Referenced Tests	9
Statistical Analysis	9
Language Tests	8
English (Second Language)	7
Item Analysis	7
Measurement Techniques	7
Standardized Tests	7
Achievement Tests	6
Correlation	6
Elementary Secondary Education	6
Error of Measurement	6
Psychometrics	6
Cutting Scores	5
Testing	5
Comparative Analysis	4
Curriculum Based Assessment	4
Elementary School Students	4
Evaluation Methods	4
Higher Education	4
More ▼

Source

Assessment for Effective…	2
Journal of Educational…	2
Language Testing	2
American Journal of Education	1
ETS Research Report Series	1
Edinburgh Working Papers in…	1
Evaluation and the Health…	1
GED Testing Service	1
International Journal for…	1
Journal of Early Intervention	1
New Directions for Testing…	1
ProQuest LLC	1
More ▼

Publication Type

Reports - Research	23
Journal Articles	12
Speeches/Meeting Papers	8
Reports - Evaluative	3
Collected Works - Proceedings	1
Dissertations/Theses -…	1
Guides - General	1
Guides - Non-Classroom	1
Information Analyses	1
Opinion Papers	1
Reports - Descriptive	1
More ▼

Education Level

Elementary Education	5
Early Childhood Education	3
Primary Education	2
Secondary Education	2
Grade 1	1
Grade 3	1
Grade 5	1
Grade 8	1
High Schools	1
Higher Education	1
Intermediate Grades	1
Junior High Schools	1
Kindergarten	1
Middle Schools	1
Postsecondary Education	1
More ▼

Audience

Parents

Location

Colorado	2
Tennessee	2
Delaware	1
Florida	1
Hawaii	1
Illinois	1
Iran	1
Kansas	1
Kentucky	1
Louisiana	1
New Jersey	1
New York	1
North Carolina	1
Ohio	1
Texas	1
More ▼

Laws, Policies, & Programs

Elementary and Secondary…	1
Race to the Top	1

Assessments and Surveys

California Achievement Tests	2
Battelle Developmental…	1
General Educational…	1
National Assessment of…	1
Test of English as a Foreign…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 38 results Save | Export

Development and Technical Adequacy of Instructionally Relevant Vocabulary Measures for Young Students

Peer reviewed

Direct link

Parker, David C.; Stewart, Lisa H.; Thomson, Susan; Kaminski, Ruth A. – Assessment for Effective Intervention, 2021

Vocabulary skills are important for overall reading competence, but vocabulary assessment approaches that inform instructional decision-making and are sensitive to improvement are limited. This article describes a process for developing vocabulary measures designed to facilitate data-driven decision-making for kindergarten and first-grade students…

Descriptors: Vocabulary, Kindergarten, Grade 1, Elementary School Students

Reliability and Validity Evidence for the Hawaii Early Learning Profile, Birth-3 Years

Peer reviewed

Direct link

Li, Zijia; Gooden, Caroline; Toland, Michael D. – Journal of Early Intervention, 2019

This study provides preliminary evidence for reliability and validity of the Hawaii Early Learning Profile Strands 0-3 (HELP Strands 0-3), an assessment instrument for young children. First, the degree of interobserver agreement for a sample of representative HELP items was examined; results indicated that HELP scoring was dependable and…

Descriptors: Measures (Individuals), Psychometrics, Early Childhood Education, Test Reliability

CBM Language Measures as Indicators of Foreign-Language Learning: Technical Adequacy of Scores for Secondary-School Students

Peer reviewed

Direct link

Hoefnagel, Laura; Espin, Christine A.; Rippe, Ralph – International Journal for Research in Learning Disabilities, 2021

Students with and without learning disabilities often struggle to learn a foreign language (FL). Teachers could benefit from a measure designed to screen and identify students at risk for FL learning difficulties. In this study, we examined the reliability and validity of scores from four curriculum-based measures (CBM) as potential indicators of…

Descriptors: Curriculum Based Assessment, Language Tests, Second Language Learning, Screening Tests

Examining an Online Content General Outcome Measure: Technical Features of the Static Score

Peer reviewed

Direct link

Mooney, Paul; McCarter, Kevin S.; Russo, Robert J., Jr.; Blackwood, Danielle L. – Assessment for Effective Intervention, 2013

The purpose of this study was to evaluate technical adequacy features of an online adaptation of vocabulary matching known as critical content monitoring. Validity and reliability studies were conducted with a sample of 106 students from one school in fifth-grade science content. Participants were administered 20 parallel forms of the general…

Descriptors: Elementary School Students, Grade 5, Elementary School Science, Vocabulary

The Stability of Observational and Student Survey Measures of Teaching Effectiveness

Peer reviewed

Direct link

Polikoff, Morgan S. – American Journal of Education, 2015

Responding to federal policy and recent research, states and districts have developed and begun implementing multiple-measure teacher evaluation systems. These systems generally include observational and/or student survey measures of instructional quality alongside measures of teachers' contributions to student learning (e.g., value-added models…

Descriptors: Teacher Effectiveness, Student Surveys, Student Evaluation of Teacher Performance, Test Reliability

Mathematics Curriculum Based Measurement to Predict State Test Performance: A Comparison of Measures and Methods

Direct link

Stevens, Olinger; Leigh, Erika – ProQuest LLC, 2012

Scope and Method of Study: The purpose of the study is to use an empirical approach to identify a simple, economical, efficient, and technically adequate performance measure that teachers can use to assess student growth in mathematics. The current study has been designed to expand the body of research for math CBM to further examine technical…

Descriptors: Mathematics Instruction, Evaluation Methods, Student Evaluation, Measurement Techniques

Investigating the Value of Section Scores for the "TOEFL iBT"® Test. "TOEFL iBT"® Research Report. TOEFL iBT-21. ETS Research Report RR-13-35

Peer reviewed
PDF on ERIC

Download full text

Sawaki, Yasuyo; Sinharay, Sandip – ETS Research Report Series, 2013

This study investigates the value of reporting the reading, listening, speaking, and writing section scores for the "TOEFL iBT"® test, focusing on 4 related aspects of the psychometric quality of the TOEFL iBT section scores: reliability of the section scores, dimensionality of the test, presence of distinct score profiles, and the…

Descriptors: Scores, Computer Assisted Testing, Factor Analysis, Correlation

Test Review: ACCESS for ELLs[R]

Peer reviewed

Direct link

Fox, Janna; Fairbairn, Shelley – Language Testing, 2011

This article reviews Assessing Comprehension and Communication in English State-to-State for English Language Learners ("ACCESS for ELLs"[R]), which is a large-scale, high-stakes, standards-based, and criterion-referenced English language proficiency test administered in the USA annually to more than 840,000 English Language Learners (ELLs), in…

Descriptors: Test Preparation, Feedback (Response), Instructional Design, Testing Accommodations

Reliability and Validity Evidence for the GED[R] English as a Second Language Test. GED Testing Service[R] Research Studies, 2009-4

Download full text

Setzer, J. Carl – GED Testing Service, 2009

The GED[R] English as a Second Language (GED ESL) Test was designed to serve as an adjunct to the GED test battery when an examinee takes either the Spanish- or French-language version of the tests. The GED ESL Test is a criterion-referenced, multiple-choice instrument that assesses the functional, English reading skills of adults whose first…

Descriptors: Language Tests, High School Equivalency Programs, Psychometrics, Reading Skills

Assessing Reliability of Criterion-Referenced Instruments.

Robertson, Gary J. – 1981

Some fundamental concepts of criterion referenced test (CRT) reliability are highlighted. Emphasis is given to the procedures for determining reliability of scores for individual pupils because this is an area requiring increased awareness by classroom teachers and practitioners. Reliability issues encountered in the evaluation of instructional…

Descriptors: Criterion Referenced Tests, Reading Tests, Scores, Test Reliability

Estimating Reliability from a Single Administration of a Mastery Test.

Download full text

Subkoviak, Michael J. – 1976

A number of different definitions and indices of reliability for mastery tests have recently been proposed in an attempt to cope with possible lack of score variability that attenuates traditional coefficients. One promising index that has been suggested is the proportion of students in a group that are consistently assigned to the same mastery…

Descriptors: Criterion Referenced Tests, Mastery Tests, Mathematical Models, Scores

Criterion-Referencing Schemes.

Nitko, Anthony J. – New Directions for Testing and Measurement, 1980

Criterion-referencing is a way to enhance the interpretation of test scores by referencing them to well-defined behavior domains. Behavior domains may be ordered or unordered; several varieties of criterion-referenced tests within each of these types are discussed. (Author/RL)

Descriptors: Classification, Criterion Referenced Tests, Scaling, Scores

The Role of Reliability in Criterion-Referenced Tests.

Peer reviewed

Kane, Michael T. – Journal of Educational Measurement, 1986

These analyses suggest that if a criterion-referenced test had a reliability (defined in terms of internal consistency) below 0.5, a simple a priori procedure would provide better estimates of students' universe scores than would individual observed scores. (Author/LMO)

Descriptors: Criterion Referenced Tests, Educational Research, Error of Measurement, Generalizability Theory

The Issue of Item and Test Variance for Criterion-Referenced Tests: A Clarification

Peer reviewed

Millman, Jason; Popham, W. James – Journal of Educational Measurement, 1974

The use of the regression equation derived from the Anglo-American sample to predict grades of Mexican-American students resulted in overprediction. An examination of the standardized regression weights revealed a significant difference in the weight given to the Scholastic Aptitude Test Mathematics Score. (Author/BB)

Descriptors: Criterion Referenced Tests, Item Analysis, Predictive Validity, Scores

Monte Carlo Approach for Reliability Estimations in Generalizability Studies.

Download full text

Dimitrov, Dimiter M. – 1996

A Monte Carlo approach is proposed, using the Statistical Analysis System (SAS) programming language, for estimating reliability coefficients in generalizability theory studies. Test scores are generated by a probabilistic model that considers the probability for a person with a given ability score to answer an item with a given difficulty…

Descriptors: Classification, Criterion Referenced Tests, Cutting Scores, Estimation (Mathematics)

Previous Page | Next Page »

Pages: 1 | 2 | 3

Hambleton, Ronald K.	2
Millman, Jason	2
Roudabush, Glenn E.	2
Allen, Nancy L.	1
Bergquist, Constance	1
Blackwood, Danielle L.	1
Bormuth, John R.	1
Brennan, Robert L.	1
Chan, James Y.	1
Dimitrov, Dimiter M.	1
Espin, Christine A.	1
Fairbairn, Shelley	1
Fink, Arlene	1
Fox, Janna	1
Ghonsooly, Behzad	1
Gooden, Caroline	1
Graham, Darol L.	1
Hoefnagel, Laura	1
Hua, Te-Fang	1
Isham, Steven P.	1
Izard, J. F.	1
Kaminski, Ruth A.	1
Kane, Michael	1
Kane, Michael T.	1
More ▼