ERIC - Search Results

Publication Date

In 2026	0
Since 2025	0
Since 2022 (last 5 years)	2
Since 2017 (last 10 years)	2
Since 2007 (last 20 years)	8

Descriptor

Reliability	68
Test Use	68
Validity	43
Elementary Secondary Education	19
Evaluation Methods	17
Test Construction	17
Educational Assessment	14
Scores	14
Student Evaluation	12
Psychometrics	10
Measurement Techniques	9
Higher Education	8
Performance Based Assessment	8
Test Interpretation	8
Foreign Countries	7
Testing Programs	7
Comparative Analysis	6
Educational Testing	6
Personality Measures	6
Test Format	6
Classification	5
College Students	5
Construct Validity	5
Educational Research	5
High Stakes Tests	5
More ▼

Publication Type

Journal Articles	35
Reports - Research	28
Reports - Evaluative	16
Speeches/Meeting Papers	14
Reports - Descriptive	8
Books	6
Guides - Non-Classroom	6
Information Analyses	5
Opinion Papers	3
Collected Works - Proceedings	1
Collected Works - Serials	1
Dissertations/Theses -…	1
Guides - Classroom - Teacher	1
Guides - General	1
Legal/Legislative/Regulatory…	1
Numerical/Quantitative Data	1
Reports - General	1
More ▼

Education Level

Elementary Secondary Education	2
High Schools	1
Higher Education	1
Postsecondary Education	1

Audience

Practitioners	5
Teachers	4
Administrators	2
Students	1

Location

Netherlands	2
Australia	1
Louisiana	1
New York	1
United Kingdom	1
United Kingdom (Northern…	1
United States	1

Laws, Policies, & Programs

No Child Left Behind Act 2001

Assessments and Surveys

Myers Briggs Type Indicator	2
Clinical Evaluation of…	1
Expressive One Word Picture…	1
Kaufman Assessment Battery…	1
Learning Style Inventory	1
Learning and Study Strategies…	1
Millon Clinical Multiaxial…	1
Minnesota Multiphasic…	1
Peabody Picture Vocabulary…	1
Raven Progressive Matrices	1
Self Directed Search	1
Texas Assessment of Academic…	1
More ▼

What Works Clearinghouse Rating

Showing 1 to 15 of 68 results Save | Export

Core Considerations for Selecting a Screener. Improving Literacy Brief

Direct link

National Center on Improving Literacy, 2022

There are many available screeners for reading and other education or social-emotional outcomes. This brief outlines important things to consider when choosing and using a screener.

Descriptors: Screening Tests, Literacy, Social Emotional Learning, Decision Making

Classification Consistency and Results Reporting of a Digital-First Computer-Adaptive Language Proficiency Test

Direct link

Ramsey Lee Cardwell – ProQuest LLC, 2022

The emergence of digital-first assessments is prompting reconsideration of, and innovation in, aspects of psychometrics, test validation, and test use. Using the Duolingo English Test (DET) as an example, this three-paper series seeks to address issues concerning the estimation of classification consistency and the reporting of results for such…

Descriptors: Classification, Reliability, Language Proficiency, Computer Assisted Testing

Validating the Interpretations and Uses of Test Scores

Peer reviewed

Direct link

Kane, Michael T. – Journal of Educational Measurement, 2013

To validate an interpretation or use of test scores is to evaluate the plausibility of the claims based on the scores. An argument-based approach to validation suggests that the claims based on the test scores be outlined as an argument that specifies the inferences and supporting assumptions needed to get from test responses to score-based…

Descriptors: Test Interpretation, Validity, Scores, Test Use

Deaf and Hard of Hearing Students' Through-the-Air English Skills: A Review of Formal Assessments

Peer reviewed

Direct link

Bennett, Jessica G.; Gardner, Ralph, III; Rizzi, Gleides Lopes – American Annals of the Deaf, 2013

Strong correlations exist between signed and/or spoken English and the literacy skills of deaf and hard of hearing students. Assessments that are both valid and reliable are key for researchers and practitioners investigating the signed and/or spoken English skills of signing populations. The authors conducted a literature review to explore which…

Descriptors: Deafness, Hearing Impairments, Sign Language, Language Skills

Psychometric Properties of Raw and Scale Scores on Mixed-Format Tests

Peer reviewed

Direct link

Kolen, Michael J.; Lee, Won-Chan – Educational Measurement: Issues and Practice, 2011

This paper illustrates that the psychometric properties of scores and scales that are used with mixed-format educational tests can impact the use and interpretation of the scores that are reported to examinees. Psychometric properties that include reliability and conditional standard errors of measurement are considered in this paper. The focus is…

Descriptors: Test Use, Test Format, Error of Measurement, Raw Scores

Estimating Classification Accuracy for Complex Decision Rules Based on Multiple Scores

Peer reviewed

Direct link

Douglas, Karen M.; Mislevy, Robert J. – Journal of Educational and Behavioral Statistics, 2010

Important decisions about students are made by combining multiple measures using complex decision rules. Although methods for characterizing the accuracy of decisions based on a single measure have been suggested by numerous researchers, such methods are not useful for estimating the accuracy of decisions based on multiple measures. This study…

Descriptors: Educational Development, Test Use, Classification, Computation

Defining and Measuring Dysphagia Following Stroke

Peer reviewed

Direct link

Daniels, Stephanie K.; Schroeder, Mae Fern; DeGeorge, Pamela C.; Corey, David M.; Foundas, Anne L.; Rosenbek, John C. – American Journal of Speech-Language Pathology, 2009

Purpose: To continue the development of a quantified, standard method to differentiate individuals with stroke and dysphagia from individuals without dysphagia. Method: Videofluoroscopic swallowing studies (VFSS) were completed on a group of participants with acute stroke (n = 42) and healthy age-matched individuals (n = 25). Calibrated liquid…

Descriptors: Control Groups, Test Use, Neurological Impairments, Evaluation Methods

Benchmark Assessment for Improved Learning. AACC Report

Download full text

Herman, Joan L.; Osmundson, Ellen; Dietel, Ronald – Assessment and Accountability Comprehensive Center, 2010

This report describes the purposes of benchmark assessments and provides recommendations for selecting and using benchmark assessments--addressing validity, alignment, reliability, fairness and bias and accessibility, instructional sensitivity, utility, and reporting issues. We also present recommendations on building capacity to support schools'…

Descriptors: Multiple Choice Tests, Test Items, Benchmarking, Educational Assessment

Assessment of Self-Reported Anger Expression in Youth.

Peer reviewed

Musante, Linda; Treiber, Frank A.; Davis, Harry C.; Thompson, William O.; Waller, Jennifer L. – Assessment, 1999

Findings related to internal consistency, temporal stability, and principal components structures suggest that the Anger Expression Scale (C. Spielberger and others, 1985) and the Pediatric Anger Expression Scale (G. Jacobs and others, 1989), studied with a sample of 415 youth with a mean age of 14.7 years are acceptably reliable. (SLD)

Descriptors: Adolescents, Anger, Factor Structure, Reliability

The Problem of Negative Reliabilities.

Peer reviewed

Krus, David J.; Helmstadter, Gerald C. – Educational and Psychological Measurement, 1993

Negative coefficients of reliability, sometimes returned by the standard formula for estimation of the internal-consistency reliability, are neither theoretically nor numerically correct. Alternative strategies for test development in this special case are suggested. (Author)

Descriptors: Estimation (Mathematics), Reliability, Test Construction, Test Use

Can Validity Rise When Reliability Declines?

Peer reviewed

Feldt, Leonard S. – Applied Measurement in Education, 1997

It has often been asserted that the reliability of a measure places an upper limit on its validity. This article demonstrates in theory that validity can rise when reliability declines, even when validity evidence is a correlation with an acceptable criterion. Whether empirical examples can actually be found is an open question. (SLD)

Descriptors: Correlation, Criteria, Reliability, Test Construction

Measures of Consistency for Holland-Type Codes.

Peer reviewed

Strahan, Robert F. – Journal of Vocational Behavior, 1987

Describes two new measures of consistency which refer to the extent to which more closely related scale types are found together in Holland's Self-Directed Search sort. One measure is based on the hexagonal model for use with three-point codes. The other is based on conditional probabilities for use with two-point codes. (Author/ABL)

Descriptors: Data Analysis, Data Interpretation, Personality Measures, Reliability

A Critical Review of the Literature on Kolb's Learning Style Inventory with Implications for Score Reliability.

Download full text

Hwang, Dae-Yeop; Henson, Robin K. – 2002

The Learning Style Inventory (LSI; Kolb, 1976; 1985 ) is a commonly used measure of learning styles based on Kolbs Experiential Learning Model. The psychometric soundness of LSI scores has been critiqued historically. This study reviewed the literature on the LSI and evaluated the psychometric properties of Kolbs original and revised versions of…

Descriptors: Cognitive Style, Meta Analysis, Psychometrics, Reliability

Increasing the Reliability of Ability-Achievement Difference Scores: An Example Using the Kaufman Assessment Battery for Children.

Peer reviewed

Caruso, John C.; Witkiewitz, Katie – Journal of Educational Measurement, 2002

As an alternative to equally weighted difference scores, examined an orthogonal reliable component analysis (RCA) solution and an oblique principal components analysis (PCA) solution for the standardization sample of the Kaufman Assessment Battery for Children (KABC; A. Kaufman and N. Kaufman, 1983). Discusses the practical implications of the…

Descriptors: Ability, Academic Achievement, Children, Factor Analysis

Naturalistic Assessment of Functional Performance in School Settings: Reliability and Validity of the School AMPS Scales.

Peer reviewed

Fisher, Anne G.; Bryze, Kimberly; Atchison, Bradley T. – Journal of Outcome Measurement, 2000

Studied rater reliability, internal scale validity, and person response validity of the School Assessment of Motor and Process Skills (School AMPS) using results for 208 elementary school students, some with educationally related disabilities. Results support rater reliability, scale validity, and person response validity of the School AMPS as a…

Descriptors: Disabilities, Elementary Education, Elementary School Students, Reliability

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5

Applied Measurement in…	4
Educational and Psychological…	3
Journal of Outcome Measurement	3
Studies in Educational…	3
Assessment	2
Educational Measurement:…	2
Journal of Educational…	2
American Annals of the Deaf	1
American Journal of Education	1
American Journal of…	1
Assessing Writing	1
Assessment and Accountability…	1
Australian Journal of…	1
Cognitive Psychology	1
Educational Assessment	1
Educational Researcher	1
Evaluation Comment	1
Evaluation and the Health…	1
Journal of Educational and…	1
Journal of Experimental…	1
Journal of Research in Reading	1
Journal of Technology,…	1
Journal of Vocational Behavior	1
Language Assessment Quarterly	1
National Center on Improving…	1
More ▼

Thompson, Bruce	4
Fisher, Anne G.	2
Mott, Michael S.	2
Archer, Robert P.	1
Arnau, Randolph C.	1
Atchison, Bradley T.	1
Bachman, Lyle F.	1
Bennett, Jessica G.	1
Bracey, Gerald W.	1
Brennan, Robert L.	1
Bryze, Kimberly	1
Buras, Avery R.	1
Caruso, John C.	1
Cecil, Heather	1
Chase, Clinton I.	1
Corey, David M.	1
Cowan, Pamela	1
Crone, Linda J.	1
Daniels, Stephanie K.	1
Davis, Harry C.	1
DeGeorge, Pamela C.	1
Denham, Thomas J.	1
Dietel, Ronald	1
Dietz, Thomas	1
More ▼