ERIC - Search Results

Showing 1 to 15 of 51 results

Core Considerations for Selecting a Screener. Improving Literacy Brief

National Center on Improving Literacy, 2022

There are many available screeners for reading and other education or social-emotional outcomes. This brief outlines important things to consider when choosing and using a screener.

Descriptors: Screening Tests, Literacy, Social Emotional Learning, Decision Making

Classification Consistency and Results Reporting of a Digital-First Computer-Adaptive Language Proficiency Test

Ramsey Lee Cardwell – ProQuest LLC, 2022

The emergence of digital-first assessments is prompting reconsideration of, and innovation in, aspects of psychometrics, test validation, and test use. Using the Duolingo English Test (DET) as an example, this three-paper series seeks to address issues concerning the estimation of classification consistency and the reporting of results for such…

Descriptors: Classification, Reliability, Language Proficiency, Computer Assisted Testing

Validating the Interpretations and Uses of Test Scores

Peer reviewed

Kane, Michael T. – Journal of Educational Measurement, 2013

To validate an interpretation or use of test scores is to evaluate the plausibility of the claims based on the scores. An argument-based approach to validation suggests that the claims based on the test scores be outlined as an argument that specifies the inferences and supporting assumptions needed to get from test responses to score-based…

Descriptors: Test Interpretation, Validity, Scores, Test Use

Deaf and Hard of Hearing Students' Through-the-Air English Skills: A Review of Formal Assessments

Peer reviewed

Bennett, Jessica G.; Gardner, Ralph, III; Rizzi, Gleides Lopes – American Annals of the Deaf, 2013

Strong correlations exist between signed and/or spoken English and the literacy skills of deaf and hard of hearing students. Assessments that are both valid and reliable are key for researchers and practitioners investigating the signed and/or spoken English skills of signing populations. The authors conducted a literature review to explore which…

Descriptors: Deafness, Hearing Impairments, Sign Language, Language Skills

Estimating Classification Accuracy for Complex Decision Rules Based on Multiple Scores

Peer reviewed

Douglas, Karen M.; Mislevy, Robert J. – Journal of Educational and Behavioral Statistics, 2010

Important decisions about students are made by combining multiple measures using complex decision rules. Although methods for characterizing the accuracy of decisions based on a single measure have been suggested by numerous researchers, such methods are not useful for estimating the accuracy of decisions based on multiple measures. This study…

Descriptors: Educational Development, Test Use, Classification, Computation

Benchmark Assessment for Improved Learning. AACC Report

Herman, Joan L.; Osmundson, Ellen; Dietel, Ronald – Assessment and Accountability Comprehensive Center, 2010

This report describes the purposes of benchmark assessments and provides recommendations for selecting and using benchmark assessments--addressing validity, alignment, reliability, fairness and bias and accessibility, instructional sensitivity, utility, and reporting issues. We also present recommendations on building capacity to support schools'…

Descriptors: Multiple Choice Tests, Test Items, Benchmarking, Educational Assessment

Measuring Adaptation in Ministers' Families: The Modified Family Adaptation Scale.

Ostrander, Diane L.; Henry, Carolyn S. – 1993

A modification of the Family Adaptation Scale of Antonovsky and Sourani (1988) was developed for assessing the adaptation of ministers' families. A sample of 317 individuals (ministers, spouses, and children aged 8 to 18) from 135 Protestant ministers' families was used to test the scale. The self-report questionnaire was tested for internal…

Descriptors: Adjustment (to Environment), Children, Clergy, Concurrent Validity

Can Validity Rise When Reliability Declines?

Peer reviewed

Feldt, Leonard S. – Applied Measurement in Education, 1997

It has often been asserted that the reliability of a measure places an upper limit on its validity. This article demonstrates in theory that validity can rise when reliability declines, even when validity evidence is a correlation with an acceptable criterion. Whether empirical examples can actually be found is an open question. (SLD)

Descriptors: Correlation, Criteria, Reliability, Test Construction

Naturalistic Assessment of Functional Performance in School Settings: Reliability and Validity of the School AMPS Scales.

Peer reviewed

Fisher, Anne G.; Bryze, Kimberly; Atchison, Bradley T. – Journal of Outcome Measurement, 2000

Studied rater reliability, internal scale validity, and person response validity of the School Assessment of Motor and Process Skills (School AMPS) using results for 208 elementary school students, some with educationally related disabilities. Results support rater reliability, scale validity, and person response validity of the School AMPS as a…

Descriptors: Disabilities, Elementary Education, Elementary School Students, Reliability

A Technical Review of the Myers-Briggs Type Indicator™.

Denham, Thomas J. – 2002

This paper describes the Myers-Briggs Type Indicator (MBTI), developed by I. Myers and K. Briggs (1940s) to assess personality type. Based on Jungian theory, the MBTI has become a tool for identifying the 16 different patterns of action into which every person fits. The 16 personality types are based on patterns of: (1) extraversion-introversion;…

Descriptors: Educational Testing, Personality Assessment, Personality Measures, Personality Traits

Novaco Anger Scale: Reliability and Validity within Adult Criminal Sample.

Peer reviewed

Mills, Jeremy F.; Kroner, Daryl G.; Forth, Adelle E. – Assessment, 1998

The reliability and validity of the Novaco Anger Scale (NAS) (R. Novaco, 1994) were studied with 204 male correctional offenders admitted for general or violent offenses. Results show the NAS to be an effective measure of anger in an offender population. Results also support the validity of a computerized version of the NAS. (SLD)

Descriptors: Anger, Computer Assisted Testing, Males, Measurement Techniques

Defending a State Graduation Test: "GI Forum v. Texas Education Agency." Measurement Perspectives from an External Evaluator.

Peer reviewed

Mehrens, William A. – Applied Measurement in Education, 2000

Presents conclusions of an independent measurement expert that the Texas Assessment of Academic Skills (TAAS) was constructed according to acceptable professional standards and tests curricular material considered by the Texas Board of Education important for graduates to have mastered. Also supports the validity and reliability of the TAAS and…

Descriptors: Curriculum, Psychometrics, Reliability, Standards

Building and Supporting a Case for Test Use

Peer reviewed

Bachman, Lyle F. – Language Assessment Quarterly, 2005

The fields of language testing and educational and psychological measurement have not, as yet, developed a set of principles and procedures for linking test scores and score-based inferences to test use and the consequences of test use. Although Messick (1989) discusses test use and consequences, his framework provides virtually no guidance on how…

Descriptors: Test Use, Testing, Language Tests, Validity

Standards for Educational and Psychological Testing.

American Educational Research Association, Washington, DC. – 1999

The standards outlined in this book have been developed to provide criteria for the evaluation of tests, testing practices, and the effects of test use. The "Standards" provides a frame of reference to ensure that relevant issues are addressed. The first part of the book, "Test Construction, Evaluation, and Documentation,"…

Descriptors: Educational Testing, Evaluation Methods, Psychological Testing, Reliability

The Precision of Measurements.

Peer reviewed

Kane, Michael – Applied Measurement in Education, 1996

This overview of the role of error and tolerance for error in measurement asserts that the generic precision associated with a measurement procedure is defined as the root mean square error, or standard error, in some relevant population. This view of precision is explored in several applications of measurement. (SLD)

Descriptors: Error of Measurement, Error Patterns, Generalizability Theory, Measurement Techniques
