NotesFAQContact Us
Collection
Advanced
Search Tips
Laws, Policies, & Programs
No Child Left Behind Act 20011
What Works Clearinghouse Rating
Showing 1 to 15 of 51 results Save | Export
National Center on Improving Literacy, 2022
There are many available screeners for reading and other education or social-emotional outcomes. This brief outlines important things to consider when choosing and using a screener.
Descriptors: Screening Tests, Literacy, Social Emotional Learning, Decision Making
Ramsey Lee Cardwell – ProQuest LLC, 2022
The emergence of digital-first assessments is prompting reconsideration of, and innovation in, aspects of psychometrics, test validation, and test use. Using the Duolingo English Test (DET) as an example, this three-paper series seeks to address issues concerning the estimation of classification consistency and the reporting of results for such…
Descriptors: Classification, Reliability, Language Proficiency, Computer Assisted Testing
Peer reviewed Peer reviewed
Direct linkDirect link
Kane, Michael T. – Journal of Educational Measurement, 2013
To validate an interpretation or use of test scores is to evaluate the plausibility of the claims based on the scores. An argument-based approach to validation suggests that the claims based on the test scores be outlined as an argument that specifies the inferences and supporting assumptions needed to get from test responses to score-based…
Descriptors: Test Interpretation, Validity, Scores, Test Use
Peer reviewed Peer reviewed
Direct linkDirect link
Bennett, Jessica G.; Gardner, Ralph, III; Rizzi, Gleides Lopes – American Annals of the Deaf, 2013
Strong correlations exist between signed and/or spoken English and the literacy skills of deaf and hard of hearing students. Assessments that are both valid and reliable are key for researchers and practitioners investigating the signed and/or spoken English skills of signing populations. The authors conducted a literature review to explore which…
Descriptors: Deafness, Hearing Impairments, Sign Language, Language Skills
Peer reviewed Peer reviewed
Direct linkDirect link
Douglas, Karen M.; Mislevy, Robert J. – Journal of Educational and Behavioral Statistics, 2010
Important decisions about students are made by combining multiple measures using complex decision rules. Although methods for characterizing the accuracy of decisions based on a single measure have been suggested by numerous researchers, such methods are not useful for estimating the accuracy of decisions based on multiple measures. This study…
Descriptors: Educational Development, Test Use, Classification, Computation
Herman, Joan L.; Osmundson, Ellen; Dietel, Ronald – Assessment and Accountability Comprehensive Center, 2010
This report describes the purposes of benchmark assessments and provides recommendations for selecting and using benchmark assessments--addressing validity, alignment, reliability, fairness and bias and accessibility, instructional sensitivity, utility, and reporting issues. We also present recommendations on building capacity to support schools'…
Descriptors: Multiple Choice Tests, Test Items, Benchmarking, Educational Assessment
Ostrander, Diane L.; Henry, Carolyn S. – 1993
A modification of the Family Adaptation Scale of Antonovsky and Sourani (1988), was developed for assessing the adaptation of ministers' families. A sample of 317 individuals (ministers, spouses, and children aged 8 to 18) from 135 protestant ministers' families was used to test the scale. The self-report questionnaire was tested for internal…
Descriptors: Adjustment (to Environment), Children, Clergy, Concurrent Validity
Peer reviewed Peer reviewed
Feldt, Leonard S. – Applied Measurement in Education, 1997
It has often been asserted that the reliability of a measure places an upper limit on its validity. This article demonstrates in theory that validity can rise when reliability declines, even when validity evidence is a correlation with an acceptable criterion. Whether empirical examples can actually be found is an open question. (SLD)
Descriptors: Correlation, Criteria, Reliability, Test Construction
Peer reviewed Peer reviewed
Fisher, Anne G.; Bryze, Kimberly; Atchison, Bradley T. – Journal of Outcome Measurement, 2000
Studied rater reliability, internal scale validity, and person response validity of the School Assessment of Motor and Process Skills (School AMPS) using results for 208 elementary school students, some with educationally related disabilities. Results support rater reliability, scale validity, and person response validity of the School AMPS as a…
Descriptors: Disabilities, Elementary Education, Elementary School Students, Reliability
Denham, Thomas J. – 2002
This paper describes the Myers-Briggs Type Indicator (MBTI), developed by I. Myers and K. Briggs (1940s) to assess personality type. Based on Jungian theory, the MBTI has become a tool for identifying the 16 different patterns of action into which every person fits. The 16 personality types are based on patterns of: (1) extraversion-introversion;…
Descriptors: Educational Testing, Personality Assessment, Personality Measures, Personality Traits
Peer reviewed Peer reviewed
Mills, Jeremy F.; Kroner, Daryl G.; Forth, Adelle E. – Assessment, 1998
The reliability and validity of the Novaco Anger Scale (NAS) (R. Novaco, 1994) were studied with 204 male correctional offenders admitted for general or violent offenses. Results show the NAS to be an effective measure of anger in an offender population. Results also support the validity of a computerized version of the NAS. (SLD)
Descriptors: Anger, Computer Assisted Testing, Males, Measurement Techniques
Peer reviewed Peer reviewed
Mehrens, William A. – Applied Measurement in Education, 2000
Presents conclusions of an independent measurement expert that the Texas Assessment of Academic Skills (TAAS) was constructed according to acceptable professional standards and tests curricular material considered by the Texas Board of Education important for graduates to have mastered. Also supports the validity and reliability of the TAAS and…
Descriptors: Curriculum, Psychometrics, Reliability, Standards
Peer reviewed Peer reviewed
Direct linkDirect link
Bachman, Lyle F. – Language Assessment Quarterly, 2005
The fields of language testing and educational and psychological measurement have not, as yet, developed a set of principles and procedures for linking test scores and score-based inferences to test use and the consequences of test use. Although Messick (1989) discusses test use and consequences, his framework provides virtually no guidance on how…
Descriptors: Test Use, Testing, Language Tests, Validity
American Educational Research Association, Washington, DC. – 1999
The standards outlined in this book have been developed to provide criteria for the evaluation of tests, testing practices, and the effects of test use. The "Standards" provides a frame of reference to ensure that relevant issues are addressed. The first part of the book, "Test Construction, Evaluation, and Documentation,"…
Descriptors: Educational Testing, Evaluation Methods, Psychological Testing, Reliability
Peer reviewed Peer reviewed
Kane, Michael – Applied Measurement in Education, 1996
This overview of the role of error and tolerance for error in measurement asserts that the generic precision associated with a measurement procedure is defined as the root mean square error, or standard error, in some relevant population. This view of precision is explored in several applications of measurement. (SLD)
Descriptors: Error of Measurement, Error Patterns, Generalizability Theory, Measurement Techniques
Previous Page | Next Page ยป
Pages: 1  |  2  |  3  |  4