Baldwin, Peter; Clauser, Brian E. – Journal of Educational Measurement, 2022
While score comparability across test forms typically relies on common (or randomly equivalent) examinees or items, innovations in item formats, test delivery, and efforts to extend the range of score interpretation may require a special data collection before examinees or items can be used in this way--or may be incompatible with common examinee…
Descriptors: Scoring, Testing, Test Items, Test Format
Patrick Kyllonen; Amit Sevak; Teresa Ober; Ikkyu Choi; Jesse Sparks; Daniel Fishtein – ETS Research Report Series, 2024
Assessment refers to a broad array of approaches for measuring or evaluating a person's (or group of persons') skills, behaviors, dispositions, or other attributes. Assessments range from standardized tests used in admissions, employee selection, licensure examinations, and domestic and international large-scale assessments of cognitive and…
Descriptors: Assessment Literacy, Testing, Test Bias, Test Construction
Boyer, Michelle; Dadey, Nathan; Keng, Leslie – National Center for the Improvement of Educational Assessment, 2020
This school year, every state education agency (SEA) is faced with unprecedented, COVID-19-related challenges for the implementation of 2021 statewide summative assessments. The two overarching challenges are how tests will be administered and how scores will be interpreted and used, with many intervening and related challenges. Test…
Descriptors: State Departments of Education, Summative Evaluation, Educational Planning, Educational Strategies
Cormier, Damien C.; Bulut, Okan; McGrew, Kevin S.; Kennedy, Kathleen – Journal of Intelligence, 2022
Consideration of the influence of English language skills during testing is an understandable requirement for fair and valid cognitive test interpretation. Several professional standards and expert recommendations exist to guide psychologists as they attempt to engage in best practices when assessing English learners (ELs). Nonetheless, relatively…
Descriptors: Language Tests, English (Second Language), Second Language Learning, Culture Fair Tests
Reynolds, Matthew R.; Niileksela, Christopher R. – Journal of Psychoeducational Assessment, 2015
"The Woodcock-Johnson IV Tests of Cognitive Abilities" (WJ IV COG) is an individually administered measure of psychometric intellectual abilities designed for ages 2 to 90+. The measure was published by Houghton Mifflin Harcourt-Riverside in 2014. Fredrick Schrank, Kevin McGrew, and Nancy Mather are the authors. Richard Woodcock, the…
Descriptors: Cognitive Tests, Testing, Scoring, Test Interpretation
Fraccaro, Rebecca L.; Stelnicki, Andrea M.; Nordstokke, David W. – Canadian Journal of School Psychology, 2015
Anxiety disorders are among the most prevalent mental disorders among school-age children and can lead to impaired academic and social functioning (Keeley & Storch, 2009). Unfortunately, anxiety disorders in this population are often undetected (Herzig-Anderson, Colognori, Fox, Stewart, & Warner, 2012). The availability of psychometrically…
Descriptors: Anxiety, Measures (Individuals), Symptoms (Individual Disorders), Testing
Kopriva, Rebecca J.; Thurlow, Martha L.; Perie, Marianne; Lazarus, Sheryl S.; Clark, Amy – Educational Psychologist, 2016
This article argues that test takers are as integral to determining validity of test scores as defining target content and conditioning inferences on test use. A principled sustained attention to how students interact with assessment opportunities is essential, as is a principled sustained evaluation of evidence confirming the validity or calling…
Descriptors: Tests, Testing, Test Interpretation, Scores
Kranzler, John H.; Benson, Nicholas; Floyd, Randy G. – International Journal of School & Educational Psychology, 2016
This article briefly reviews the history of intellectual assessment of children and youth in the United States of America, as well as current practices and future directions. Although administration of intelligence tests in the schools has been a longstanding practice in the United States, their use has also elicited sharp controversy over time.…
Descriptors: Intelligence Tests, Children, Youth, Test Construction
Henig, Jeffrey R. – Teachers College Record, 2013
Background/Context: Validity issues are often discussed in technical terms, but the context changes when measures enter broad public debate, and a wider range of interests come into play. Purpose: This article, part of a special section of TCR, considers the political dimensions of validity questions as raised by a keynote address and panel…
Descriptors: Testing, Politics of Education, Test Validity, Expertise
Pommerich, Mary – Educational Measurement: Issues and Practice, 2012
Neil Dorans has made a career of advocating for the examinee. He continues to do so in his NCME career award address, providing a thought-provoking commentary on some current trends in educational measurement that could potentially affect the integrity of test scores. Concerns expressed in the address call attention to a conundrum that faces…
Descriptors: Testing, Scores, Measurement, Test Construction
Johnson, Sandra – Routledge, Taylor & Francis Group, 2011
"Assessing Learning in the Primary Classroom" is an accessible introduction to the concepts critical to a professional understanding of this vital aspect of a teacher's role. It comprehensively considers the principles underpinning effective assessment, the different forms it can take and the different purposes it serves, both within and beyond…
Descriptors: Student Evaluation, Elementary Education, Educational Assessment, Validity
Bailey, Jennifer; Little, Chelsea; Rigney, Rex; Thaler, Anna; Weiderman, Ken; Yorkovich, Ben – Online Submission, 2010
This handbook is designed as a quick reference for first-year teachers who find themselves in an assessment driven environment with little experience to help make sense of the language, underlying philosophy, or organizational structure of the assessment system. The handbook begins with advice on developing and evaluating effective learning…
Descriptors: Student Evaluation, Portfolio Assessment, Elementary Secondary Education, Performance Based Assessment
Johnston, Thomas J. – Educational Technology, 1974
Article focuses attention on some of the more important factors to be considered in evaluating educational products and programs within the domain-referenced framework. (Author)
Descriptors: Curriculum Evaluation, Evaluation Criteria, Program Evaluation, Test Construction
Wright, Patricia – Programmed Learning and Educational Technology, 1975
An investigation which compared five ways of expressing information about the choice of questions to be made from different sections of a test paper. (Author)
Descriptors: Educational Research, Forced Choice Technique, Test Construction, Test Interpretation

Piotrowski, Richard J. – Psychology in the Schools, 1976
Changes in the full scale reliability of the WISC-R were computed at three age levels when each subtest was omitted by itself. The same procedure was followed with those subtests which independently had the smallest effect in lowering full scale reliability. Cautions were noted concerning the exclusion of subtests. (Author)
Descriptors: Intelligence Tests, Statistical Studies, Test Construction, Test Interpretation