Showing 1 to 15 of 73 results
Peer reviewed
Arce, Alvaro J.; Young, Michael J. – International Journal of Testing, 2022
The paper argues that contemporary test validity theory places the consequences of testing on the lives of all college applicants at the back of the test validation argument. It introduces the notion of test efficacy as a process to gather evidence on claims on consequences of testing on all college applicants that can be traced back to validity.…
Descriptors: Test Validity, Test Theory, College Applicants, College Entrance Examinations
Peer reviewed
Ing, Marsha; Chinen, Starlie; Jackson, Kara; Smith, Thomas M. – Educational Measurement: Issues and Practice, 2021
Despite the ease of accessing a wide range of measures, little attention is given to validity arguments when considering whether to use the measure for a new purpose or in a different context. Making a validity argument has historically focused on the intended interpretation and use. There has been a press to consider both the intended and actual…
Descriptors: Instructional Improvement, Measures (Individuals), Test Validity, Test Interpretation
Peer reviewed
Andrew P. Jaciw – American Journal of Evaluation, 2025
By design, randomized experiments (XPs) rule out bias from confounded selection of participants into conditions. Quasi-experiments (QEs) are often considered second-best because they do not share this benefit. However, when results from XPs are used to generalize causal impacts, the benefit from unconfounded selection into conditions may be offset…
Descriptors: Elementary School Students, Elementary School Teachers, Generalization, Test Bias
Peer reviewed
Clark, Amy K.; Karvonen, Meagan – Educational Assessment, 2020
Alternate assessments based on alternate achievement standards (AA-AAS) have historically lacked broad validity evidence and an overall evaluation of the extent to which evidence supports intended uses of results. An expanding body of validation literature, the funding of two AA-AAS consortia, and advances in computer-based assessment have…
Descriptors: Alternative Assessment, Test Validity, Test Use, Students with Disabilities
Dadey, Nathan; Keng, Leslie; Boyer, Michelle; Marion, Scott – National Center for the Improvement of Educational Assessment, 2021
State summative educational assessment is about to begin in earnest. Rightfully, many are raising questions about the quality, meaning, and appropriate use of the assessment results. This document was written to support state educational agencies (SEAs) and their assessment providers in devising effective and efficient analysis plans. This…
Descriptors: Educational Assessment, Summative Evaluation, Student Evaluation, Test Use
Im, Gwan-Hyeok; Shin, Dongil; Park, Soohyeon – Current Issues in Language Planning, 2022
This study suggests a conceptual framework for policy-driven test development and validation, using the Test of Proficiency in Korean (TOPIK) as an example context. By linking the literature on policy analysis and argument structure in the validation of testing, the strong relationships between policy and testing are illustrated. This rationalizes…
Descriptors: Language Proficiency, Language Tests, Korean, Test Construction
Papageorgiou, Spiros; Davis, Larry; Norris, John M.; Garcia Gomez, Pablo; Manna, Venessa F.; Monfils, Lora – Educational Testing Service, 2021
The "TOEFL® Essentials"™ test is a new English language proficiency test in the "TOEFL"® family of assessments. It measures foundational language skills and communication abilities in academic and general (daily life) contexts. The test covers the four language skills of reading, listening, writing, and speaking and is intended…
Descriptors: Language Tests, English (Second Language), Second Language Learning, Language Proficiency
Peer reviewed
Haertel, Edward H. – Educational Psychologist, 2018
In the service of educational accountability, student achievement tests are being used to measure constructs quite unlike those envisioned by test developers. Scores are compared to cut points to create classifications like "proficient"; scores are combined over time to measure growth; student scores are aggregated to measure the…
Descriptors: Achievement Tests, Scores, Test Validity, Test Interpretation
College Board, 2023
Over the past several years, content experts, psychometricians, and researchers have been hard at work developing, refining, and studying the digital SAT. The work is grounded in foundational best practices and advances in measurement and assessment design, with fairness for students informing all of the work done. This paper shares learnings from…
Descriptors: College Entrance Examinations, Psychometrics, Computer Assisted Testing, Best Practices
Sriram, Rishi – NASPA - Student Affairs Administrators in Higher Education, 2014
When student affairs professionals assess their work, they often employ some type of survey. The use of surveys stems from a desire to objectively measure outcomes, a demand from someone else (e.g., supervisor, accreditation committee) for data, or the feeling that numbers can provide an aura of competence. Although surveys are effective tools for…
Descriptors: Surveys, Test Construction, Student Personnel Services, Test Use
New York State Education Department, 2018
This technical report provides detailed information regarding the technical, statistical, and measurement attributes of the New York State Testing Program (NYSTP) for the Grades 3-8 English Language Arts (ELA) and Mathematics 2018 Operational Tests. This report includes information about test content and test development, item (i.e., individual…
Descriptors: English, Language Arts, Language Tests, Mathematics Tests
New York State Education Department, 2017
This technical report provides detailed information regarding the technical, statistical, and measurement attributes of the New York State Testing Program (NYSTP) for the Grades 3-8 English Language Arts (ELA) and Mathematics 2017 Operational Tests. This report includes information about test content and test development, item (i.e., individual…
Descriptors: English, Language Arts, Language Tests, Mathematics Tests
Peer reviewed
Nichols, Paul D.; Williams, Natasha – Educational Measurement: Issues and Practice, 2009
This article has three goals. The first goal is to clarify the role that the consequences of test score use play in validity judgments by reviewing the role that modern writers on validity have ascribed for consequences in supporting validity judgments. The second goal is to summarize current views on who is responsible for collecting evidence of…
Descriptors: Tests, Test Validity, Scores, Data Collection
Peer reviewed
Falk, Beverly; Ort, Suzanne Wichterle; Moirs, Katie – Educational Assessment, 2007
This article describes the findings of studies conducted on a large-scale, classroom-based performance assessment of literacy for the early grades designed to provide information that is useful for reporting, as well as teaching. Technical studies found the assessment to be a promising instrument that is reliable and valid. Follow-up studies of…
Descriptors: Program Effectiveness, Performance Based Assessment, Student Evaluation, Evaluation Research
Peer reviewed
Krieshok, Thomas S.; Harrington, Robert G. – Journal of Counseling & Development, 1985
Reviews the administrative features, uses, development, standardization, reliability, and validity of the Multidimensional Aptitude Battery (MAB), a new group intelligence test designed to be a paper-and-pencil parallel to the Wechsler Adult Intelligence Scale-Revised (WAIS-R). (BL)
Descriptors: Group Testing, Intelligence Tests, Test Construction, Test Reliability