Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 0 |
Since 2016 (last 10 years) | 0 |
Since 2006 (last 20 years) | 23 |
Descriptor
Source
Author
Publication Type
Education Level
Elementary Secondary Education | 7 |
Higher Education | 5 |
Postsecondary Education | 5 |
Grade 3 | 2 |
Early Childhood Education | 1 |
Elementary Education | 1 |
Grade 2 | 1 |
Grade 4 | 1 |
Grade 5 | 1 |
Preschool Education | 1 |
Location
United Kingdom | 4 |
Australia | 2 |
California | 2 |
Florida | 2 |
Singapore | 2 |
United States | 2 |
Alabama | 1 |
Canada | 1 |
Ethiopia | 1 |
France | 1 |
Hungary | 1 |
More ▼ |
Laws, Policies, & Programs
No Child Left Behind Act 2001 | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
Arffman, Inga – Educational Measurement: Issues and Practice, 2013
The article reviews research and findings on problems and issues faced when translating international academic achievement tests. The purpose is to draw attention to the problems, to help to develop the procedures followed when translating the tests, and to provide suggestions for further research. The problems concentrate on the following: the…
Descriptors: Achievement Tests, Translation, Testing Problems, Test Construction
Sabatini, John; O'Reilly, Tenaha; Deane, Paul – ETS Research Report Series, 2013
This report describes the foundation and rationale for a framework designed to measure reading literacy. The aim of the effort is to build an assessment system that reflects current theoretical conceptions of reading and is developmentally sensitive across a prekindergarten to 12th grade student range. The assessment framework is intended to…
Descriptors: Reading Tests, Literacy, Models, Testing Programs
Tarar, Jessica M.; Meisinger, Elizabeth B.; Dickens, Rachel H. – Canadian Journal of School Psychology, 2015
The TOWRE-2 was developed to provide an efficient measure of two essential wordlevel reading skills, sight word reading and phonetic decoding skills. The Sight Word Efficiency (SWE) subtest assesses the number of real words that an individual can read from a vertical list within 45 s. This subtest is designed to measure the size of an individual's…
Descriptors: Word Study Skills, Sight Method, Phonetics, Decoding (Reading)
Stohlman, Trey – Journal of the Scholarship of Teaching and Learning, 2015
A good assessment plan combines many direct and indirect measures to validate the collected data. One often controversial assessment measure comes in the form of retention exams. Although assessment retention exams may come with faults, others advocate for their inclusion in program assessment. Objective-based tests may offer insight to…
Descriptors: Alternative Assessment, Retention (Psychology), Program Evaluation, Program Effectiveness
Wang, Lin; Qian, Jiahe; Lee, Yi-Hsuan – ETS Research Report Series, 2013
The purpose of this study was to evaluate the combined effects of reduced equating sample size and shortened anchor test length on item response theory (IRT)-based linking and equating results. Data from two independent operational forms of a large-scale testing program were used to establish the baseline results for evaluating the results from…
Descriptors: Test Construction, Item Response Theory, Testing Programs, Simulation
Albano, Anthony D. – Journal of Educational Measurement, 2013
In many testing programs it is assumed that the context or position in which an item is administered does not have a differential effect on examinee responses to the item. Violations of this assumption may bias item response theory estimates of item and person parameters. This study examines the potentially biasing effects of item position. A…
Descriptors: Test Items, Item Response Theory, Test Format, Questioning Techniques
Chen, Xinnian; Graesser, Donnasue; Sah, Megha – Advances in Physiology Education, 2015
Laboratory courses serve as important gateways to science, technology, engineering, and mathematics education. One of the challenges in assessing laboratory learning is to conduct meaningful and standardized practical exams, especially for large multisection laboratory courses. Laboratory practical exams in life sciences courses are frequently…
Descriptors: Laboratory Experiments, Standardized Tests, Testing Programs, Testing Problems
Becker, Kirk A.; Bergstrom, Betty A. – Practical Assessment, Research & Evaluation, 2013
The need for increased exam security, improved test formats, more flexible scheduling, better measurement, and more efficient administrative processes has caused testing agencies to consider converting the administration of their exams from paper-and-pencil to computer-based testing (CBT). Many decisions must be made in order to provide an optimal…
Descriptors: Testing, Models, Testing Programs, Program Administration
Wyse, Adam E. – Applied Psychological Measurement, 2011
In many practical testing situations, alternate test forms from the same testing program are not strictly parallel to each other and instead the test forms exhibit small psychometric differences. This article investigates the potential practical impact that these small psychometric differences can have on expected classification accuracy. Ten…
Descriptors: Test Format, Test Construction, Testing Programs, Psychometrics
Moses, Tim; Liu, Jinghua; Tan, Adele; Deng, Weiling; Dorans, Neil J. – ETS Research Report Series, 2013
In this study, differential item functioning (DIF) methods utilizing 14 different matching variables were applied to assess DIF in the constructed-response (CR) items from 6 forms of 3 mixed-format tests. Results suggested that the methods might produce distinct patterns of DIF results for different tests and testing programs, in that the DIF…
Descriptors: Test Construction, Multiple Choice Tests, Test Items, Item Analysis
Goldstein, Jessica; Behuniak, Peter – Assessment for Effective Intervention, 2011
State-level testing programs continue to grow, and the challenge of validation does not wane. Although more than a decade has passed since the 1999 Joint Standards for Educational and Psychological Testing set out a call for the organization of validity evidence into validity arguments, practical examples of such arguments are not readily…
Descriptors: Testing Programs, State Programs, Alternative Assessment, Test Validity
Dimacali, Allen M. – Journal of Mathematics Education at Teachers College, 2012
In conjunction with the adoption and subsequent implementation of the "Common Core State Standards for Mathematics" (CCSSM), state-led consortia are developing next-generation assessments aligned to the CCSSM. This paper discusses the progress and plans of two main coalitions of states--the Partnership for Assessment of Readiness for…
Descriptors: Common Core State Standards, Alternative Assessment, Test Construction, Testing Programs
Kettler, Ryan J. – Review of Research in Education, 2015
This chapter introduces theory that undergirds the role of testing adaptations in assessment, provides examples of item modifications and testing accommodations, reviews research relevant to each, and introduces a new paradigm that incorporates opportunity to learn (OTL), academic enablers, testing adaptations, and inferences that can be made from…
Descriptors: Meta Analysis, Literature Reviews, Testing, Testing Accommodations
Pellegrino, James W. – Journal of Research in Science Teaching, 2012
Beginning with a reference to living in a time of both uncertainty and opportunity, this article presents a discussion of key areas where shared understanding is needed if we are to successfully realize the design and use of high quality, valid assessments of science. The key areas discussed are: (1) assessment purpose and use, (2) the nature of…
Descriptors: Science Education, Science and Society, Academic Standards, State Standards
Shohamy, Elana – Language and Intercultural Communication, 2013
While much of the work in language testing is concerned with constructing quality tests in order to measure language knowledge in reliable and valid ways, there has been a significant movement in language testing research that examines tests in the context of their use in education and society. This line of research exits from the notion that…
Descriptors: Language Tests, Testing, Evaluation Research, Ideology