Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 2 |
Since 2016 (last 10 years) | 4 |
Since 2006 (last 20 years) | 7 |
Descriptor
Standardized Tests | 53 |
Test Construction | 53 |
Test Format | 53 |
Test Items | 18 |
Higher Education | 17 |
Test Use | 14 |
Achievement Tests | 13 |
Elementary Secondary Education | 13 |
Test Validity | 11 |
Test Reliability | 10 |
Student Evaluation | 9 |
More ▼ |
Source
Author
Huntley, Renee M. | 2 |
Abedi, Jamal | 1 |
Anderson, Scarvia B. | 1 |
Baker, Holly | 1 |
Bayley, Robert | 1 |
Boyd, Herbert F. | 1 |
Braun, Henry | 1 |
Braun, Henry I. | 1 |
Brueggemann, Louis V. | 1 |
Carcelli, Larry | 1 |
Carrick, Tessa | 1 |
More ▼ |
Publication Type
Education Level
Higher Education | 2 |
Postsecondary Education | 2 |
Elementary Secondary Education | 1 |
Grade 8 | 1 |
Secondary Education | 1 |
Laws, Policies, & Programs
No Child Left Behind Act 2001 | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
Jiajing Huang – ProQuest LLC, 2022
The nonequivalent-groups anchor-test (NEAT) data-collection design is commonly used in large-scale assessments. Under this design, different test groups take different test forms. Each test form has its own unique items and all test forms share a set of common items. If item response theory (IRT) models are applied to analyze the test data, the…
Descriptors: Item Response Theory, Test Format, Test Items, Test Construction
Braun, Henry – British Journal of Educational Psychology, 2019
Background: There is unrealized potential in higher education for greater use of performance assessment, particularly in support of teaching and learning: Well-designed performance tasks can elicit evidence regarding what students know and can do with respect to complex learning objectives. At the same time, there is some pressure, at least in the…
Descriptors: Performance Based Assessment, Higher Education, Test Format, Standardized Tests
Lina Anaya; Nagore Iriberri; Pedro Rey-Biel; Gema Zamarro – Annenberg Institute for School Reform at Brown University, 2021
Standardized assessments are widely used to determine access to educational resources with important consequences for later economic outcomes in life. However, many design features of the tests themselves may lead to psychological reactions influencing performance. In particular, the level of difficulty of the earlier questions in a test may…
Descriptors: Test Construction, Test Wiseness, Test Items, Difficulty Level
Fairman, Janet; Johnson, Amy; Mette, Ian; Wickerd, Garry; LaBrie, Sharon – Center for Education Policy, Applied Research, and Evaluation, 2018
The Maine Legislature requested the Maine Education Policy Research Institute (MEPRI) to conduct an assessment of standardized testing in Maine schools to understand the amount, cost, and usefulness of it. This report summarizes the resulting effort, which included a literature scan, document analysis, and surveys of two groups of school…
Descriptors: Standardized Tests, Educational Assessment, Educational Benefits, Screening Tests
Keller, Lisa A.; Keller, Robert R. – Applied Measurement in Education, 2015
Equating test forms is an essential activity in standardized testing, with increased importance with the accountability systems in existence through the mandate of Adequate Yearly Progress. It is through equating that scores from different test forms become comparable, which allows for the tracking of changes in the performance of students from…
Descriptors: Item Response Theory, Rating Scales, Standardized Tests, Scoring Rubrics
Abedi, Jamal; Leon, Seth; Kao, Jenny; Bayley, Robert; Ewers, Nancy; Herman, Joan; Mundhenk, Kimberly – National Center for Research on Evaluation, Standards, and Student Testing (CRESST), 2011
The purpose of this study was to examine the characteristics of reading test items that may differentially impede the performance of students with disabilities. The findings suggest that there are certain revisions that can be done on current assessments to make them more accessible for students with disabilities. Features such as words per page,…
Descriptors: Test Items, Reading Tests, Disabilities, Student Evaluation

Wang, Tianyou; Kolen, Michael J. – Applied Psychological Measurement, 1996
A quadratic curve test equating method for equating different test forms under a random-groups data collection design is proposed that equates the first three central moments of the test forms. When applied to real test data, the method performs as well as other equating methods. Procedures from implementing the test are described. (SLD)
Descriptors: Data Collection, Equated Scores, Standardized Tests, Test Construction
Shermis, Mark D.; DiVesta, Francis J. – Rowman & Littlefield Publishers, Inc., 2011
"Classroom Assessment in Action" clarifies the multi-faceted roles of measurement and assessment and their applications in a classroom setting. Comprehensive in scope, Shermis and Di Vesta explain basic measurement concepts and show students how to interpret the results of standardized tests. From these basic concepts, the authors then…
Descriptors: Student Evaluation, Standardized Tests, Scores, Measurement

Readence, John E.; Moore, David W. – Journal of Reading, 1983
Examines the development of standardized reading comprehension tests in the critical states from the early 1900s through current testing trends. (AEA)
Descriptors: Educational History, Literature Reviews, Questioning Techniques, Reading Tests
Boyd, Herbert F. – 1984
The Non-Verbal Test of Cognitive Skills (NTCS), by Boyd and Johnson, attempts to address the problems of non-verbal and culturally biased testing instruments in three directions: (1) Test items should be developed which cannot be linked to a specific culture or environmental source as previous learning; (2) Assumed language facility should be…
Descriptors: Academic Ability, Achievement Tests, Culture Fair Tests, Elementary Secondary Education
Haladyna, Thomas M.; Downing, Steven M. – 1988
The proposition that the optimal number of options in a multiple choice test item is three was examined. The concept of functional distractor, a plausible wrong answer that is negatively discriminating when total test performance is the criterion, is discussed. Three distinct groups of achievers (high, middle, and low) on a national standardized…
Descriptors: Achievement Tests, Item Analysis, Multiple Choice Tests, Physicians

Brueggemann, Louis V. – Reading Horizons, 1987
Reviews research dealing with test wiseness and concludes that teachers and others who administer tests and those who review results should consider the degree to which test wiseness characteristics might have been operative had a planned effort been undertaken to provide special instruction. (FL)
Descriptors: Reading Instruction, Reading Research, Reading Tests, Standardized Tests
Anderson, Scarvia B. – American School Board Journal, 1981
Describes types of tests, types of student responses required, ways to interpret tests, and the major test developers. (WD)
Descriptors: Elementary Secondary Education, Responses, Standardized Tests, Teacher Made Tests
Rosner, Frieda C.; Weber, Wilford A. – 1982
A review of the National Teacher Examinations (NTE) has focused on the Commons Examinations component. The Commons, now named the National Teacher Examinations Core Battery, tests general knowledge, communication skills, and professional knowledge in three separate tests. Users may select portions of the tests that best suit their needs at various…
Descriptors: Classroom Techniques, Higher Education, Program Evaluation, Standardized Tests

Kibby, Michael W. – Journal of Reading, 1981
Describes the test, The Degrees of Reading Power, developed by the New York State Education Department for use as a formalized "informal reading inventory." (MKM)
Descriptors: Elementary Secondary Education, Informal Reading Inventories, Minimum Competency Testing, Readability