Publication Date
In 2025 | 1 |
Since 2024 | 2 |
Since 2021 (last 5 years) | 3 |
Since 2016 (last 10 years) | 5 |
Since 2006 (last 20 years) | 5 |
Descriptor
Test Format | 41 |
Test Validity | 36 |
Test Reliability | 19 |
Test Construction | 15 |
Elementary Secondary Education | 12 |
Student Evaluation | 9 |
Test Use | 9 |
Language Tests | 7 |
Literature Reviews | 7 |
Testing | 7 |
Testing Problems | 7 |
More ▼ |
Source
Author
Hambleton, Ronald K. | 2 |
Baker, Eva L. | 1 |
Baker, Holly | 1 |
Benson, Jeri | 1 |
Bollwark, John | 1 |
Bond, Lloyd | 1 |
Brannon, Robert | 1 |
Brittain, Clay V. | 1 |
Brittain, Mary M. | 1 |
Buser, Karen | 1 |
Caldwell, JoAnne | 1 |
More ▼ |
Publication Type
Information Analyses | 41 |
Journal Articles | 21 |
Reports - Evaluative | 6 |
Speeches/Meeting Papers | 6 |
Opinion Papers | 5 |
Guides - Non-Classroom | 3 |
Reports - Research | 3 |
ERIC Publications | 1 |
Reports - Descriptive | 1 |
Education Level
Elementary Secondary Education | 2 |
Postsecondary Education | 2 |
Elementary Education | 1 |
Higher Education | 1 |
Audience
Practitioners | 2 |
Administrators | 1 |
Researchers | 1 |
Location
Japan | 1 |
Laws, Policies, & Programs
Assessments and Surveys
SAT (College Admission Test) | 2 |
Conflict Tactics Scale | 1 |
Keymath Diagnostic Arithmetic… | 1 |
National Teacher Examinations | 1 |
Self Directed Search | 1 |
What Works Clearinghouse Rating
Laura A. Outhwaite; Pirjo Aunio; Jaimie Ka Yu Leung; Jo Van Herwegen – Educational Psychology Review, 2024
Successful early mathematical development is vital to children's later education, employment, and wellbeing outcomes. However, established measurement tools are infrequently used to (i) assess children's mathematical skills and (ii) identify children with or at-risk of mathematical learning difficulties. In response, this pre-registered systematic…
Descriptors: Mathematics Tests, Screening Tests, Mathematics Skills, At Risk Students
Xueliang Chen; Vahid Aryadoust; Wenxin Zhang – Language Testing, 2025
The growing diversity among test takers in second or foreign language (L2) assessments makes the importance of fairness front and center. This systematic review aimed to examine how fairness in L2 assessments was evaluated through differential item functioning (DIF) analysis. A total of 83 articles from 27 journals were included in a systematic…
Descriptors: Second Language Learning, Language Tests, Test Items, Item Analysis
Rios, Joseph A.; Ihlenfeldt, Samuel D.; Dosedel, Michael; Riegelman, Amy – Educational Measurement: Issues and Practice, 2020
This systematic review investigated the topics studied and reporting practices of published meta-analyses in educational measurement. Our findings indicated that meta-analysis is not a highly utilized methodological tool in educational measurement; on average, less than one meta-analysis has been published per year over the past 30 years (28…
Descriptors: Meta Analysis, Educational Assessment, Test Format, Testing Accommodations
Cui, Ying; Chen, Fu; Lutsyk, Alina; Leighton, Jacqueline P.; Cutumisu, Maria – Assessment in Education: Principles, Policy & Practice, 2023
With the exponential increase in the volume of data available in the 21st century, data literacy skills have become vitally important in work places and everyday life. This paper provides a systematic review of available data literacy assessments targeted at different audiences and educational levels. The results can help researchers and…
Descriptors: Data, Information Literacy, 21st Century Skills, Competence
Rogers, Christopher M.; Thurlow, Martha L.; Lazarus, Sheryl S.; Liu, Kristin K. – National Center on Educational Outcomes, 2019
The purpose of this report is to present a synthesis of the research on test accommodations published in 2015 and 2016. We summarize the research to review current research trends and enhance understanding of the implications of accommodations use in the development of future policy directions, to highlight implementation of current and new…
Descriptors: Testing Accommodations, Students with Disabilities, Elementary Secondary Education, Postsecondary Education
Wiley, David E. – 1981
The Ralph Nader report on the Educational Testing Service (ETS), entitled The Reign of ETS, the Corporation That Makes Up Minds, explicitly and implicitly raises serious issues concerning the testing enterprise. Major themes include the role of testing in the educational selection system, the validity of existing tests, and the corporate power of…
Descriptors: Book Reviews, College Entrance Examinations, Test Format, Test Validity

Turner, Jean – Annual Review of Applied Linguistics, 1998
This review of research on second-language oral testing outlines the nature of early research in interview-format proficiency testing, then reports on new directions in investigation of construct validity of interview-format and other oral skills tests through examination of examinee, interviewer, and rater performance. Research on empirically…
Descriptors: Construct Validity, Educational Trends, Interrater Reliability, Interviews

Haladyna, Thomas M.; Downing, Steven M. – Applied Measurement in Education, 1989
Results of 96 theoretical/empirical studies were reviewed to see if they support a taxonomy of 43 rules for writing multiple-choice test items. The taxonomy is the result of an analysis of 46 textbooks dealing with multiple-choice item writing. For nearly half of the rules, no research was found. (SLD)
Descriptors: Classification, Literature Reviews, Multiple Choice Tests, Test Construction
Hambleton, Ronald K.; Bollwark, John – 1991
The validity of results from international assessments depends on the correctness of the test translations. If the tests presented in one language are more or less difficult because of the manner in which they are translated, the validity of any interpretation of the results can be questioned. Many test translation methods exist in the literature,…
Descriptors: Cultural Differences, Educational Assessment, English, Foreign Countries

Brannon, Robert – Psychology of Women Quarterly, 1981
Discusses methodological issues in paper-and-pencil measuring instruments applicable to the assessment of attitudes toward women, men and a variety of gender-related issues. Commonly used questionnaire formats are critiqued and the limitations of heterogeneous scales explored. Presents recommendations for the construction of scales that predict…
Descriptors: Attitude Measures, Females, Literature Reviews, Questioning Techniques

Stiggins, Richard J. – Research in the Teaching of English, 1982
Compares direct and indirect writing assessment strategies and contrasts them in terms of the relationship each has to specific classroom decision-making situations, the components of writing assessed, practical testing matters, characteristics of test exercises, test scoring procedures, and procedures for determining test quality. (HOD)
Descriptors: Comparative Analysis, Decision Making, Educational Assessment, Test Format

Wergin, Jon F. – New Directions for Teaching and Learning, 1988
Each of the many methods or approaches to assessing student learning is based on a clear understanding of validity, reliability, course objectives, the advantages and disadvantages of different evaluation formats, and the ways assessment data will be used to improve instruction. (Author/MSE)
Descriptors: College Instruction, Educational Objectives, Evaluation Methods, Higher Education

Benson, Jeri – Educational and Psychological Measurement, 1981
A review of the research on item writing, item format, test instructions, and item readability indicated the importance of instrument structure in the interpretation of test data. The effect of failing to consider these areas on the content validity of achievement test scores is discussed. (Author/GK)
Descriptors: Achievement Tests, Elementary Secondary Education, Literature Reviews, Scores
Brittain, Mary M.; Brittain, Clay V. – 1981
A behavioral domain is well-defined when it is clear to both test developers and test users which categories of performance should or should not be considered for potential test items. Only those tests that are keyed to well-defined domains meet the definition of criterion-referenced tests. The greatest proliferation of criterion-referenced tests…
Descriptors: Criterion Referenced Tests, Reading Achievement, Reading Tests, Test Construction

Nickerson, Raymond S. – Educational Researcher, 1989
Discusses issues involved in the construction, validity, and use of tests that evaluate educational progress, especially those that assess higher-order cognitive functioning. Reviews the four articles in this special issue. (FMW)
Descriptors: Cognitive Measurement, Educational Testing, Elementary Secondary Education, Evaluation