Publication Date
| In 2026 | 0 |
| Since 2025 | 1 |
| Since 2022 (last 5 years) | 1 |
| Since 2017 (last 10 years) | 4 |
| Since 2007 (last 20 years) | 102 |
Descriptor
| Comparative Analysis | 248 |
| Educational Testing | 248 |
| Academic Achievement | 62 |
| Foreign Countries | 49 |
| Educational Assessment | 46 |
| Test Results | 46 |
| Evaluation Methods | 40 |
| Program Effectiveness | 34 |
| Student Evaluation | 33 |
| Educational Policy | 32 |
| Scores | 26 |
| More ▼ | |
Source
Author
| Donovan, Jenny | 3 |
| Lennon, Melissa | 3 |
| Hutton, Penny | 2 |
| Llaudet, Elena | 2 |
| Morris, Cathy | 2 |
| Morrissey, Noni | 2 |
| Newton, Paul E. | 2 |
| O'Connor, Gayl | 2 |
| Peterson, Paul E. | 2 |
| Popham, W. James | 2 |
| Yang, Xiangdong | 2 |
| More ▼ | |
Publication Type
Education Level
| Elementary Secondary Education | 65 |
| Elementary Education | 29 |
| Higher Education | 25 |
| Secondary Education | 20 |
| Postsecondary Education | 18 |
| Grade 8 | 13 |
| Grade 4 | 12 |
| Middle Schools | 11 |
| High Schools | 10 |
| Grade 5 | 6 |
| Grade 6 | 6 |
| More ▼ | |
Location
| United Kingdom | 14 |
| United States | 10 |
| Australia | 9 |
| United Kingdom (England) | 9 |
| California | 5 |
| Canada | 5 |
| Florida | 5 |
| Finland | 4 |
| United Kingdom (Great Britain) | 4 |
| United Kingdom (Wales) | 4 |
| Hong Kong | 3 |
| More ▼ | |
Laws, Policies, & Programs
| No Child Left Behind Act 2001 | 12 |
| Bilingual Education Act 1968 | 1 |
| Elementary and Secondary… | 1 |
| Elementary and Secondary… | 1 |
| Individuals with Disabilities… | 1 |
| Stewart B McKinney Homeless… | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
Yixi Wang – ProQuest LLC, 2020
Binary item response theory (IRT) models are widely used in educational testing data. These models are not perfect because they simplify the individual item responding process, ignore the differences among different response patterns, cannot handle multidimensionality that lay behind options within a single item, and cannot manage missing response…
Descriptors: Item Response Theory, Educational Testing, Data, Models
Suthathip Thirakunkovit – Language Testing in Asia, 2025
Establishing a cut score is a crucial aspect of the test development process since the selected cut score has the potential to impact students' performance outcomes and shape instructional strategies within the classroom. Therefore, it is vital for those involved in test development to set a cut score that is both fair and justifiable. This cut…
Descriptors: Cutting Scores, Culture Fair Tests, Language Tests, Test Construction
Berman, Amy I.; Haertel, Edward H.; Pellegrino, James W. – National Academy of Education, 2020
This National Academy of Education (NAEd) volume provides guidance to key stakeholders on how to accurately report and interpret comparability assertions concerning large-scale educational assessments as well as how to ensure greater comparability by paying close attention to key aspects of assessment design, content, and procedures. The goal of…
Descriptors: Educational Assessment, Educational Testing, Scores, Comparative Analysis
Wright, Daniel B. – Educational Measurement: Issues and Practice, 2019
There is much discussion about and many policies to address achievement gaps in education among groups of students. The focus here is on a different gap and it is argued that it also should be of concern. Speed gaps are differences in how quickly different groups of students answer the questions on academic assessments. To investigate some speed…
Descriptors: Academic Achievement, Achievement Gap, Reaction Time, Educational Testing
Veldkamp, Bernard P. – Journal of Educational Measurement, 2016
Many standardized tests are now administered via computer rather than paper-and-pencil format. The computer-based delivery mode brings with it certain advantages. One advantage is the ability to adapt the difficulty level of the test to the ability level of the test taker in what has been termed computerized adaptive testing (CAT). A second…
Descriptors: Computer Assisted Testing, Reaction Time, Standardized Tests, Difficulty Level
Baker, Eva L. – Educational Researcher, 2016
This article investigates the persistent and change elements of educational testing and assessment from 1920 to the present day. I show by examining the addresses and texts of American Educational Research Association presidents a continuing focus on schools, from early experiments and development up through applications in accountability systems.…
Descriptors: Research, Educational Testing, Presidents, Professional Associations
Woodruff, David; Traynor, Anne; Cui, Zhongmin; Fang, Yu – ACT, Inc., 2013
Professional standards for educational testing recommend that both the overall standard error of measurement and the conditional standard error of measurement (CSEM) be computed on the score scale used to report scores to examinees. Several methods have been developed to compute scale score CSEMs. This paper compares three methods, based on…
Descriptors: Comparative Analysis, Error of Measurement, Scores, Scaling
Ling, Guangming – International Journal of Testing, 2016
To investigate possible iPad related mode effect, we tested 403 8th graders in Indiana, Maryland, and New Jersey under three mode conditions through random assignment: a desktop computer, an iPad alone, and an iPad with an external keyboard. All students had used an iPad or computer for six months or longer. The 2-hour test included reading, math,…
Descriptors: Educational Testing, Computer Assisted Testing, Handheld Devices, Computers
Embretson, Susan E.; Yang, Xiangdong – Psychometrika, 2013
This paper presents a noncompensatory latent trait model, the multicomponent latent trait model for diagnosis (MLTM-D), for cognitive diagnosis. In MLTM-D, a hierarchical relationship between components and attributes is specified to be applicable to permit diagnosis at two levels. MLTM-D is a generalization of the multicomponent latent trait…
Descriptors: Mathematics Achievement, Achievement Tests, Item Response Theory, Measurement
Bielinska-Kwapisz, Agnieszka; Brown, F. William; Semenik, Richard – Journal of Education for Business, 2012
The Major Field Test in Business (MFT-B), a standardized assessment test of business knowledge among undergraduate business seniors, is widely used to measure student achievement. The Educational Testing Service, publisher of the assessment, provides data that allow institutions to compare their own MFT-B performance to national norms, but that…
Descriptors: Educational Testing, Academic Achievement, Field Tests, National Norms
Barry, Carol L. – College Board, 2013
The College-Level Examination Program® (CLEP®) is an exam program consisting of 33 exams in five subject areas that typically correspond to single-semester courses, but some correspond to full-year or two-year courses. CLEP exams offer students the opportunity to receive college course credit for learning that has already occurred outside of the…
Descriptors: Higher Education, College Credits, Educational Testing, Prior Learning
Hixson, Nate; Rhudy, Vaughn – West Virginia Department of Education, 2013
Student responses to the West Virginia Educational Standards Test (WESTEST) 2 Online Writing Assessment are scored by a computer-scoring engine. The scoring method is not widely understood among educators, and there exists a misperception that it is not comparable to hand scoring. To address these issues, the West Virginia Department of Education…
Descriptors: Scoring Formulas, Scoring Rubrics, Interrater Reliability, Test Scoring Machines
Condon, William – Assessing Writing, 2013
Automated Essay Scoring (AES) has garnered a great deal of attention from the rhetoric and composition/writing studies community since the Educational Testing Service began using e-rater[R] and the "Criterion"[R] Online Writing Evaluation Service as products in scoring writing tests, and most of the responses have been negative. While the…
Descriptors: Measurement, Psychometrics, Evaluation Methods, Educational Testing
Marshall, Jeffery H.; Chinna, Ung; Hok, Ung Ngo; Tinon, Souer; Veasna, Meung; Nissay, Put – Educational Assessment, Evaluation and Accountability, 2012
The global spread of national assessment testing activities, and the growing pressure to move beyond basic measures of participation in educational monitoring, means that student achievement measures are likely to become increasingly relevant indicators of systemic progress in the developing world. Using data from the CESSP project in Cambodia,…
Descriptors: Foreign Countries, Academic Achievement, Developing Nations, Evaluation Methods
Wimberley, Alan – ProQuest LLC, 2010
This study was conducted to analyze the performance differences between alternative education campuses in Texas that used teacher-directed strategies and those that used self-directed strategies. The study was also conducted to inform educators of the results these two strategies had achieved with at-risk students during the three years of…
Descriptors: Nontraditional Education, At Risk Students, Program Effectiveness, Educational Strategies

Direct link
Peer reviewed
