Publication Date
In 2025 | 0 |
Since 2024 | 8 |
Since 2021 (last 5 years) | 59 |
Since 2016 (last 10 years) | 203 |
Since 2006 (last 20 years) | 498 |
Descriptor
Comparative Analysis | 626 |
Scores | 626 |
Computer Assisted Testing | 192 |
Hypothesis Testing | 174 |
Statistical Analysis | 169 |
Foreign Countries | 163 |
Testing | 123 |
Academic Achievement | 111 |
Correlation | 107 |
English (Second Language) | 89 |
Teaching Methods | 87 |
Author
Sinharay, Sandip | 6 |
Kim, Sooyeon | 5 |
Attali, Yigal | 4 |
Bridgeman, Brent | 3 |
Chudowsky, Naomi | 3 |
Chudowsky, Victor | 3 |
Hackathorn, Jana | 3 |
Puhan, Gautam | 3 |
Alderman, Donald L. | 2 |
Bennett, Randy Elliot | 2 |
Camara, Wayne J. | 2 |
Audience
Researchers | 6 |
Practitioners | 2 |
Administrators | 1 |
Students | 1 |
Location
United States | 13 |
Canada | 12 |
India | 12 |
Texas | 12 |
Iran | 11 |
Turkey | 11 |
China | 10 |
Germany | 10 |
Israel | 10 |
New Jersey | 10 |
Florida | 8 |
Laws, Policies, & Programs
No Child Left Behind Act 2001 | 11 |
Elementary and Secondary… | 1 |
Elementary and Secondary… | 1 |
Individuals with Disabilities… | 1 |
Race to the Top | 1 |
Melchor Sánchez-Mendiola; Abigail P. Manzano-Patiño; Manuel García-Minjares; Enrique Buzo Casanova; Careli J. Herrera Penilla; Katyna Goytia-Rodríguez; Adrián Martínez-González – Educational Assessment, Evaluation and Accountability, 2023
COVID-19 has disrupted higher education globally, and there is scarce information about the "learning loss" in university students throughout this crisis. The goal of the study was to compare scores in a large-scale knowledge diagnostic exam applied to students admitted to the university, before and during the pandemic. Research design…
Descriptors: College Freshmen, Diagnostic Tests, Scores, Achievement Gains
Kim, Sooyeon; Walker, Michael – ETS Research Report Series, 2021
In this investigation, we used real data to assess potential differential effects associated with taking a test in a test center (TC) versus testing at home using remote proctoring (RP). We used a pseudo-equivalent groups (PEG) approach to examine group equivalence at the item level and the total score level. If our assumption holds that the PEG…
Descriptors: Testing, Distance Education, Comparative Analysis, Test Items
Marinho, Nathalie L.; Witmer, Sara E.; Jess, Nicole; Roschmann, Sarina – Language Assessment Quarterly, 2023
The use of accommodations is often recommended to remove barriers to academic testing among English Learners (ELs). However, it is unclear whether accommodations are particularly effective at improving ELs' test scores. A growing foundation of empirical work has explored this topic. We conducted a meta-analysis that examined several possible…
Descriptors: English Language Learners, Testing Accommodations, Barriers, Scores
Markus T. Jansen; Ralf Schulze – Educational and Psychological Measurement, 2024
Thurstonian forced-choice modeling is considered to be a powerful new tool to estimate item and person parameters while simultaneously testing the model fit. This assessment approach is associated with the aim of reducing faking and other response tendencies that plague traditional self-report trait assessments. As a result of major recent…
Descriptors: Factor Analysis, Models, Item Analysis, Evaluation Methods
Santi Lestari – Research Matters, 2024
Despite the increasing ubiquity of computer-based tests, many general qualifications examinations remain in a paper-based mode. Insufficient and unequal digital provision across schools is often identified as a major barrier to a full adoption of computer-based exams for general qualifications. One way to overcome this barrier is a gradual…
Descriptors: Keyboarding (Data Entry), Handwriting, Test Format, Comparative Analysis
Jones, Paul; Tong, Ye; Liu, Jinghua; Borglum, Joshua; Primoli, Vince – Journal of Educational Measurement, 2022
This article studied two methods to detect mode effects in two credentialing exams. In Study 1, we used a "modal scale comparison approach," where the same pool of items was calibrated separately, without transformation, within two TC cohorts (TC1 and TC2) and one OP cohort (OP1) matched on their pool-based scale score distributions. The…
Descriptors: Scores, Credentials, Licensing Examinations (Professions), Computer Assisted Testing
Baldwin, Peter; Clauser, Brian E. – Journal of Educational Measurement, 2022
While score comparability across test forms typically relies on common (or randomly equivalent) examinees or items, innovations in item formats, test delivery, and efforts to extend the range of score interpretation may require a special data collection before examinees or items can be used in this way--or may be incompatible with common examinee…
Descriptors: Scoring, Testing, Test Items, Test Format
Senel, Selma; Kutlu, Ömer – European Journal of Special Needs Education, 2018
This paper examines listening comprehension skills of visually impaired students (VIS) using computerised adaptive testing (CAT) and reader-assisted paper-pencil testing (raPPT) and student views about them. Explanatory mixed method design was used in this study. Sample is comprised of 51 VIS, in 7th and 8th grades. 9 of these students were…
Descriptors: Computer Assisted Testing, Adaptive Testing, Visual Impairments, Student Attitudes
Puhan, Gautam; Kim, Sooyeon – Journal of Educational Measurement, 2022
As a result of the COVID-19 pandemic, at-home testing has become a popular delivery mode in many testing programs. When programs offer at-home testing to expand their service, the score comparability between test takers testing remotely and those testing in a test center is critical. This article summarizes statistical procedures that could be…
Descriptors: Scores, Scoring, Comparative Analysis, Testing
Matt I. Brown; Patrick R. Heck; Christopher F. Chabris – Journal of Autism and Developmental Disorders, 2024
The Social Shapes Test (SST) is a measure of social intelligence which does not use human faces or rely on extensive verbal ability. The SST has shown promising validity among adults without autism spectrum disorder (ASD), but it is uncertain whether it is suitable for adults with ASD. We find measurement invariance between adults with (n = 229)…
Descriptors: Interpersonal Competence, Autism Spectrum Disorders, Emotional Intelligence, Verbal Ability
Wei Wang – Education and Information Technologies, 2024
As information and communication technologies develop in China, language tests are shifting from conventional paper-and-pencil testing to computerized testing. The aim of this study is to investigate Chinese test-takers' adaptability to computerized language exams, including their performance across testing modes and their perception of the…
Descriptors: Computer Assisted Testing, Language Tests, Second Language Learning, Second Language Instruction
Sinharay, Sandip – Journal of Educational Measurement, 2017
Person-fit assessment (PFA) is concerned with uncovering atypical test performance as reflected in the pattern of scores on individual items on a test. Existing person-fit statistics (PFSs) include both parametric and nonparametric statistics. Comparison of PFSs has been a popular research topic in PFA, but almost all comparisons have employed…
Descriptors: Goodness of Fit, Testing, Test Items, Scores
Uminski, Crystal; Hubbard, Joanna K.; Couch, Brian A. – CBE - Life Sciences Education, 2023
Biology instructors use concept assessments in their courses to gauge student understanding of important disciplinary ideas. Instructors can choose to administer concept assessments based on participation (i.e., lower stakes) or the correctness of responses (i.e., higher stakes), and students can complete the assessment in an in-class or…
Descriptors: Biology, Science Tests, High Stakes Tests, Scores
New Meridian Corporation, 2020
New Meridian Corporation has developed the "Quality Testing Standards and Criteria for Comparability Claims" (QTS) to provide guidance to states that are interested in including New Meridian content and would like to either keep reporting scores on the New Meridian Scale or use the New Meridian performance levels; that is, the state…
Descriptors: Testing, Standards, Comparative Analysis, Test Content
Isbell, Dan; Winke, Paula – Language Testing, 2019
The American Council on the Teaching of Foreign Languages (ACTFL) oral proficiency interview -- computer (OPIc) testing system represents an ambitious effort in language assessment: Assessing oral proficiency in over a dozen languages, on the same scale, from virtually anywhere at any time. Especially for users in contexts where multiple foreign…
Descriptors: Oral Language, Language Tests, Language Proficiency, Second Language Learning