Jeff Allen; Jay Thomas; Stacy Dreyer; Scott Johanningmeier; Dana Murano; Ty Cruce; Xin Li; Edgar Sanchez – ACT Education Corp., 2025
This report describes the process of developing and validating the enhanced ACT. The report describes the changes made to the test content and the processes by which these design decisions were implemented. The authors describe how they shared the overall scope of the enhancements, including the initial blueprints, with external expert panels,…
Descriptors: College Entrance Examinations, Testing, Change, Test Construction
Jeff Allen; Ty Cruce – ACT Education Corp., 2025
This report summarizes some of the evidence supporting interpretations of scores from the enhanced ACT, focusing on reliability, concurrent validity, predictive validity, and score comparability. The authors argue that the evidence presented in this report supports the interpretation of scores from the enhanced ACT as measures of high school…
Descriptors: College Entrance Examinations, Testing, Change, Scores
Uminski, Crystal; Hubbard, Joanna K.; Couch, Brian A. – CBE - Life Sciences Education, 2023
Biology instructors use concept assessments in their courses to gauge student understanding of important disciplinary ideas. Instructors can choose to administer concept assessments based on participation (i.e., lower stakes) or the correctness of responses (i.e., higher stakes), and students can complete the assessment in an in-class or…
Descriptors: Biology, Science Tests, High Stakes Tests, Scores
Rios, Joseph A.; Liu, Ou Lydia – American Journal of Distance Education, 2017
Online higher education institutions are presented with the concern of how to obtain valid results when administering student learning outcomes (SLO) assessments remotely. Traditionally, there has been a great reliance on unproctored Internet test administration (UIT) due to increased flexibility and reduced costs; however, a number of validity…
Descriptors: Online Courses, Testing, Test Wiseness, Academic Achievement
Öz, Hüseyin; Özturan, Tuba – Journal of Language and Linguistic Studies, 2018
This article reports the findings of a study that sought to investigate whether computer-based vs. paper-based test-delivery mode has an impact on the reliability and validity of an achievement test for a pedagogical content knowledge course in an English teacher education program. A total of 97 university students enrolled in the English as a…
Descriptors: Computer Assisted Testing, Testing, Test Format, Teaching Methods
Mitchell, Alison M.; Truckenmiller, Adrea; Petscher, Yaacov – Communique, 2015
As part of the Race to the Top initiative, the United States Department of Education made nearly 1 billion dollars available in State Educational Technology grants with the goal of ramping up school technology. One result of this effort is that states, districts, and schools across the country are using computerized assessments to measure their…
Descriptors: Computer Assisted Testing, Educational Technology, Testing, Efficiency
Wiliam, Dylan – Educational Psychologist, 2010
This article explores the use of standardized tests to hold schools accountable. The history of testing for accountability is reviewed, and it is shown that currently between-school differences account for less than 10% of the variance in student scores, in part because the progress of individuals is small compared to the spread of achievement…
Descriptors: Testing, Standardized Tests, Accountability, Inferences
Rawls, Anita Michelle Wilson – ProQuest LLC, 2009
The study discussed the importance of test validity, often established when making decisions that may affect a student's future. The decisions made by policymakers and educators must not adversely affect any particular subgroups of students (i.e., year of administration, gender, ethnicity, level of English proficiency, socioeconomic status, and…
Descriptors: Test Validity, Reading Tests, Factor Analysis, Item Analysis
Heldsinger, Sandra; Humphry, Stephen – Australian Educational Researcher, 2010
Demands for accountability have seen the implementation of large scale testing programs in Australia and internationally. There is, however, a growing body of evidence to show that externally imposed testing programs do not have a sustained impact on student achievement. It has been argued that teacher assessment is more effective in raising…
Descriptors: Testing Programs, Testing, Academic Achievement, Measures (Individuals)
Tsagari, Dina, Ed.; Csepes, Ildiko, Ed. – Peter Lang Frankfurt, 2012
The Guidelines for Good Practice of the European Association for Language Testing and Assessment (EALTA) stress the importance of collaboration between all parties involved in the process of developing instruments, activities and programmes for testing and assessment. Collaboration is considered to be as important as validity and reliability,…
Descriptors: Sign Language, Testing, Language Tests, Test Validity
Barry, Carol L.; Finney, Sara J. – Research & Practice in Assessment, 2009
The effects of gathering test scores under low-stakes conditions have been a prominent domain of research in the assessment and testing literature. One important area within this larger domain concerns the implications of a test being low-stakes on test evaluation and development. The current study examined one variable, the testing context, that…
Descriptors: Testing, Context Effect, Comparative Analysis, Test Validity

Hoffmann, Norman G.; Butcher, James N. – Journal of Consulting and Clinical Psychology, 1975
Three Minnesota Multiphasic Personality Inventory short forms, the Mini-Mult, Faschingbauer's 166, and the MMPI-168, which were constructed by different methodologies, were compared on a sample of 1,028 psychiatric patients. Results of this study seriously question the use of MMPI short forms for clinical interpretation. (Author)
Descriptors: Comparative Analysis, Personality Measures, Psychological Testing, Test Validity
Lushene, Robert E.; And Others – 1972
Within the context of a counterbalanced design, 63 female students were tested with a computerized Minnesota Multiphasic Personality Inventory (MMPI) and a group booklet mode of administration. State anxiety was measured before and after each testing session. The correlation between the computer-based MMPI scale scores and the booklet…
Descriptors: Anxiety, Comparative Analysis, Computer Assisted Instruction, Intermode Differences

Nickel, Ted – Educational and Psychological Measurement, 1971
Directions are provided for the construction of a reduced-size Rod and Frame Test. Simpler and less expensive, the proposed apparatus has criterion validity parallel to that of the full-sized apparatus. (GS)
Descriptors: Comparative Analysis, Psychological Studies, Sex Differences, Statistical Analysis

Redfering, David L.; Collins, Jackie – Educational and Psychological Measurement, 1982
Forty elementary students were administered the Bender-Gestalt Test using two techniques: Koppitz routine instructions and the Hutt testing-the-limits method. The mean number of Koppitz errors was approximately two greater than the number obtained using the Hutt technique. (Author/BW)
Descriptors: Comparative Analysis, Correlation, Elementary Education, Intelligence Tests