Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 2 |
Since 2016 (last 10 years) | 6 |
Since 2006 (last 20 years) | 29 |
Descriptor
Comparative Analysis | 89 |
Testing | 89 |
Test Validity | 64 |
Test Reliability | 35 |
Validity | 18 |
Language Tests | 17 |
Scores | 16 |
Foreign Countries | 13 |
Statistical Analysis | 13 |
Test Construction | 13 |
English (Second Language) | 12 |
More ▼ |
Source
Author
Publication Type
Education Level
Higher Education | 10 |
Postsecondary Education | 6 |
Elementary Education | 3 |
Elementary Secondary Education | 2 |
Adult Education | 1 |
Grade 3 | 1 |
Grade 4 | 1 |
Grade 5 | 1 |
High Schools | 1 |
Intermediate Grades | 1 |
Preschool Education | 1 |
More ▼ |
Audience
Practitioners | 2 |
Administrators | 1 |
Teachers | 1 |
Laws, Policies, & Programs
Rehabilitation Act 1973… | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
Uminski, Crystal; Hubbard, Joanna K.; Couch, Brian A. – CBE - Life Sciences Education, 2023
Biology instructors use concept assessments in their courses to gauge student understanding of important disciplinary ideas. Instructors can choose to administer concept assessments based on participation (i.e., lower stakes) or the correctness of responses (i.e., higher stakes), and students can complete the assessment in an in-class or…
Descriptors: Biology, Science Tests, High Stakes Tests, Scores
Schmidt, Henk G.; Baars, Gerard J. A.; Hermus, Peter; van der Molen, Henk T.; Arnold, Ivo J. M.; Smeets, Guus – European Journal of Higher Education, 2022
The purpose of the study reported here was to observe the effects of examination practices on the extent to which university students procrastinate. These examination practices were: (1) limiting the number of resits, (2) compensatory rather than conjunctive decision-making about student progress, and (3) restricting the time available for…
Descriptors: Educational Change, Study Habits, Decision Making, Undergraduate Students
Park, Ryoungsun; Kim, Jiseon; Chung, Hyewon; Dodd, Barbara G. – Educational and Psychological Measurement, 2017
The current study proposes novel methods to predict multistage testing (MST) performance without conducting simulations. This method, called MST test information, is based on analytic derivation of standard errors of ability estimates across theta levels. We compared standard errors derived analytically to the simulation results to demonstrate the…
Descriptors: Testing, Performance, Prediction, Error of Measurement
Rios, Joseph A.; Liu, Ou Lydia – American Journal of Distance Education, 2017
Online higher education institutions are presented with the concern of how to obtain valid results when administering student learning outcomes (SLO) assessments remotely. Traditionally, there has been a great reliance on unproctored Internet test administration (UIT) due to increased flexibility and reduced costs; however, a number of validity…
Descriptors: Online Courses, Testing, Test Wiseness, Academic Achievement
Bailey, Janelle M.; Johnson, Bruce; Prather, Edward E.; Slater, Timothy F. – International Journal of Science Education, 2012
Concept inventories (CIs)--typically multiple-choice instruments that focus on a single or small subset of closely related topics--have been used in science education for more than a decade. This paper describes the development and validation of a new CI for astronomy, the "Star Properties Concept Inventory" (SPCI). Questions cover the areas of…
Descriptors: Educational Strategies, Validity, Testing, Astronomy
Peters, Scott J.; Gentry, Marcia – Gifted Child Quarterly, 2013
The "HOPE Scale" was developed to identify academic and social components of giftedness and talent in elementary-aged students with particular attention to students from low-income and/or culturally diverse families. Based on previous findings, additional research was conducted on revisions made to the "HOPE Scale". Items were…
Descriptors: Validity, Achievement Tests, Rating Scales, Low Income Groups
Öz, Hüseyin; Özturan, Tuba – Journal of Language and Linguistic Studies, 2018
This article reports the findings of a study that sought to investigate whether computer-based vs. paper-based test-delivery mode has an impact on the reliability and validity of an achievement test for a pedagogical content knowledge course in an English teacher education program. A total of 97 university students enrolled in the English as a…
Descriptors: Computer Assisted Testing, Testing, Test Format, Teaching Methods
Chan, Christopher – Communications in Information Literacy, 2016
With increasing interest in the assessment of learning outcomes in higher education, stakeholders are demanding concrete evidence of student learning. This applies no less to information literacy outcomes, which have been adopted by many colleges and universities around the world. This article describes the experience of a university library in…
Descriptors: Foreign Countries, College Libraries, Testing, Standardized Tests
Mitchell, Alison M.; Truckenmiller, Adrea; Petscher, Yaacov – Communique, 2015
As part of the Race to the Top initiative, the United States Department of Education made nearly 1 billion dollars available in State Educational Technology grants with the goal of ramping up school technology. One result of this effort is that states, districts, and schools across the country are using computerized assessments to measure their…
Descriptors: Computer Assisted Testing, Educational Technology, Testing, Efficiency
Ghilay, Yaron; Ghilay, Ruth – Journal of Educational Technology, 2012
The study examined advantages and disadvantages of computerised assessment compared to traditional evaluation. It was based on two samples of college students (n=54) being examined in computerised tests instead of paper-based exams. Students were asked to answer a questionnaire focused on test effectiveness, experience, flexibility and integrity.…
Descriptors: Student Evaluation, Higher Education, Comparative Analysis, Computer Assisted Testing
Lu, Chia-Chen; Luh, Ding-Bang – Creativity Research Journal, 2012
Although previous studies have attempted to use different experiences of raters to rate product creativity by adopting the Consensus Assessment Method (CAT) approach, the validity of replacing CAT with another measurement tool has not been adequately tested. This study aimed to compare raters with different levels of experience (expert ves.…
Descriptors: Creativity, Interrater Reliability, Construct Validity, Comparative Analysis
Keselman, H. J.; Miller, Charles W.; Holland, Burt – Psychological Methods, 2011
There have been many discussions of how Type I errors should be controlled when many hypotheses are tested (e.g., all possible comparisons of means, correlations, proportions, the coefficients in hierarchical models, etc.). By and large, researchers have adopted familywise (FWER) control, though this practice certainly is not universal. Familywise…
Descriptors: Validity, Statistical Significance, Probability, Computation
Barrueco, Sandra; Lopez, Michael; Ong, Christine; Lozano, Patricia – Brookes Publishing Company, 2012
As the population of young dual language learners continues to rise, how can early childhood professionals choose culturally and linguistically appropriate assessments for Spanish-English bilingual preschoolers? They'll get expert guidance in this one-of-a-kind resource, a comprehensive roundup and analysis of 37 developmental assessments…
Descriptors: Disabilities, Preschool Children, Psychometrics, English (Second Language)
Wiliam, Dylan – Educational Psychologist, 2010
This article explores the use of standardized tests to hold schools accountable. The history of testing for accountability is reviewed, and it is shown that currently between-school differences account for less than 10% of the variance in student scores, in part because the progress of individuals is small compared to the spread of achievement…
Descriptors: Testing, Standardized Tests, Accountability, Inferences
Brambring, Michael; Asbrock, Doreen – Journal of Autism and Developmental Disorders, 2010
Previous studies have reported that congenitally blind children without any additional impairment reveal a developmental delay of at least 4 years in perspective taking based on testing first-order false-belief tasks. These authors interpret this delay as a sign of autism-like behavior. However, the delay may be caused by testing blind children…
Descriptors: Blindness, Autism, Testing, Perspective Taking