Publication Date
| In 2026 | 0 |
| Since 2025 | 49 |
| Since 2022 (last 5 years) | 211 |
| Since 2017 (last 10 years) | 492 |
| Since 2007 (last 20 years) | 984 |
Descriptor
| Test Validity | 3908 |
| Test Reliability | 1517 |
| Testing | 1090 |
| Test Construction | 1014 |
| Testing Problems | 1008 |
| Computer Assisted Testing | 616 |
| Elementary Secondary Education | 553 |
| Foreign Countries | 494 |
| Higher Education | 490 |
| Standardized Tests | 488 |
| Test Interpretation | 433 |
| More ▼ | |
Source
Author
| Ebel, Robert L. | 16 |
| Hambleton, Ronald K. | 13 |
| Green, Donald Ross | 10 |
| Popham, W. James | 10 |
| Linn, Robert L. | 9 |
| Haney, Walt | 8 |
| Koretz, Daniel | 8 |
| Sireci, Stephen G. | 8 |
| Thompson, Bruce | 8 |
| Tindal, Gerald | 8 |
| Hilliard, Asa G., III | 7 |
| More ▼ | |
Publication Type
Education Level
Audience
| Practitioners | 137 |
| Researchers | 134 |
| Teachers | 51 |
| Administrators | 34 |
| Policymakers | 18 |
| Counselors | 11 |
| Students | 8 |
| Parents | 5 |
| Support Staff | 4 |
| Community | 2 |
Location
| Canada | 57 |
| Australia | 40 |
| California | 40 |
| China | 34 |
| United Kingdom (England) | 31 |
| United Kingdom | 29 |
| New York | 28 |
| United States | 26 |
| Florida | 22 |
| Germany | 21 |
| Turkey | 20 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Shanmugam, S. Kanageswari Suppiah; Lan, Ong Saw – Malaysian Journal of Learning and Instruction, 2013
Purpose: This study aims to investigate the validity of using bilingual test to measure the mathematics achievement of students who have limited English proficiency (LEP). The bilingual test and the English-only test consist of 20 computation and 20 word problem multiple-choice questions (from TIMSS 2003 and 2007 released items. The bilingual test…
Descriptors: Bilingualism, Language Tests, Limited English Speaking, English (Second Language)
Thissen, David; Norton, Scott – American Institutes for Research, 2013
Development of the Common Core State Standards (CCSS), and the creation of the Smarter Balanced Assessment Consortium (Smarter Balanced) and the Partnership for Assessment of Readiness for College and Careers (PARCC), changes the pattern of accountability testing. These changes raise the question: "How should NAEP's validity and utility be…
Descriptors: National Competency Tests, Psychometrics, State Standards, Academic Standards
Dolan, Robert P.; Burling, Kelly; Harms, Michael; Strain-Seymour, Ellen; Way, Walter; Rose, David H. – Pearson, 2013
The increased capabilities offered by digital technologies offer new opportunities to evaluate students' deeper knowledge and skills and on constructs that are difficult to measure using traditional methods. Such assessments can also incorporate tools and interfaces that improve accessibility for diverse students, as well as inadvertently…
Descriptors: Educational Technology, Technology Uses in Education, Access to Education, Evaluation Methods
Abedi, Jamal – Educational Assessment, 2009
This study compared performance of both English language learners (ELLs) and non-ELL students in Grades 4 and 8 under accommodated and nonaccommodated testing conditions. The accommodations used in this study included a computerized administration of a math test with a pop-up glossary, a customized English dictionary, extra testing time, and…
Descriptors: Computer Assisted Testing, Testing Accommodations, Mathematics Tests, Grade 4
Matson, Johnny L.; Wilkins, Jonathan – Research in Developmental Disabilities: A Multidisciplinary Journal, 2009
Social skill excesses and deficits have garnered considerable attention from researchers and clinicians over the last three decades. This trend is undoubtedly due to the central role these problems play in psychopathology and the general adjustment of children of all ages. Not surprisingly, these concerns and attention to such problems have also…
Descriptors: Testing, Psychopathology, Psychometrics, Interpersonal Competence
Cizek, Gregory J.; Bowen, Daniel; Church, Keri – Educational and Psychological Measurement, 2010
This study followed up on previous work that examined the incidence of reporting evidence based on test consequences in "Mental Measurements Yearbook". In the present study, additional possible outlets for what has been called "consequential validity" evidence were investigated, including all articles published in the past 10 years in several…
Descriptors: Educational Research, Educational Assessment, Psychological Testing, Followup Studies
Miller, Matthew J. – Journal of Counseling Psychology, 2010
This study attempted to replicate Miller's (2007) finding that a bilinear domain-specific model of Asian American acculturation demonstrated superior model fit when compared to unilinear and bilinear domain-generic models. Current confirmatory factor analytic tests of competing acculturation models in a cross-validation sample of 306 participants…
Descriptors: Acculturation, Asian Americans, Immigrants, Models
Kass, Darrin; Grandzol, Christian; Bommer, William – Journal of Education for Business, 2012
Consistent with previous research, the authors found that the combined use of undergraduate grade point average and the Graduate Management Admission Test (GMAT) verbal and quantitative sections successfully predicted performance in a master of business administration (MBA) program. However, these measures did not successfully predict the…
Descriptors: College Entrance Examinations, Grade Point Average, Program Effectiveness, Business Administration Education
Ezzelle, Carol; Setzer, J. Carl – GED Testing Service, 2009
This manual was written to provide technical information regarding the 2002 Series GED (General Educational Development) Tests. Throughout this manual, documentation is provided regarding the development of the GED Tests, data collection activities, as well as reliability and validity evidence. The purpose of this manual is to provide evidence…
Descriptors: High School Equivalency Programs, Testing Programs, Test Validity, Test Reliability
Biber, Douglas; Gray, Bethany – ETS Research Report Series, 2013
One of the major innovations of the "TOEFL iBT"® test is the incorporation of integrated tasks complementing the independent tasks to which examinees respond. In addition, examinees must produce discourse in both modes (speech and writing). The validity argument for the TOEFL iBT includes the claim that examinees vary their discourse in…
Descriptors: Discourse Analysis, English (Second Language), Second Language Learning, Language Tests
Chou, Yu-Chi – ProQuest LLC, 2013
This dissertation consists of four chapters. Chapter 1 provides an introduction to the self-determination literature documenting the importance of promoting the self-determination of transition and secondary age students with disabilities, as well as a summary of research examining the self-determination of students with disabilities across…
Descriptors: Self Determination, Disabilities, Transitional Programs, Autism
Haberman, Shelby J. – Educational Testing Service, 2011
Alternative approaches are discussed for use of e-rater[R] to score the TOEFL iBT[R] Writing test. These approaches involve alternate criteria. In the 1st approach, the predicted variable is the expected rater score of the examinee's 2 essays. In the 2nd approach, the predicted variable is the expected rater score of 2 essay responses by the…
Descriptors: Writing Tests, Scoring, Essays, Language Tests
Moore, Delilah S.; Ellis, Rebecca; Kosma, Maria; Fabre, Jennifer M.; McCarter, Kevin S.; Wood, Robert H. – Research Quarterly for Exercise and Sport, 2011
We examined the measurement properties of fall-related psychological instruments with a sample of 133 older adults (M age = 74.4 years, SD = 9.4). Measures included the Comprehensive Falls Risk Screening Instrument, Falls-efficacy Scale-International (FES-I), Activities-specific Balance Confidence (ABC), modified Survey of Activities and Fear of…
Descriptors: Comparative Analysis, Psychological Testing, Test Validity, Test Reliability
Sparfeldt, Jorn R.; Kimmel, Rumena; Lowenkamp, Lena; Steingraber, Antje; Rost, Detlef H. – Educational Assessment, 2012
Multiple-choice (MC) reading comprehension test items comprise three components: text passage, questions about the text, and MC answers. The construct validity of this format has been repeatedly criticized. In three between-subjects experiments, fourth graders (N[subscript 1] = 230, N[subscript 2] = 340, N[subscript 3] = 194) worked on three…
Descriptors: Test Items, Reading Comprehension, Construct Validity, Grade 4
Pellegrino, James W. – Journal of Research in Science Teaching, 2012
Beginning with a reference to living in a time of both uncertainty and opportunity, this article presents a discussion of key areas where shared understanding is needed if we are to successfully realize the design and use of high quality, valid assessments of science. The key areas discussed are: (1) assessment purpose and use, (2) the nature of…
Descriptors: Science Education, Science and Society, Academic Standards, State Standards

Peer reviewed
Direct link
