Norris, John; Drackert, Anastasia – Language Testing, 2018
The Test of German as a Foreign Language (TestDaF) plays a critical role as a standardized test of German language proficiency. Developed and administered by the Society for Academic Study Preparation and Test Development (g.a.s.t.), TestDaF was launched in 2001 and has experienced persistent annual growth, with more than 44,000 test takers in…
Descriptors: German, Second Language Learning, Language Tests, Language Proficiency
Zhang, Dongbo; Koda, Keiko – Asian-Pacific Journal of Second and Foreign Language Education, 2017
Word Associates Format (WAF) tests are often used to measure second language learners' vocabulary depth with a focus on their network knowledge. Yet, there were often many variations in the specific forms of the tests and the ways they were used, which tended to have an impact on learners' response behaviors and, more importantly, the psychometric…
Descriptors: Language Tests, Vocabulary Development, Second Language Learning, Test Construction
Briggs, Derek C. – Assessment in Education: Principles, Policy & Practice, 2017
In the United States, students have historically taken large-scale assessments for many different purposes. One purpose that is shared with many other countries is a desire to monitor aggregate trends in educational attainment in core subject domains such as literacy, mathematics, and science. In this commentary, the author examines testing,…
Descriptors: Educational Assessment, Learning Theories, Learning, Psychometrics
Nebraska Department of Education, 2018
The 2018 Nebraska Student-Centered Assessment System (NSCAS) Summative technical report documents the processes and procedures implemented to support the Spring 2018 NSCAS Summative English Language Arts (ELA), Mathematics, and Science assessments by NWEA under the supervision of the Nebraska Department of Education (NDE). The technical report…
Descriptors: Summative Evaluation, Language Tests, English, Mathematics Tests
Dumas, Denis G.; McNeish, Daniel M. – Educational Researcher, 2017
Single-timepoint educational measurement practices are capable of assessing student ability at the time of testing but are not designed to be informative of student capacity for developing in any particular academic domain, despite commonly being used in such a manner. For this reason, such measurement practice systematically underestimates the…
Descriptors: Measurement Techniques, Student Evaluation, Evaluation Methods, Testing
Arendasy, Martin E.; Sommer, Markus – Learning and Individual Differences, 2012
The use of new test administration technologies such as computerized adaptive testing in high-stakes educational and occupational assessments demands large item pools. Classic item construction processes and previous approaches to automatic item generation faced the problems of a considerable loss of items after the item calibration phase. In this…
Descriptors: Item Banks, Test Items, Adaptive Testing, Psychometrics
Warne, Russell T. – Roeper Review, 2012
Above-level testing (also called "out-of-level testing," "off-grade testing," and "off-level testing") is the practice of administering a test level that was designed for and normed on an older population to a gifted child. This comprehensive literature review traces the practice of above-level testing from the…
Descriptors: Evidence, Gifted, Testing, Psychometrics
McCrimmon, Adam W.; Smith, Amanda D. – Journal of Psychoeducational Assessment, 2013
The Wechsler Abbreviated Scale of Intelligence, Second Edition (WASI-II; Wechsler, 2011), published by Pearson, is a newly updated abbreviated measure of cognitive intelligence designed for individuals 6 to 90 years of age. Primarily used in clinical, psychoeducational, and research settings, the WASI-II was developed to quickly and accurately…
Descriptors: Intelligence Tests, Testing, Masters Degrees, Doctoral Degrees
Guo, Hongwen – Psychometrika, 2010
After many equatings have been conducted in a testing program, equating errors can accumulate to a degree that is not negligible compared to the standard error of measurement. In this paper, the author investigates the asymptotic accumulative standard error of equating (ASEE) for linear equating methods, including chained linear, Tucker, and…
Descriptors: Testing Programs, Testing, Error of Measurement, Equated Scores
McNamara, Tim – Language Testing, 2011
The paper by Wilson and Moore (this volume), based on the Messick Lecture delivered in 2006 at the annual Language Testing Research Colloquium in Melbourne, may present a familiar challenge to some language testers: of reading outside one's comfort zone. The distinctive character of language testing lies in its combination of two primary fields of…
Descriptors: Expertise, Applied Linguistics, Testing, Language Tests
Dorans, Neil J. – Educational Measurement: Issues and Practice, 2012
Views on testing--its purpose and uses and how its data are analyzed--are related to one's perspective on test takers. Test takers can be viewed as learners, examinees, or contestants. I briefly discuss the perspective of test takers as learners. I maintain that much of psychometrics views test takers as examinees. I discuss test takers as a…
Descriptors: Testing, Test Theory, Item Response Theory, Test Reliability
Bagner, Daniel M.; Rodriguez, Gabriela M.; Blake, Clair A.; Linares, Dainelys; Carter, Alice S. – Clinical Child and Family Psychology Review, 2012
Behavioral and emotional problems are highly prevalent in early childhood and represent an important focus of practice for clinical child and pediatric psychologists. Although psychological or psychiatric disorders are not typically diagnosed in children under the age of 2 years, recent research has demonstrated the appropriateness of assessing…
Descriptors: Evidence, Emotional Problems, Early Intervention, Psychologists
Barrueco, Sandra; Lopez, Michael; Ong, Christine; Lozano, Patricia – Brookes Publishing Company, 2012
As the population of young dual language learners continues to rise, how can early childhood professionals choose culturally and linguistically appropriate assessments for Spanish-English bilingual preschoolers? They'll get expert guidance in this one-of-a-kind resource, a comprehensive roundup and analysis of 37 developmental assessments…
Descriptors: Disabilities, Preschool Children, Psychometrics, English (Second Language)
Guler, Nese; Penfield, Randall D. – Journal of Educational Measurement, 2009
In this study, we investigate the logistic regression (LR), Mantel-Haenszel (MH), and Breslow-Day (BD) procedures for the simultaneous detection of both uniform and nonuniform differential item functioning (DIF). A simulation study was used to assess and compare the Type I error rate and power of a combined decision rule (CDR), which assesses DIF…
Descriptors: Test Bias, Simulation, Test Items, Measurement
Puhan, Gautam; Moses, Timothy P.; Grant, Mary C.; McHale, Frederick – Journal of Educational Measurement, 2009
A single-group (SG) equating with nearly equivalent test forms (SiGNET) design was developed by Grant to equate small-volume tests. Under this design, the scored items for the operational form are divided into testlets or mini tests. An additional testlet is created but not scored for the first form. If the scored testlets are testlets 1-6 and the…
Descriptors: Equated Scores, Test Construction, Measurement, Measures (Individuals)