Shun-Fu Hu; Amery D. Wu; Jake Stone – Journal of Educational Measurement, 2025
Scoring high-dimensional assessments (e.g., > 15 traits) can be a challenging task. This paper introduces the multilabel neural network (MNN) as a scoring method for high-dimensional assessments. Additionally, it demonstrates how MNN can score the same test responses to maximize different performance metrics, such as accuracy, recall, or…
Descriptors: Tests, Testing, Scores, Test Construction
Practices in Instrument Use and Development in "Chemistry Education Research and Practice" 2010-2021
Lazenby, Katherine; Tenney, Kristin; Marcroft, Tina A.; Komperda, Regis – Chemistry Education Research and Practice, 2023
Assessment instruments that generate quantitative data on attributes (cognitive, affective, behavioral, etc.) of participants are commonly used in the chemistry education community to draw conclusions in research studies or inform practice. Recently, articles and editorials have stressed the importance of providing evidence for the…
Descriptors: Chemistry, Periodicals, Journal Articles, Science Education
F. Alethea Marti; Nadereh Pourat; Christopher Lee; Bonnie T. Zima – Administration and Policy in Mental Health and Mental Health Services Research, 2022
While many standardized assessment measures exist to track child mental health treatment outcomes, the degree to which such tools have been adequately tested for reliability and validity across race, ethnicity, and class is uneven. This paper examines the corpus of published tests of psychometric properties for the ten standardized measures used…
Descriptors: Mental Health, Outcome Measures, Psychometrics, Standardized Tests
da Silva, Mônia Aparecida; de Mendonça Filho, Euclides J.; Mônego, Bruna G.; Bandeira, Denise R. – Early Child Development and Care, 2020
This study is a systematic review designed to identify the instruments most frequently used to evaluate children's development, describe their operational and psychometric characteristics and determine which are the most accurate. We carried out a systematic search of the online databases PsycINFO and PubMed Central using the descriptors…
Descriptors: Child Development, Measures (Individuals), Psychometrics, Accuracy
Norris, John; Drackert, Anastasia – Language Testing, 2018
The Test of German as a Foreign Language (TestDaF) plays a critical role as a standardized test of German language proficiency. Developed and administered by the Society for Academic Study Preparation and Test Development (g.a.s.t.), TestDaF was launched in 2001 and has experienced persistent annual growth, with more than 44,000 test takers in…
Descriptors: German, Second Language Learning, Language Tests, Language Proficiency
Schmitz, Florian; Wilhelm, Oliver – Journal of Intelligence, 2019
Current taxonomies of intelligence comprise two factors of mental speed: clerical speed (Gs) and elementary cognitive speed (Gt). Both originated from different research traditions and are conceptualized as dissociable constructs in current taxonomies. However, previous research suggests that tasks of one category can be transferred into the…
Descriptors: Taxonomy, Intelligence Tests, Testing, Test Format
Nebraska Department of Education, 2022
In Winter 2021-2022, the Nebraska Student-Centered Assessment System (NSCAS) assessments are administered in ELA and mathematics in Grades 3-8. In Spring 2021-2022, the NSCAS assessments are administered in English language arts (ELA) and mathematics in Grades 3-8 and in science in Grades 5 and 8. The purposes of the NSCAS assessments are to…
Descriptors: English, Language Arts, Student Centered Learning, Mathematics Tests
Nebraska Department of Education, 2023
In Fall and Winter 2022-2023, the NSCAS assessments were administered in ELA and mathematics for grades 3-8. In Spring 2022-2023, the NSCAS assessments were administered in English language arts (ELA) and mathematics for grades 3-8 and in science for grades 5 and 8. The purposes of the NSCAS assessments are to measure and report Nebraska students'…
Descriptors: English, Language Arts, Student Centered Learning, Mathematics Tests
Nebraska Department of Education, 2018
The 2018 Nebraska Student-Centered Assessment System (NSCAS) Summative technical report documents the processes and procedures implemented to support the Spring 2018 NSCAS Summative English Language Arts (ELA), Mathematics, and Science assessments by NWEA under the supervision of the Nebraska Department of Education (NDE). The technical report…
Descriptors: Summative Evaluation, Language Tests, English, Mathematics Tests
Oliveri, María Elena; von Davier, Alina A. – International Journal of Testing, 2016
In this study, we propose that the unique needs and characteristics of linguistic minorities should be considered throughout the test development process. Unlike most measurement invariance investigations in the assessment of linguistic minorities, which typically are conducted after test administration, we propose strategies that focus on the…
Descriptors: Psychometrics, Linguistics, Test Construction, Testing
Regional Educational Laboratory Midwest, 2019
At least half of states administer or are developing kindergarten entry assessments. In fall 2017 the Illinois State Board of Education began requiring teachers to report data on every child's skills at kindergarten entry using the Kindergarten Individual Development Survey, which was adapted from a California kindergarten assessment. This study…
Descriptors: Kindergarten, School Readiness, Test Validity, Test Reliability
Bowdon, Jill; Dahlke, Katie; Yang, Rui; Pan, Jingtong; Marcus, Jill; Lemieux, Camille – Regional Educational Laboratory Midwest, 2019
At least half of states administer or are developing kindergarten entry assessments. In fall 2017 the Illinois State Board of Education began requiring teachers to report data on every child's skills at kindergarten entry using the Kindergarten Individual Development Survey. State and local stakeholders have asked for more information on the…
Descriptors: Kindergarten, School Readiness, Public Schools, Test Validity
Regional Educational Laboratory Midwest, 2019
At least half of states administer or are developing kindergarten entry assessments. In fall 2017 the Illinois State Board of Education began requiring teachers to report data on every child's skills at kindergarten entry using the Kindergarten Individual Development Survey. State and local stakeholders have asked for more information on the…
Descriptors: Kindergarten, School Readiness, Public Schools, Test Validity
Ackerman, Debra J. – ETS Research Report Series, 2018
Kindergarten entry assessments (KEAs) have increasingly been incorporated into state education policies over the past 5 years, with much of this interest stemming from Race to the Top–Early Learning Challenge (RTT-ELC) awards, Enhanced Assessment Grants, and nationwide efforts to develop common K-12 state learning standards. Drawing on…
Descriptors: Screening Tests, Kindergarten, Test Validity, Test Reliability
International Journal of Testing, 2019
These guidelines describe considerations relevant to the assessment of test takers in or across countries or regions that are linguistically or culturally diverse. The guidelines were developed by a committee of experts to help inform test developers, psychometricians, test users, and test administrators about fairness issues in support of the…
Descriptors: Test Bias, Student Diversity, Cultural Differences, Language Usage