Publication Date
In 2025 | 1 |
Since 2024 | 2 |
Since 2021 (last 5 years) | 7 |
Since 2016 (last 10 years) | 24 |
Since 2006 (last 20 years) | 47 |
Descriptor
Psychometrics | 76 |
Testing | 76 |
Test Validity | 49 |
Test Reliability | 32 |
Test Construction | 27 |
Validity | 18 |
Scoring | 14 |
Language Tests | 12 |
Scores | 12 |
Test Bias | 11 |
Test Items | 10 |
More ▼ |
Source
Author
Publication Type
Education Level
Audience
Practitioners | 2 |
Community | 1 |
Students | 1 |
Location
Illinois | 5 |
Nebraska | 3 |
California | 1 |
China (Beijing) | 1 |
Colombia | 1 |
Delaware | 1 |
Iceland | 1 |
India | 1 |
Malawi | 1 |
Maryland | 1 |
North Carolina | 1 |
More ▼ |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Maciej Koscielniak; Jolanta Enko; Agata Gasiorowska – Journal of Academic Ethics, 2024
Examination dishonesty is a global problem that became particularly critical after the outbreak of the COVID-19 pandemic and the shift to remote learning. Academic research has often examined this phenomenon as only one aspect of a broader concept of academic dishonesty and as a one-dimensional construct. This article builds on existing knowledge…
Descriptors: Foreign Countries, Students, Ethics, Cheating
Shun-Fu Hu; Amery D. Wu; Jake Stone – Journal of Educational Measurement, 2025
Scoring high-dimensional assessments (e.g., > 15 traits) can be a challenging task. This paper introduces the multilabel neural network (MNN) as a scoring method for high-dimensional assessments. Additionally, it demonstrates how MNN can score the same test responses to maximize different performance metrics, such as accuracy, recall, or…
Descriptors: Tests, Testing, Scores, Test Construction
Practices in Instrument Use and Development in "Chemistry Education Research and Practice" 2010-2021
Lazenby, Katherine; Tenney, Kristin; Marcroft, Tina A.; Komperda, Regis – Chemistry Education Research and Practice, 2023
Assessment instruments that generate quantitative data on attributes (cognitive, affective, behavioral, "etc.") of participants are commonly used in the chemistry education community to draw conclusions in research studies or inform practice. Recently, articles and editorials have stressed the importance of providing evidence for the…
Descriptors: Chemistry, Periodicals, Journal Articles, Science Education
F. Alethea Marti; Nadereh Pourat; Christopher Lee; Bonnie T. Zima – Administration and Policy in Mental Health and Mental Health Services Research, 2022
While many standardized assessment measures exist to track child mental health treatment outcomes, the degree to which such tools have been adequately tested for reliability and validity across race, ethnicity, and class is uneven. This paper examines the corpus of published tests of psychometric properties for the ten standardized measures used…
Descriptors: Mental Health, Outcome Measures, Psychometrics, Standardized Tests
da Silva, Mônia Aparecida; de Mendonça Filho, Euclides J.; Mônego, Bruna G.; Bandeira, Denise R. – Early Child Development and Care, 2020
This study is a systematic review designed to identify the instruments most frequently used to evaluate children's development, describe their operational and psychometric characteristics and determine which are the most accurate. We carried out a systematic search of the online databases PsycINFO and PubMed Central using the descriptors…
Descriptors: Child Development, Measures (Individuals), Psychometrics, Accuracy
Torre, Jimmy de la; Akbay, Lokman – Eurasian Journal of Educational Research, 2019
Purpose: Well-designed assessment methodologies and various cognitive diagnosis models (CDMs) to extract diagnostic information about examinees' individual strengths and weaknesses have been developed. Due to this novelty, as well as educational specialists' lack of familiarity with CDMs, their applications are not widespread. This article aims at…
Descriptors: Cognitive Measurement, Models, Computer Software, Testing
Nixi Wang – ProQuest LLC, 2022
Measurement errors attributable to cultural issues are complex and challenging for educational assessments. We need assessment tests sensitive to the cultural heterogeneity of populations, and psychometric methods appropriate to address fairness and equity concerns. Built on the research of culturally responsive assessment, this dissertation…
Descriptors: Culturally Relevant Education, Testing, Equal Education, Validity
Embretson, Susan E. – Educational Measurement: Issues and Practice, 2016
Examinees' thinking processes have become an increasingly important concern in testing. The responses processes aspect is a major component of validity, and contemporary tests increasingly involve specifications about the cognitive complexity of examinees' response processes. Yet, empirical research findings on examinees' cognitive processes are…
Descriptors: Testing, Cognitive Processes, Test Construction, Test Items
Norris, John; Drackert, Anastasia – Language Testing, 2018
The Test of German as a Foreign Language (TestDaF) plays a critical role as a standardized test of German language proficiency. Developed and administered by the Society for Academic Study Preparation and Test Development (g.a.s.t.), TestDaF was launched in 2001 and has experienced persistent annual growth, with more than 44,000 test takers in…
Descriptors: German, Second Language Learning, Language Tests, Language Proficiency
Schmitz, Florian; Wilhelm, Oliver – Journal of Intelligence, 2019
Current taxonomies of intelligence comprise two factors of mental speed, clerical speed (Gs), and elementary cognitive speed (Gt). Both originated from different research traditions and are conceptualized as dissociable constructs in current taxonomies. However, previous research suggests that tasks of one category can be transferred into the…
Descriptors: Taxonomy, Intelligence Tests, Testing, Test Format
Lenz, A. Stephen; Wester, Kelly L. – Measurement and Evaluation in Counseling and Development, 2017
It is imperative that counselors understand how to critically evaluate assessments before using them to make clinical decisions. This evaluation can be conducted through integrating the 5 sources of validity. Each source of validity is discussed, along with methods to appraise psychometric quality, throughout this special issue.
Descriptors: Counseling Techniques, Educational Assessment, Psychological Evaluation, Clinical Diagnosis
Nebraska Department of Education, 2022
In Winter 2021-2022, the Nebraska Student-Centered Assessment System (NSCAS) assessments are administered in ELA and mathematics in Grades 3-8. In Spring 2021-2022, the NSCAS assessments are administered in English language arts (ELA) and mathematics in Grades 3-8 and in science in Grades 5 and 8. The purposes of the NSCAS assessments are to…
Descriptors: English, Language Arts, Student Centered Learning, Mathematics Tests
Nebraska Department of Education, 2023
In Fall and Winter 2022-2023, the NSCAS assessments were administered in ELA and mathematics for grades 3-8. In Spring 2022-2023, the NSCAS assessments were administered in English language arts (ELA) and mathematics for grades 3-8 and in science for grades 5 and 8. The purposes of the NSCAS assessments are to measure and report Nebraska students'…
Descriptors: English, Language Arts, Student Centered Learning, Mathematics Tests
Nebraska Department of Education, 2018
The 2018 Nebraska Student-Centered Assessment System (NSCAS) Summative technical report documents the processes and procedures implemented to support the Spring 2018 NSCAS Summative English Language Arts (ELA), Mathematics, and Science assessments by NWEA under the supervision of the Nebraska Department of Education (NDE). The technical report…
Descriptors: Summative Evaluation, Language Tests, English, Mathematics Tests
Oliveri, María Elena; von Davier, Alina A. – International Journal of Testing, 2016
In this study, we propose that the unique needs and characteristics of linguistic minorities should be considered throughout the test development process. Unlike most measurement invariance investigations in the assessment of linguistic minorities, which typically are conducted after test administration, we propose strategies that focus on the…
Descriptors: Psychometrics, Linguistics, Test Construction, Testing