Publication Date
In 2025 | 2 |
Since 2024 | 5 |
Since 2021 (last 5 years) | 19 |
Since 2016 (last 10 years) | 50 |
Since 2006 (last 20 years) | 152 |
Descriptor
Psychometrics | 269 |
Testing | 269 |
Test Construction | 61 |
Measurement Techniques | 51 |
Test Validity | 49 |
Evaluation Methods | 47 |
Test Reliability | 47 |
Test Items | 42 |
Models | 41 |
Measurement | 38 |
Scores | 37 |
More ▼ |
Source
Author
Dunne, Michael P. | 3 |
Lord, Frederic M. | 3 |
Mislevy, Robert J. | 3 |
Runyan, Desmond K. | 3 |
Zolotor, Adam J. | 3 |
Angoff, William H. | 2 |
Arendasy, Martin E. | 2 |
Bartram, Dave | 2 |
Embretson, Susan E. | 2 |
Isaeva, Oksana | 2 |
Jain, Dipty | 2 |
More ▼ |
Publication Type
Education Level
Audience
Practitioners | 7 |
Researchers | 6 |
Students | 4 |
Administrators | 2 |
Teachers | 2 |
Community | 1 |
Counselors | 1 |
Policymakers | 1 |
Location
Illinois | 6 |
Australia | 4 |
India | 4 |
United States | 4 |
Nebraska | 3 |
Canada | 2 |
China | 2 |
Colombia | 2 |
Germany | 2 |
Greece | 2 |
Japan | 2 |
More ▼ |
Laws, Policies, & Programs
No Child Left Behind Act 2001 | 2 |
Individuals with Disabilities… | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
Kaiwen Man – Educational and Psychological Measurement, 2024
In various fields, including college admission, medical board certifications, and military recruitment, high-stakes decisions are frequently made based on scores obtained from large-scale assessments. These decisions necessitate precise and reliable scores that enable valid inferences to be drawn about test-takers. However, the ability of such…
Descriptors: Prior Learning, Testing, Behavior, Artificial Intelligence
Nixi Wang – ProQuest LLC, 2022
Measurement errors attributable to cultural issues are complex and challenging for educational assessments. We need assessment tests sensitive to the cultural heterogeneity of populations, and psychometric methods appropriate to address fairness and equity concerns. Built on the research of culturally responsive assessment, this dissertation…
Descriptors: Culturally Relevant Education, Testing, Equal Education, Validity
Kentaro Fukushima; Nao Uchida; Kensuke Okada – Journal of Educational and Behavioral Statistics, 2025
Diagnostic tests are typically administered in a multiple-choice (MC) format due to their advantages of objectivity and time efficiency. The MC-deterministic input, noisy "and" gate (DINA) family of models, a representative class of cognitive diagnostic models for MC items, efficiently and parsimoniously estimates the mastery profiles of…
Descriptors: Diagnostic Tests, Cognitive Measurement, Multiple Choice Tests, Educational Assessment
Kim, Sooyeon; Walker, Michael – ETS Research Report Series, 2021
In this investigation, we used real data to assess potential differential effects associated with taking a test in a test center (TC) versus testing at home using remote proctoring (RP). We used a pseudo-equivalent groups (PEG) approach to examine group equivalence at the item level and the total score level. If our assumption holds that the PEG…
Descriptors: Testing, Distance Education, Comparative Analysis, Test Items
Dombrowski, Stefan C.; Casey, Corinne – Journal of Psychoeducational Assessment, 2022
This article reviews the administrative, scoring, and psychometric properties of the Wechsler Individual Achievement Test, Fourth Edition (WIAT-4, NCS Pearson, 2020). The WIAT-4 is one of the more commonly administered broadband measures of academic achievement. The instrument was determined to be well-conceptualized, and generally…
Descriptors: Achievement Tests, Testing, Scoring, Psychometrics
Maciej Koscielniak; Jolanta Enko; Agata Gasiorowska – Journal of Academic Ethics, 2024
Examination dishonesty is a global problem that became particularly critical after the outbreak of the COVID-19 pandemic and the shift to remote learning. Academic research has often examined this phenomenon as only one aspect of a broader concept of academic dishonesty and as a one-dimensional construct. This article builds on existing knowledge…
Descriptors: Foreign Countries, Students, Ethics, Cheating
Perkins, Beth A.; Satkus, Paulius; Finney, Sara J. – Journal of Psychoeducational Assessment, 2020
Few studies have examined the psychometric properties of the test-related items from the Achievement Emotions Questionnaire (AEQ). Using a sample of 955 university students, we examined the factor structure of 12 emotion items measuring test-related anger, boredom, enjoyment, and pride. Results indicated the four emotions were distinct, allowing…
Descriptors: Affective Measures, Questionnaires, Psychometrics, Test Items
Tekalign Geleta Kenea; Fisseha Mikire; Zenebe Negawo – Cogent Education, 2024
The core of the educational system is students' academic performance, which demands sensitive measures. In this situation, teacher-made tests (TMTs) are more promising, but they can be susceptible to measurement error if not well designed. Hence, this study aimed to investigate the relationship between the properties of TMTs and students' academic…
Descriptors: Foreign Countries, Psychometrics, Performance, Testing
New Meridian Corporation, 2020
New Meridian Corporation has developed the "Quality Testing Standards and Criteria for Comparability Claims" (QTS). The goal of the QTS is to provide guidance to states that are interested in including content from the New Meridian item bank and intend to make comparability claims with "other assessments" that include New…
Descriptors: Testing, Standards, Comparative Analysis, Guidelines
New Meridian Corporation, 2020
New Meridian Corporation has developed the "Quality Testing Standards and Criteria for Comparability Claims" (QTS). The goal of the QTS is to provide guidance to states that are interested in including content from the New Meridian item bank and intend to make comparability claims with "other assessments" that include New…
Descriptors: Testing, Standards, Comparative Analysis, Guidelines
Shun-Fu Hu; Amery D. Wu; Jake Stone – Journal of Educational Measurement, 2025
Scoring high-dimensional assessments (e.g., > 15 traits) can be a challenging task. This paper introduces the multilabel neural network (MNN) as a scoring method for high-dimensional assessments. Additionally, it demonstrates how MNN can score the same test responses to maximize different performance metrics, such as accuracy, recall, or…
Descriptors: Tests, Testing, Scores, Test Construction
Schmitz, Florian; Wilhelm, Oliver – Journal of Intelligence, 2019
Current taxonomies of intelligence comprise two factors of mental speed, clerical speed (Gs), and elementary cognitive speed (Gt). Both originated from different research traditions and are conceptualized as dissociable constructs in current taxonomies. However, previous research suggests that tasks of one category can be transferred into the…
Descriptors: Taxonomy, Intelligence Tests, Testing, Test Format
Nebraska Department of Education, 2022
In Winter 2021-2022, the Nebraska Student-Centered Assessment System (NSCAS) assessments are administered in ELA and mathematics in Grades 3-8. In Spring 2021-2022, the NSCAS assessments are administered in English language arts (ELA) and mathematics in Grades 3-8 and in science in Grades 5 and 8. The purposes of the NSCAS assessments are to…
Descriptors: English, Language Arts, Student Centered Learning, Mathematics Tests
Practices in Instrument Use and Development in "Chemistry Education Research and Practice" 2010-2021
Lazenby, Katherine; Tenney, Kristin; Marcroft, Tina A.; Komperda, Regis – Chemistry Education Research and Practice, 2023
Assessment instruments that generate quantitative data on attributes (cognitive, affective, behavioral, "etc.") of participants are commonly used in the chemistry education community to draw conclusions in research studies or inform practice. Recently, articles and editorials have stressed the importance of providing evidence for the…
Descriptors: Chemistry, Periodicals, Journal Articles, Science Education
New Meridian Corporation, 2020
New Meridian Corporation has developed the "Quality Testing Standards and Criteria for Comparability Claims" (QTS) to provide guidance to states that are interested in including New Meridian content and would like to either keep reporting scores on the New Meridian Scale or use the New Meridian performance levels; that is, the state…
Descriptors: Testing, Standards, Comparative Analysis, Test Content