Publication Date
| In 2026 | 0 |
| Since 2025 | 0 |
| Since 2022 (last 5 years) | 21 |
| Since 2017 (last 10 years) | 33 |
| Since 2007 (last 20 years) | 50 |
Descriptor
| Classification | 59 |
| Item Analysis | 59 |
| Test Items | 59 |
| Foreign Countries | 22 |
| Item Response Theory | 18 |
| Comparative Analysis | 14 |
| Test Construction | 14 |
| Achievement Tests | 11 |
| Difficulty Level | 11 |
| Accuracy | 10 |
| Diagnostic Tests | 10 |
| More ▼ | |
Source
Author
Publication Type
| Journal Articles | 49 |
| Reports - Research | 38 |
| Reports - Evaluative | 9 |
| Reports - Descriptive | 8 |
| Speeches/Meeting Papers | 6 |
| Information Analyses | 3 |
| Tests/Questionnaires | 3 |
| Dissertations/Theses -… | 2 |
| Guides - Non-Classroom | 1 |
Education Level
Audience
| Researchers | 1 |
Location
| Taiwan | 2 |
| Turkey | 2 |
| Arkansas | 1 |
| Australia | 1 |
| Bangladesh | 1 |
| China (Beijing) | 1 |
| Czech Republic | 1 |
| Europe | 1 |
| Greece | 1 |
| Iran | 1 |
| Japan | 1 |
| More ▼ | |
Laws, Policies, & Programs
| Individuals with Disabilities… | 1 |
| No Child Left Behind Act 2001 | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
Kang, Hyeon-Ah; Han, Suhwa; Kim, Doyoung; Kao, Shu-Chuan – Educational and Psychological Measurement, 2022
The development of technology-enhanced innovative items calls for practical models that can describe polytomous testlet items. In this study, we evaluate four measurement models that can characterize polytomous items administered in testlets: (a) generalized partial credit model (GPCM), (b) testlet-as-a-polytomous-item model (TPIM), (c)…
Descriptors: Goodness of Fit, Item Response Theory, Test Items, Scoring
Sedat Sen; Allan S. Cohen – Educational and Psychological Measurement, 2024
A Monte Carlo simulation study was conducted to compare fit indices used for detecting the correct latent class in three dichotomous mixture item response theory (IRT) models. Ten indices were considered: Akaike's information criterion (AIC), the corrected AIC (AICc), Bayesian information criterion (BIC), consistent AIC (CAIC), Draper's…
Descriptors: Goodness of Fit, Item Response Theory, Sample Size, Classification
Alallo, Hajir Mahmood Ibrahim; Mohammed, Aisha; Hamid, Zayad Khalaf; Hassan, Aalaa Yaseen; Kadhim, Qasim Khlaif – International Journal of Language Testing, 2023
Diagnostic classification models (DCMs) have recently become very popular both for research purposes and for real testing endeavors for student assessment. A plethora of DCM models give researchers and practitioners a wide range of options for student diagnosis and classification. One intriguing option that some DCM models offer is the possibility…
Descriptors: Language Tests, Diagnostic Tests, Classification, Clinical Diagnosis
Cronin, Sean D. – ProQuest LLC, 2023
This convergent, parallel, mixed-methods study with qualitative and quantitative content analysis methods was conducted to identify what type of thinking is required by the College and Career Readiness Assessment (CCRA+) by (a) determining the frequency and percentage of questions categorized as higher-level thinking within each cell of Hess'…
Descriptors: Cues, College Readiness, Career Readiness, Test Items
Nagy, Gabriel; Ulitzsch, Esther – Educational and Psychological Measurement, 2022
Disengaged item responses pose a threat to the validity of the results provided by large-scale assessments. Several procedures for identifying disengaged responses on the basis of observed response times have been suggested, and item response theory (IRT) models for response engagement have been proposed. We outline that response time-based…
Descriptors: Item Response Theory, Hierarchical Linear Modeling, Predictor Variables, Classification
Yoo Jeong Jang – ProQuest LLC, 2022
Despite the increasing demand for diagnostic information, observed subscores have been often reported to lack adequate psychometric qualities such as reliability, distinctiveness, and validity. Therefore, several statistical techniques based on CTT and IRT frameworks have been proposed to improve the quality of subscores. More recently, DCM has…
Descriptors: Classification, Accuracy, Item Response Theory, Correlation
Lundgren, Erik – Journal of Educational Data Mining, 2022
Response process data have the potential to provide a rich description of test-takers' thinking processes. However, retrieving insights from these data presents a challenge for educational assessments and educational data mining as they are complex and not well annotated. The present study addresses this challenge by developing a computational…
Descriptors: Problem Solving, Classification, Accuracy, Foreign Countries
Saatcioglu, Fatima Munevver; Atar, Hakan Yavuz – International Journal of Assessment Tools in Education, 2022
This study aims to examine the effects of mixture item response theory (IRT) models on item parameter estimation and classification accuracy under different conditions. The manipulated variables of the simulation study are set as mixture IRT models (Rasch, 2PL, 3PL); sample size (600, 1000); the number of items (10, 30); the number of latent…
Descriptors: Accuracy, Classification, Item Response Theory, Programming Languages
Zeynep Colak Seker; Cihat Demir – Online Submission, 2022
The aim of this study is to investigate the end-of-unit evaluation questions in 5th grade primary school science workbook according to dimensions of the Revised Bloom's Taxonomy (RBT). For that purpose, 191 end-of-unit evaluation questions of 5th grade science workbook which is approved by the Republic of Turkey -the Ministry of National…
Descriptors: Textbook Evaluation, Grade 5, Science Instruction, Workbooks
Delican, Burak – International Journal of Curriculum and Instruction, 2022
In this research, the questions in the Turkish Course (2,3,4) Worksheets were examined in terms of various classification systems. In this direction, the questions in the worksheets were evaluated with the document-material analysis technique in accordance with the structure of the qualitative research. During the research process, Turkish Course…
Descriptors: Worksheets, Elementary School Students, Turkish, Classification
Tim Stoeckel; Tomoko Ishii – Vocabulary Learning and Instruction, 2024
In an upcoming coverage-comprehension study, we plan to assess learners' meaning-recall knowledge of words as they occur in the study's reading passage. As several meaning-recall test formats exist, the purpose of this small-scale study (N = 10) was to determine which of three formats was most similar to a criterion interview regarding mean score…
Descriptors: Vocabulary Development, Language Tests, Second Language Learning, Classification
Aleyna Altan; Zehra Taspinar Sener – Online Submission, 2023
This research aimed to develop a valid and reliable test to be used to detect sixth grade students' misconceptions and errors regarding the subject of fractions. A misconception diagnostic test has been developed that includes the concept of fractions, different representations of fractions, ordering and comparing fractions, equivalence of…
Descriptors: Diagnostic Tests, Mathematics Tests, Fractions, Misconceptions
Sivakorn Tangsakul; Kornwipa Poonpon – rEFLections, 2024
Given the significant global influence of the Common European Framework of Reference for Languages: Teaching, Learning, and Assessment (CEFR) on English language education, this study deals with aligning a university's academic reading tests to the CEFR. It aimed at validating the test construct of the academic reading tests in relation to the…
Descriptors: Alignment (Education), Reading Tests, Second Language Learning, Language Proficiency
Shafipoor, Mahdieh; Ravand, Hamdollah; Maftoon, Parviz – Language Testing in Asia, 2021
The current study compared the model fit indices, skill mastery probabilities, and classification accuracy of six Diagnostic Classification Models (DCMs): a general model (G-DINA) against five specific models (LLM, RRUM, ACDM, DINA, and DINO). To do so, the response data to the grammar and vocabulary sections of a General English Achievement Test,…
Descriptors: Goodness of Fit, Models, Classification, Grammar
Wang, Wenyi; Song, Lihong; Chen, Ping; Ding, Shuliang – Journal of Educational Measurement, 2019
Most of the existing classification accuracy indices of attribute patterns lose effectiveness when the response data is absent in diagnostic testing. To handle this issue, this article proposes new indices to predict the correct classification rate of a diagnostic test before administering the test under the deterministic noise input…
Descriptors: Cognitive Tests, Classification, Accuracy, Diagnostic Tests

Peer reviewed
Direct link
