Publication Date
| In 2026 | 0 |
| Since 2025 | 0 |
| Since 2022 (last 5 years) | 1 |
| Since 2017 (last 10 years) | 5 |
| Since 2007 (last 20 years) | 14 |
Descriptor
| Gender Differences | 15 |
| Models | 15 |
| Test Items | 15 |
| Item Response Theory | 8 |
| Foreign Countries | 7 |
| College Students | 4 |
| Goodness of Fit | 4 |
| Item Analysis | 4 |
| Statistical Analysis | 4 |
| Difficulty Level | 3 |
| Factor Analysis | 3 |
| More ▼ | |
Source
Author
| Bringula, Rex P. | 1 |
| Chiang, Jui-Ling | 1 |
| Darling, Andrew | 1 |
| Dolan, Conor V. | 1 |
| Duffin, Kirk | 1 |
| Emily A. Brown | 1 |
| Engelhard, George, Jr. | 1 |
| George, Ann Cathrice | 1 |
| Gerick, Julia | 1 |
| Goldhammer, Frank | 1 |
| Hong, Sehee | 1 |
| More ▼ | |
Publication Type
| Journal Articles | 13 |
| Reports - Research | 12 |
| Dissertations/Theses -… | 2 |
| Reports - Evaluative | 1 |
Education Level
| Higher Education | 5 |
| Postsecondary Education | 3 |
| Elementary Secondary Education | 2 |
| High Schools | 2 |
| Secondary Education | 2 |
| Elementary Education | 1 |
| Grade 12 | 1 |
Audience
Laws, Policies, & Programs
Assessments and Surveys
| Advanced Placement… | 1 |
| National Assessment of… | 1 |
| Program for International… | 1 |
| Trends in International… | 1 |
What Works Clearinghouse Rating
Emily A. Brown – ProQuest LLC, 2024
Previous research has been limited regarding the measurement of computational thinking, particularly as a learning progression in K-12. This study proposes to apply a multidimensional item response theory (IRT) model to a newly developed measure of computational thinking utilizing both selected response and open-ended polytomous items to establish…
Descriptors: Models, Computation, Thinking Skills, Item Response Theory
Tabatabaee-Yazdi, Mona – SAGE Open, 2020
The Hierarchical Diagnostic Classification Model (HDCM) reflects on the sequences of the presentation of the essential materials and attributes to answer the items of a test correctly. In this study, a foreign language reading comprehension test was analyzed employing HDCM and the generalized deterministic-input, noisy and gate (G-DINA) model to…
Descriptors: Diagnostic Tests, Classification, Models, Reading Comprehension
Luo, Wei; Smith, Thomas J.; Whalley, Kyle; Darling, Andrew; Ormand, Carol; Hung, Wei-Chen; Chiang, Jui-Ling; Pelletier, Jon; Duffin, Kirk – British Journal of Educational Technology, 2019
This paper presents results from a randomized experimental design replicated over four semesters that compared students' performance in understanding landform evolution processes as measured by the pretest to posttest score growth between two treatment methods: an online interactive simulation tool and a paper-based exercise. While both methods…
Descriptors: Earth Science, Models, Science Tests, Computer Simulation
George, Ann Cathrice; Robitzsch, Alexander – Applied Measurement in Education, 2018
This article presents a new perspective on measuring gender differences in the large-scale assessment study Trends in International Science Study (TIMSS). The suggested empirical model is directly based on the theoretical competence model of the domain mathematics and thus includes the interaction between content and cognitive sub-competencies.…
Descriptors: Achievement Tests, Elementary Secondary Education, Mathematics Achievement, Mathematics Tests
Ihme, Jan Marten; Senkbeil, Martin; Goldhammer, Frank; Gerick, Julia – European Educational Research Journal, 2017
The combination of different item formats is found quite often in large scale assessments, and analyses on the dimensionality often indicate multi-dimensionality of tests regarding the task format. In ICILS 2013, three different item types (information-based response tasks, simulation tasks, and authoring tasks) were used to measure computer and…
Descriptors: Foreign Countries, Computer Literacy, Information Literacy, International Assessment
Okumura, Taichi – Educational and Psychological Measurement, 2014
This study examined the empirical differences between the tendency to omit items and reading ability by applying tree-based item response (IRTree) models to the Japanese data of the Programme for International Student Assessment (PISA) held in 2009. For this purpose, existing IRTree models were expanded to contain predictors and to handle…
Descriptors: Foreign Countries, Item Response Theory, Test Items, Reading Ability
Bringula, Rex P. – Education and Information Technologies, 2015
This study attempted to develop valid and reliable Capstone Project Attitude Scales (CPAS). Among the scales reviewed, the Modified Fennema-Shermann Mathematics Attitude Scales was adapted in the construction of the CPAS. Usefulness, Confidence, and Gender View were the three subscales of the CPAS. Four hundred sixty-three students answered the…
Descriptors: Program Attitudes, Attitude Measures, Questionnaires, Test Construction
Ong, Yoke Mooi; Williams, Julian; Lamprianou, Iasonas – International Journal of Testing, 2015
The purpose of this article is to explore crossing differential item functioning (DIF) in a test drawn from a national examination of mathematics for 11-year-old pupils in England. An empirical dataset was analyzed to explore DIF by gender in a mathematics assessment. A two-step process involving the logistic regression (LR) procedure for…
Descriptors: Mathematics Tests, Gender Differences, Test Bias, Test Items
Kaliski, Pamela K.; Wind, Stefanie A.; Engelhard, George, Jr.; Morgan, Deanna L.; Plake, Barbara S.; Reshetar, Rosemary A. – Educational and Psychological Measurement, 2013
The many-faceted Rasch (MFR) model has been used to evaluate the quality of ratings on constructed response assessments; however, it can also be used to evaluate the quality of judgments from panel-based standard setting procedures. The current study illustrates the use of the MFR model for examining the quality of ratings obtained from a standard…
Descriptors: Item Response Theory, Models, Standard Setting (Scoring), Science Tests
Dolan, Conor V.; Oort, Frans J.; Stoel, Reinoud D.; Wicherts, Jelte M. – Structural Equation Modeling: A Multidisciplinary Journal, 2009
We propose a method to investigate measurement invariance in the multigroup exploratory factor model, subject to target rotation. We consider both oblique and orthogonal target rotation. This method has clear advantages over other approaches, such as the use of congruence measures. We demonstrate that the model can be implemented readily in the…
Descriptors: Test Items, Psychology, Models, College Students
Wang, Wen-Chung; Jin, Kuan-Yu – Applied Psychological Measurement, 2010
In this study, all the advantages of slope parameters, random weights, and latent regression are acknowledged when dealing with component and composite items by adding slope parameters and random weights into the standard item response model with internal restrictions on item difficulty and formulating this new model within a multilevel framework…
Descriptors: Test Items, Difficulty Level, Regression (Statistics), Generalization
Sandmann, Lorilee R.; Jordan, Jenny W.; Mull, Casey D.; Valentine, Thomas – Journal of Higher Education Outreach and Engagement, 2014
Community engagement professionals and partners serve as, work with, study, and build the capacity of boundary spanners. To augment knowledge about these functions, the Weerts-Sandmann Boundary Spanning Conceptual Framework (2010) has been operationalized through a survey instrument to examine community engagement boundary-spanning behaviors by…
Descriptors: Outreach Programs, Change Agents, Community Involvement, Employee Attitudes
Jia, Yujie – ProQuest LLC, 2013
This study employed Bachman and Palmer's (2010) Assessment Use Argument framework to investigate to what extent the use of a second language oral test as an exit test in a Hong Kong university can be justified. It also aimed to help test developers of this oral test identify the most critical areas in the current test design that might need…
Descriptors: Test Use, Language Tests, Oral Language, Second Language Learning
Hong, Sehee; Min, Sae-Young – Educational and Psychological Measurement, 2007
In this study, mixed Rasch modeling was used on the Self-Rating Depression Scale (SDS), a widely used measure of depression, among a non-Western sample of 618 Korean college students. The results revealed three latent classes and confirmed the unidimensionality of the SDS. In addition, there was a significant effect for gender in terms of class…
Descriptors: Rating Scales, Depression (Psychology), Models, Self Evaluation (Individuals)
Xu, Xueli; von Davier, Matthias – ETS Research Report Series, 2006
More than a dozen statistical models have been developed for the purpose of cognitive diagnosis. These models are supposed to extract a much finer level of information from item responses than traditional unidimensional item response models. In this paper, a general diagnostic model (GDM) was used to analyze a set of simulated sparse data and real…
Descriptors: Statistical Analysis, National Competency Tests, Diagnostic Tests, Item Response Theory

Direct link
Peer reviewed
