Güler Yavuz Temel – Journal of Educational Measurement, 2024
The purpose of this study was to investigate multidimensional DIF with a simple and nonsimple structure in the context of multidimensional Graded Response Model (MGRM). This study examined and compared the performance of the IRT-LR and Wald test using MML-EM and MHRM estimation approaches with different test factors and test structures in…
Descriptors: Computation, Multidimensional Scaling, Item Response Theory, Models
Callear, Angela; Harvey, Shane Trevor; Bimler, David – International Journal of Behavioral Development, 2017
Emotion regulation is a central feature in human emotional development. However, measures based on children's observable emotion regulation behaviors are largely absent. An inventory of children's emotion regulation strategies was developed from current measures and four focus group discussions with experts in child behavior and emotion. From…
Descriptors: Children, Emotional Development, Child Behavior, Affective Behavior
Straat, J. Hendrik; van der Ark, L. Andries; Sijtsma, Klaas – Educational and Psychological Measurement, 2014
An automated item selection procedure in Mokken scale analysis partitions a set of items into one or more Mokken scales, if the data allow. Two algorithms are available that pursue the same goal of selecting Mokken scales of maximum length: Mokken's original automated item selection procedure (AISP) and a genetic algorithm (GA). Minimum…
Descriptors: Sampling, Test Items, Effect Size, Scaling
Ercikan, Kadriye; Oliveri, María Elena – Applied Measurement in Education, 2016
Assessing complex constructs such as those discussed under the umbrella of 21st century constructs highlights the need for a principled assessment design and validation approach. In our discussion, we made a case for three considerations: (a) taking construct complexity into account across various stages of assessment development such as the…
Descriptors: Evaluation Methods, Test Construction, Design, Scaling
Wang, Yan; Mu, Guanglun Michael; Wang, Zhiqing; Deng, Meng; Cheng, Li; Wang, Hongxia – International Journal of Disability, Development and Education, 2015
Classroom support plays a salient role in successful inclusive education, hence it has been widely debated in the literature. Much extant work has only focused on a particular aspect of classroom support. A comprehensive, systematic discussion of classroom support is sporadic in the literature. Relevant research concerning the Chinese context is…
Descriptors: Multidimensional Scaling, Inclusion, Classroom Techniques, Classroom Environment
Kane, Michael T.; Mroch, Andrew A.; Suh, Youngsuk; Ripkey, Douglas R. – Measurement: Interdisciplinary Research and Perspectives, 2009
This paper analyzes five linear equating models for the "nonequivalent groups with anchor test" (NEAT) design with internal anchors (i.e., the anchor test is part of the full test). The analysis employs a two-dimensional framework. The first dimension contrasts two general approaches to developing the equating relationship. Under a "parameter…
Descriptors: Scaling, Equated Scores, Methods, Test Items
Seok, Soonhwa – Educational Technology Research and Development, 2009
The purpose of this study was to identify and validate items applicable to evaluating online courses at the postsecondary level. Items were derived from a review of the literature. Four judges rated the similarity of the items by making pair-wise comparisons utilizing multidimensional scaling (MDS). The study consisted of five stages. Stage I…
Descriptors: Online Courses, Multidimensional Scaling, Course Evaluation, Test Items
Edirisooriya, Gunapala – 1997
This paper suggests a new approach to attitude scale construction. Instead of asking respondents to express the extent or the degree of opinion on a particular issue, respondents should be asked about the factors that are relevant for the issue of interest and how much weight respondents are willing to attach to each relevant piece of evidence.…
Descriptors: Abortions, Attitude Measures, Comparative Analysis, Decision Making
Luecht, Richard M. – Foreign Language Annals, 2003
This article contends that the necessary links between constructs and test scores/decisions in language assessment must be established through principled design procedures that align three models: (1) a theoretical construct model; (2) a test development model; and (3) a psychometric scoring model. The theoretical construct model articulates the…
Descriptors: Scoring, Psychometrics, Language Proficiency, Language Tests
Marsh, Herbert W.; And Others – 1984
The Self Description Questionnaire II (SDQ II) was administered to 901 students (11 to 18 years old) in grades 7 through 12 who attended one public coeducational high school. The 11 factors the SDQ II was designed to measure were clearly identified in a conventional/exploratory factor analysis and in a confirmatory factor analysis using LISREL.…
Descriptors: Factor Analysis, Factor Structure, Models, Multidimensional Scaling

Snyder, Scott; Sheehan, Robert – Journal of Early Intervention, 1992
This examination of the Rasch scaling model concludes that the model could potentially facilitate objective comparisons of status and change of young children with disabilities at individual and group levels. The paper discusses applications of the model to early childhood assessment in the areas of item banking, test analysis, and subject…
Descriptors: Disabilities, Evaluation Methods, Item Response Theory, Measurement Techniques
Yao, Lihua; Schwarz, Richard D. – Applied Psychological Measurement, 2006
Multidimensional item response theory (IRT) models have been proposed for better understanding the dimensional structure of data or to define diagnostic profiles of student learning. A compensatory multidimensional two-parameter partial credit model (M-2PPC) for constructed-response items is presented that is a generalization of those proposed to…
Descriptors: Models, Item Response Theory, Markov Processes, Monte Carlo Methods
Hambleton, Ronald K. – 1989
A brief overview of item response theory is provided, and a 186-item bibliography of books and articles on the subject dating from 1953 to June 1989 is presented. The overview includes a definition of the theory, a discussion of its development and application, and comparisons with classical test theory. All publications in the bibliography were…
Descriptors: Adaptive Testing, Computer Assisted Testing, Computer Software, Equated Scores
Secolsky, Charles, Ed.; Denison, D. Brian, Ed. – Routledge, Taylor & Francis Group, 2011
Increased demands for colleges and universities to engage in outcomes assessment for accountability purposes have accelerated the need to bridge the gap between higher education practice and the fields of measurement, assessment, and evaluation. The "Handbook on Measurement, Assessment, and Evaluation in Higher Education" provides higher…
Descriptors: Generalizability Theory, Higher Education, Institutional Advancement, Teacher Effectiveness
Stacks, Don W.; And Others – 1983
A study provided the initial test of a multidimensional instrument based on the idea that syntactic language choice might predict writing apprehension. The test measured six factors: (1) blank page paralysis, (2) general affect toward writing, (3) positive/negative business affect, (4) alternative modes, (5) attitude toward writing competence, and…
Descriptors: Business Communication, College Students, Evaluation Methods, Higher Education