Publication Date
In 2025 | 1 |
Since 2024 | 1 |
Since 2021 (last 5 years) | 5 |
Since 2016 (last 10 years) | 13 |
Since 2006 (last 20 years) | 26 |
Descriptor
Comparative Analysis | 81 |
Test Reliability | 81 |
Models | 43 |
Mathematical Models | 34 |
Test Validity | 25 |
Statistical Analysis | 17 |
Error of Measurement | 14 |
Test Items | 14 |
Correlation | 12 |
Foreign Countries | 12 |
Test Construction | 11 |
More ▼ |
Source
Author
Bashaw, W. L. | 2 |
Benson, Jeri | 2 |
Hansen, Duncan N. | 2 |
Huynh, Huynh | 2 |
Lubiano, Michael Leonard D. | 2 |
Magpantay, Marife S. | 2 |
Reckase, Mark D. | 2 |
Rentz, R. Robert | 2 |
Stallings, Jane A. | 2 |
Ackerman, Terry A. | 1 |
Adams, R. J. | 1 |
More ▼ |
Publication Type
Education Level
Audience
Location
China | 3 |
Philippines | 3 |
Turkey | 3 |
France | 2 |
Germany | 2 |
Portugal | 2 |
Spain | 2 |
United States | 2 |
Asia | 1 |
Australia | 1 |
Brazil | 1 |
More ▼ |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Mingfeng Xue; Ping Chen – Journal of Educational Measurement, 2025
Response styles pose great threats to psychological measurements. This research compares IRTree models and anchoring vignettes in addressing response styles and estimating the target traits. It also explores the potential of combining them at the item level and total-score level (ratios of extreme and middle responses to vignettes). Four models…
Descriptors: Item Response Theory, Models, Comparative Analysis, Vignettes
Maïano, Christophe; Morin, Alexandre J. S.; Tietjens, Maike; Bastos, Tânia; Luiggi, Maxime; Corredeira, Rui; Griffet, Jean; Sánchez-Oliva, David – Measurement in Physical Education and Exercise Science, 2023
The present study sought to examine the psychometric properties of new German, Portuguese, and Spanish versions of the Revised Short Form of the Physical Self-Inventory (PSI-S-"R"), and to contrast these properties against those from the original French version of this instrument. Participants (n = 1802) were 288 French youth, 177 German…
Descriptors: German, Portuguese, Spanish, Test Construction
Madison, Matthew J. – Educational Measurement: Issues and Practice, 2019
Recent advances have enabled diagnostic classification models (DCMs) to accommodate longitudinal data. These longitudinal DCMs were developed to study how examinees change, or transition, between different attribute mastery statuses over time. This study examines using longitudinal DCMs as an approach to assessing growth and serves three purposes:…
Descriptors: Longitudinal Studies, Item Response Theory, Psychometrics, Criterion Referenced Tests
Bao, Lei; Koenig, Kathleen; Xiao, Yang; Fritchman, Joseph; Zhou, Shaona; Chen, Cheng – Physical Review Physics Education Research, 2022
Abilities in scientific thinking and reasoning have been emphasized as core areas of initiatives, such as the Next Generation Science Standards or the College Board Standards for College Success in Science, which focus on the skills the future will demand of today's students. Although there is rich literature on studies of how these abilities…
Descriptors: Physics, Science Instruction, Teaching Methods, Thinking Skills
Storme, Martin; Myszkowski, Nils; Baron, Simon; Bernard, David – Journal of Intelligence, 2019
Assessing job applicants' general mental ability online poses psychometric challenges due to the necessity of having brief but accurate tests. Recent research (Myszkowski & Storme, 2018) suggests that recovering distractor information through Nested Logit Models (NLM; Suh & Bolt, 2010) increases the reliability of ability estimates in…
Descriptors: Intelligence Tests, Item Response Theory, Comparative Analysis, Test Reliability
Braumoeller, Bear F. – Sociological Methods & Research, 2017
Fuzzy-set qualitative comparative analysis (fsQCA) has become one of the most prominent methods in the social sciences for capturing causal complexity, especially for scholars with small- and medium-"N" data sets. This research note explores two key assumptions in fsQCA's methodology for testing for necessary and sufficient…
Descriptors: Qualitative Research, Comparative Analysis, Social Science Research, Research Methodology
Zaidi, Nikki L.; Swoboda, Christopher M.; Kelcey, Benjamin M.; Manuel, R. Stephen – Advances in Health Sciences Education, 2017
The extant literature has largely ignored a potentially significant source of variance in multiple mini-interview (MMI) scores by "hiding" the variance attributable to the sample of attributes used on an evaluation form. This potential source of hidden variance can be defined as rating items, which typically comprise an MMI evaluation…
Descriptors: Interviews, Scores, Generalizability Theory, Monte Carlo Methods
Horn, Aaron S.; Horner, Olena G.; Lee, Giljae – Studies in Higher Education, 2019
Researchers in higher education frequently evaluate institutional effectiveness as the difference between an actual and predicted graduation rate, but little is known about whether such a method is reliable or valid. This study examines the measurement properties of effectiveness scores derived from regression residuals for community colleges in…
Descriptors: Instructional Effectiveness, Two Year Colleges, Comparative Analysis, Raw Scores
Lubiano, Michael Leonard D.; Magpantay, Marife S. – International Journal of Research in Education and Science, 2021
This study enhanced the 7E instructional model towards enriching the science inquiry skills of senior high school learners in General Chemistry 1. A total of 136 Grade 12 learners enrolled in the Science, Technology, Engineering, and Mathematics (STEM) strand participated in the study. The study was composed of three phases. In Phase I, the…
Descriptors: Science Instruction, Teaching Methods, Inquiry, High School Students
Lubiano, Michael Leonard D.; Magpantay, Marife S. – International Society for Technology, Education, and Science, 2021
This study enhanced the 7E instructional model towards enriching the science inquiry skills of senior high school learners in General Chemistry 1. A total of 136 Grade 12 learners enrolled in the Science, Technology, Engineering, and Mathematics (STEM) strand participated in the study. The study was composed of three phases. In Phase I, the…
Descriptors: Foreign Countries, Science Instruction, Teaching Methods, Inquiry
Haberman, Shelby J.; Liu, Yang; Lee, Yi-Hsuan – ETS Research Report Series, 2019
Distractor analyses are routinely conducted in educational assessments with multiple-choice items. In this research report, we focus on three item response models for distractors: (a) the traditional nominal response (NR) model, (b) a combination of a two-parameter logistic model for item scores and a NR model for selections of incorrect…
Descriptors: Multiple Choice Tests, Scores, Test Reliability, High Stakes Tests
Bush, Martin – Assessment & Evaluation in Higher Education, 2015
The humble multiple-choice test is very widely used within education at all levels, but its susceptibility to guesswork makes it a suboptimal assessment tool. The reliability of a multiple-choice test is partly governed by the number of items it contains; however, longer tests are more time consuming to take, and for some subject areas, it can be…
Descriptors: Guessing (Tests), Multiple Choice Tests, Test Format, Test Reliability
Yang, Sophie Xin; Jowett, Sophia – Measurement in Physical Education and Exercise Science, 2013
The Coach-Athlete Relationship Questionnaire was developed to effectively measure affective, cognitive, and behavioral aspects, represented by the interpersonal constructs of closeness, commitment, and complementarity, of the quality of the relationship within the context of sport coaching. The current study sought to determine the internal…
Descriptors: Foreign Countries, Athletes, Athletic Coaches, Interpersonal Relationship
Ling, Guangming – ETS Research Report Series, 2012
To assess the value of individual students' subscores on the Major Field Test in Business (MFT Business), I examined the test's internal structure with factor analysis and structural equation model methods, and analyzed the subscore reliabilities using the augmented scores method. Analyses of the internal structure suggested that the MFT Business…
Descriptors: Factor Analysis, Construct Validity, Structural Equation Models, Correlation
Anafarta, Ayse; Apaydin, Çigdem – International Education Studies, 2016
Mentoring has received considerable attention from scholars, and in the relevant literature, a number of studies give reference to the mentoring programs developed at universities and to the mentoring relations in higher education. Yet, most of these studies either only have a theoretical basis or deal with the mentoring relationships between…
Descriptors: Mentors, Job Satisfaction, Success, Structural Equation Models