NotesFAQContact Us
Collection
Advanced
Search Tips
Audience
Teachers1
What Works Clearinghouse Rating
Showing 1 to 15 of 260 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
William C. M. Belzak; Daniel J. Bauer – Journal of Educational and Behavioral Statistics, 2024
Testing for differential item functioning (DIF) has undergone rapid statistical developments recently. Moderated nonlinear factor analysis (MNLFA) allows for simultaneous testing of DIF among multiple categorical and continuous covariates (e.g., sex, age, ethnicity, etc.), and regularization has shown promising results for identifying DIF among…
Descriptors: Test Bias, Algorithms, Factor Analysis, Error of Measurement
Peer reviewed Peer reviewed
Direct linkDirect link
Ryan M. Cook; Stefanie A. Wind – Measurement and Evaluation in Counseling and Development, 2024
The purpose of this article is to discuss reliability and precision through the lens of a modern measurement approach, item response theory (IRT). Reliability evidence in the field of counseling is primarily generated using Classical Test Theory (CTT) approaches, although recent studies in the field of counseling have shown the benefits of using…
Descriptors: Item Response Theory, Measurement, Reliability, Accuracy
Peer reviewed Peer reviewed
Direct linkDirect link
Chia-Lin Tsai; Stefanie Wind; Samantha Estrada – Measurement: Interdisciplinary Research and Perspectives, 2025
Researchers who work with ordinal rating scales sometimes encounter situations where the scale categories do not function in the intended or expected way. For example, participants' use of scale categories may result in an empirical difficulty ordering for the categories that does not match what was intended. Likewise, the level of distinction…
Descriptors: Rating Scales, Item Response Theory, Psychometrics, Self Efficacy
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Siqi Huang – North American Chapter of the International Group for the Psychology of Mathematics Education, 2023
The goal of this paper is twofold. First, the paper clarifies and elaborates on an important theoretical construct called orientation with respect to understanding in mathematics, which denotes the degree to which students exhibit an inclination towards and demonstrate an earnest concern for understanding in mathematical learning. Second, the…
Descriptors: Mathematics Instruction, Teaching Methods, Problem Solving, Reliability
Peer reviewed Peer reviewed
Direct linkDirect link
Huiying Cai; Xun Yan – Language Testing, 2024
Rater comments tend to be qualitatively analyzed to indicate raters' application of rating scales. This study applied natural language processing (NLP) techniques to quantify meaningful, behavioral information from a corpus of rater comments and triangulated that information with a many-facet Rasch measurement (MFRM) analysis of rater scores. The…
Descriptors: Natural Language Processing, Item Response Theory, Rating Scales, Writing Evaluation
Peer reviewed Peer reviewed
Direct linkDirect link
Sung-eun Baek; Christine Myung-hee Ahn – Journal of Psychoeducational Assessment, 2025
The purpose of this study was to evaluate the reliability and validity of the Korean Behavior Assessment System for Children 3rd Edition Teacher Rating Scales--Child Form (K·BASC-3 TRS-C). We used the generalized partial credit model based on item response theory (IRT) to examine the internal validity of the scale's items and the latent trait in a…
Descriptors: Reliability, Teacher Attitudes, Elementary Secondary Education, Asians
Peer reviewed Peer reviewed
Direct linkDirect link
DeMars, Christine E. – Applied Measurement in Education, 2021
Estimation of parameters for the many-facets Rasch model requires that conditional on the values of the facets, such as person ability, item difficulty, and rater severity, the observed responses within each facet are independent. This requirement has often been discussed for the Rasch models and 2PL and 3PL models, but it becomes more complex…
Descriptors: Item Response Theory, Test Items, Ability, Scores
Peer reviewed Peer reviewed
Direct linkDirect link
Grimm, Kevin J.; Fine, Kimberly; Stegmann, Gabriela – International Journal of Behavioral Development, 2021
Modeling within-person change over time and between-person differences in change over time is a primary goal in prevention science. When modeling change in an observed score over time with multilevel or structural equation modeling approaches, each observed score counts toward the estimation of model parameters equally. However, observed scores…
Descriptors: Error of Measurement, Weighted Scores, Accuracy, Item Response Theory
Peer reviewed Peer reviewed
Direct linkDirect link
Peitao Zhu; Ching-Chen Chen; Qiu Wang; Melissa M. Luke; Yanhong Liu – Measurement and Evaluation in Counseling and Development, 2025
Objective: This study aimed to validate the Cultural Humility and Enactment Scale (CHES) through (a) examining its factor structure with multiple samples; (b) employing item response theory (IRT) analysis to examine its item-level characteristics; (c) reducing potential redundancies among items; and (d) conducting measurement invariance (MI)…
Descriptors: Item Response Theory, Cultural Awareness, Measurement Techniques, Construct Validity
Peer reviewed Peer reviewed
Direct linkDirect link
Wind, Stefanie A.; Wolfe, Edward W.; Engelhard, George, Jr.; Foltz, Peter; Rosenstein, Mark – International Journal of Testing, 2018
Automated essay scoring engines (AESEs) are becoming increasingly popular as an efficient method for performance assessments in writing, including many language assessments that are used worldwide. Before they can be used operationally, AESEs must be "trained" using machine-learning techniques that incorporate human ratings. However, the…
Descriptors: Computer Assisted Testing, Essay Tests, Writing Evaluation, Scoring
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Shan Lin; Jian Wang – Journal of Baltic Science Education, 2024
Scientific thinking constitutes a vital component of scientific competencies, crucial for citizens to adapt to the evolving societal landscape. To cultivate students' scientific thinking, teachers should possess an adequate professional knowledge foundation, which encompasses pedagogical content knowledge (PCK). Assessing teachers' PCK of…
Descriptors: Secondary School Teachers, Teacher Attitudes, Biology, Pedagogical Content Knowledge
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Sayin, Ayfer; Sata, Mehmet – International Journal of Assessment Tools in Education, 2022
The aim of the present study was to examine Turkish teacher candidates' competency levels in writing different types of test items by utilizing Rasch analysis. In addition, the effect of the expertise of the raters scoring the items written by the teacher candidates was examined within the scope of the study. 84 Turkish teacher candidates…
Descriptors: Foreign Countries, Item Response Theory, Evaluators, Expertise
Peer reviewed Peer reviewed
Direct linkDirect link
Huaxia Xiong; Mingfeng Xue; Guan Di; Yaqing Mao; Enhui Qiao – Journal of Psychoeducational Assessment, 2024
The impact of teachers' beliefs on the implementation and effectiveness of Social and Emotional Learning (SEL) programs underscores the essential need for reliable measures of these beliefs. This study aims to explore and validate the psychometric properties of the Teacher Social and Emotional Learning Beliefs Scale (TSELBS) within the Chinese…
Descriptors: Social Emotional Learning, Teacher Attitudes, Program Effectiveness, Correlation
Peer reviewed Peer reviewed
Direct linkDirect link
van der Scheer, Emmelien A.; Bijlsma, Hannah J. E.; Glas, Cees A. W. – School Effectiveness and School Improvement, 2019
A Bayesian IRT-model approach was used to investigate the validity and reliability of student perceptions of teaching quality. Furthermore, the student perceptions were compared with ratings of teaching quality by external observers. Grade 4 students (n = 675) filled out a questionnaire that was used to measure their opinions about the lessons of…
Descriptors: Student Attitudes, Validity, Interrater Reliability, Correlation
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Polat, Murat; Toraman, Çetin; Turhan, Nihan Sölpük – International Journal of Curriculum and Instruction, 2022
PISA (Program for International Student Assessment) tests have enabled the OECD countries to see not only the success of their students in gaining the ability to solve some daily problems they may encounter in their lives but also the place in the world rankings as a result of an objective evaluation comparing the achievement results of…
Descriptors: Foreign Countries, Achievement Tests, International Assessment, Secondary School Students
Previous Page | Next Page »
Pages: 1  |  2  |  3  |  4  |  5  |  6  |  7  |  8  |  9  |  10  |  11  |  ...  |  18