Showing 1 to 15 of 61 results
Peer reviewed
Hwanggyu Lim; Danqi Zhu; Edison M. Choe; Kyung T. Han – Journal of Educational Measurement, 2024
This study presents a generalized version of the residual differential item functioning (RDIF) detection framework in item response theory, named GRDIF, to analyze differential item functioning (DIF) in multiple groups. The GRDIF framework retains the advantages of the original RDIF framework, such as computational efficiency and ease of…
Descriptors: Item Response Theory, Test Bias, Test Reliability, Test Construction
Peer reviewed
Hung-Yu Huang – Educational and Psychological Measurement, 2025
The use of discrete categorical formats to assess psychological traits has a long-standing tradition that is deeply embedded in item response theory models. The increasing prevalence and endorsement of computer- or web-based testing has led to greater focus on continuous response formats, which offer numerous advantages in both respondent…
Descriptors: Response Style (Tests), Psychological Characteristics, Item Response Theory, Test Reliability
Peer reviewed
Stella Y. Kim; Carl Westine; Tong Wu; Derek Maher – Journal of College Student Retention: Research, Theory & Practice, 2024
The primary purpose of this study is to validate a student engagement measure for its use in evaluation of a learning assistant (LA) program. A series of psychometric evaluations were made for both the original scale of Higher Education Student Engagement Scale (HESES) and its adapted version designed to be used in gauging the effectiveness of…
Descriptors: Learner Engagement, Teaching Assistants, Test Validity, Test Reliability
Peer reviewed
Cui, Zhongmin; He, Yong – Measurement: Interdisciplinary Research and Perspectives, 2023
Careful considerations are necessary when there is a need to choose an anchor test form from a list of old test forms for equating under the random groups design. The choice of the anchor form potentially affects the accuracy of equated scores on new test forms. Few guidelines, however, can be found in the literature on choosing the anchor form.…
Descriptors: Test Format, Equated Scores, Best Practices, Test Construction
Peer reviewed
Anders Holm; Anders Hjorth-Trolle; Robert Andersen – Sociological Methods & Research, 2025
Lagged dependent variables (LDVs) are often used as predictors in ordinary least squares (OLS) models in the social sciences. Although several estimators are commonly employed, little is known about their relative merits in the presence of classical measurement error and different longitudinal processes. We assess the performance of four commonly…
Descriptors: Elementary Education, Scores, Error of Measurement, Predictor Variables
Peer reviewed
D. Steger; S. Weiss; O. Wilhelm – Creativity Research Journal, 2023
Creativity can be measured with a variety of methods including self-reports, others' reports, and ability tests. While typical self-reports are best understood as weak proxies of creativity, biographical reports that assess previous creative activities seem more promising. Drawbacks of such measures -- including skewed item distributions, a lack of…
Descriptors: Creativity, Creativity Tests, Test Construction, Algorithms
Peer reviewed
Frazier, Thomas W.; Khaliq, Izma; Scullin, Keeley; Uljarevic, Mirko; Shih, Andy; Karpur, Arun – Journal of Autism and Developmental Disorders, 2023
At present, there are no brief, freely-available, informant-report measures that evaluate key challenging behaviors relevant to youth with autism spectrum disorder (ASD) or other developmental disabilities (DD). This paper describes the development, refinement, and initial psychometric evaluation of a new 18-item measure, the Open-Source…
Descriptors: Test Construction, Psychometrics, Behavior Problems, Autism Spectrum Disorders
Peer reviewed
PDF on ERIC
Firdissa J. Aga – Intersection: A Journal at the Intersection of Assessment and Learning, 2024
The study investigated hurdles to the quality of student learning assessment by examining issues related to assessment procedures and practices, learners and learning, learning resources and test constructs, and test admin and feedback. Quantitative and qualitative data were collected from two Ethiopian universities using two types of…
Descriptors: Foreign Countries, College Faculty, College Students, Test Construction
Peer reviewed
Jones, Andrew T.; Kopp, Jason P.; Ong, Thai Q. – Educational Measurement: Issues and Practice, 2020
Studies investigating invariance have often been limited to measurement or prediction invariance. Selection invariance, wherein the use of test scores for classification results in equivalent classification accuracy between groups, has received comparatively little attention in the psychometric literature. Previous research suggests that some form…
Descriptors: Test Construction, Test Bias, Classification, Accuracy
Peer reviewed
Ozdemir, Burhanettin; Gelbal, Selahattin – Education and Information Technologies, 2022
The computerized adaptive tests (CAT) apply an adaptive process in which the items are tailored to individuals' ability scores. The multidimensional CAT (MCAT) designs differ in terms of different item selection, ability estimation, and termination methods being used. This study aims at investigating the performance of the MCAT designs used to…
Descriptors: Scores, Computer Assisted Testing, Test Items, Language Proficiency
Peer reviewed
PDF on ERIC
Demirtas, Zülfü; Çaçan, Hanifi; Uslukaya, Alper – International Journal of Contemporary Educational Research, 2023
This work is intended to develop a measuring tool for determining teacher perception of informal relationships. The pool of items created by researchers through a literature review has been presented with expert assessment of the validity of the content, face, and meaning, and a draft scale has been created by making necessary revisions to the…
Descriptors: Foreign Countries, Teacher Attitudes, Likert Scales, Test Construction
Patrick C. Kyllonen; Amit Sevak; Teresa Ober; Ikkyu Choi; Jesse Sparks; Daniel Fishtein – ETS Research Institute, 2024
Assessment refers to a broad array of approaches for measuring or evaluating a person's (or group of persons') skills, behaviors, dispositions, or other attributes. Assessments range from standardized tests used in admissions, employee selection, licensure examinations, and domestic and international large-scale assessments of cognitive and…
Descriptors: Performance Based Assessment, Evaluation Criteria, Evaluation Methods, Test Bias
Peer reviewed
Lau, Chloe; Chiesi, Francesca; Hofmann, Jennifer; Ruch, Willibald; Saklofske, Donald H. – Journal of Psychoeducational Assessment, 2020
The State-Trait Cheerfulness Inventory--Trait Version (STCI-T60) measures the temperamental basis of sense of humor involving theoretically derived personality dispositions of cheerfulness, seriousness, and bad mood. The reliability and validity of the newly developed STCI-T60 Italian version were assessed in a sample of Italian speakers (N =…
Descriptors: Foreign Countries, Personality Traits, Psychometrics, Test Construction
Peer reviewed
PDF on ERIC
Erdem, Devrim – Eurasian Journal of Educational Research, 2020
Purpose: This study reports on the development, validation and measurement invariance of the Multicultural Competency Scale (MCS) for pre-service teachers. Research Methods: Data from 640 pre-service teachers were collected for two studies. After data screening procedures 628 responses were left. The data were divided into two sets for exploratory…
Descriptors: Rating Scales, Multicultural Education, Cultural Awareness, Teacher Competencies
Peer reviewed
PDF on ERIC
Erdem, Devrim – International Journal of Assessment Tools in Education, 2020
The purpose of this study was to develop a scale measuring attitudes toward women's working. In line with this main purpose, two studies were conducted to develop the tool and investigate its psychometric properties in two different samples. Study 1 started with generating an item pool, conducting exploratory factor analysis to identify…
Descriptors: Young Adults, Employed Women, Test Construction, Error of Measurement