Jinjin Huang – ProQuest LLC, 2020
Measurement invariance is crucial for an effective and valid measure of a construct. Invariance holds when the latent trait varies consistently across subgroups; in other words, the mean differences among subgroups are only due to true latent ability differences. Differential item functioning (DIF) occurs when measurement invariance is violated.…
Descriptors: Robustness (Statistics), Item Response Theory, Test Items, Item Analysis
Hidalgo, Ma Dolores; Benítez, Isabel; Padilla, Jose-Luis; Gómez-Benito, Juana – Sociological Methods & Research, 2017
The growing use of scales in survey questionnaires warrants the need to address how polytomous differential item functioning (DIF) affects observed scale score comparisons. The aim of this study is to investigate the impact of DIF on the type I error and effect size of the independent samples t-test on the observed total scale scores. A…
Descriptors: Test Items, Test Bias, Item Response Theory, Surveys
Ra, Jongmin; Rhee, Ki Jong – Educational Sciences: Theory and Practice, 2018
A fundamental challenge to understanding the effects of foreign language anxiety on foreign language learning lies in implementing reliable and valid measures. Considering the importance of measurement bias and the widespread usage of the Foreign Language Classroom Anxiety Scale (FLCAS) in education, the aim of the current study was to detect differential…
Descriptors: Gender Differences, Second Language Learning, Second Language Instruction, Anxiety
Ayala-Nunes, Lara; Jiménez, Lucía; Hidalgo, Victoria; Dekovic, Maja; Jesus, Saul – Research on Social Work Practice, 2018
Objective: The measurement of Family Feedback on Child Welfare Services (FF-CWS) is gaining prominence as an efficacy indicator and is coherent with concerns about family-centered practice and empowerment. The aim of this study was to develop and validate an instrument that would overcome the scarcity of psychometrically sound measures in this…
Descriptors: Feedback (Response), Error of Measurement, Validity, Child Welfare
Gómez-Benito, Juana; Hidalgo, Maria Dolores; Zumbo, Bruno D. – Educational and Psychological Measurement, 2013
The objective of this article was to find an optimal decision rule for identifying polytomous items with large or moderate amounts of differential functioning. The effectiveness of combining statistical tests with effect size measures was assessed using logistic discriminant function analysis and two effect size measures: R² and…
Descriptors: Item Analysis, Test Items, Effect Size, Statistical Analysis
Skinner, Ellen; Saxton, Emily; Currie, Cailin; Shusterman, Gwen – International Journal of Science Education, 2017
As part of long-standing efforts to promote undergraduates' success in science, researchers have investigated the instructional strategies and motivational factors that promote student learning and persistence in science coursework and majors. This study aimed to create a set of brief measures that educators and researchers can use as tools to…
Descriptors: Undergraduate Students, Science Instruction, Majors (Students), Biology
Pereira, Nielsen; Bakhiet, Salaheldin Farah; Gentry, Marcia; Balhmar, Tahani Abdulrahman; Hakami, Sultan Mohammed – Journal of Advanced Academics, 2017
This study examined the psychometric properties and measurement invariance of the Arabic version of "My Class Activities" (MCA), an instrument designed to measure students' perceptions of interest, challenge, choice, and enjoyment in classrooms. Scores of 3,516 Sudanese students in Grades 2 to 8 were used. Confirmatory factor analysis…
Descriptors: Student Attitudes, Factor Analysis, Comparative Analysis, Gifted
Suzuki, Yuichi – Language Testing, 2015
Self-assessment has been used to assess second language proficiency; however, as sources of measurement errors vary, they may threaten the validity and reliability of the tools. The present paper investigated the role of experiences in using Japanese as a second language in the naturalistic acquisition context on the accuracy of the…
Descriptors: Self Evaluation (Individuals), Error of Measurement, Japanese, Second Language Learning
McLean, Stuart; Kramer, Brandon; Beglar, David – Language Teaching Research, 2015
An important gap in the field of second language vocabulary assessment concerns the lack of validated tests measuring aural vocabulary knowledge. The primary purpose of this study is to introduce and provide preliminary validity evidence for the Listening Vocabulary Levels Test (LVLT), which has been designed as a diagnostic tool to measure…
Descriptors: Test Construction, Test Validity, English (Second Language), Second Language Learning
Svetina, Dubravka; Rutkowski, Leslie – Large-scale Assessments in Education, 2014
Background: When studying student performance across different countries or cultures, an important aspect for comparisons is that of score comparability. In other words, it is imperative that the latent variable (i.e., construct of interest) is understood and measured equivalently across all participating groups or countries, if our inferences…
Descriptors: Test Items, Item Response Theory, Item Analysis, Regression (Statistics)
Zwick, Rebecca – ETS Research Report Series, 2012
Differential item functioning (DIF) analysis is a key component in the evaluation of the fairness and validity of educational tests. The goal of this project was to review the status of ETS DIF analysis procedures, focusing on three aspects: (a) the nature and stringency of the statistical rules used to flag items, (b) the minimum sample size…
Descriptors: Test Bias, Sample Size, Bayesian Statistics, Evaluation Methods
Schafer, William D.; Coverdale, Bradley J.; Luxenberg, Harlan; Jin, Ying – Practical Assessment, Research & Evaluation, 2011
There are relatively few examples of quantitative approaches to quality control in educational assessment and accountability contexts. Among the several techniques that are used in other fields, Shewhart charts have been found in a few instances to be applicable in educational settings. This paper describes Shewhart charts and gives examples of how…
Descriptors: Charts, Quality Control, Educational Assessment, Statistical Analysis
Keiffer, Elizabeth Ann – ProQuest LLC, 2011
A differential item functioning (DIF) simulation study was conducted to explore the type and level of impact that contamination had on type I error and power rates in DIF analyses when the suspect item favored the same or opposite group as the DIF items in the matching subtest. Type I error and power rates were displayed separately for the…
Descriptors: Test Items, Sample Size, Simulation, Identification
Argyropoulos, Vassilios; Sideridis, Georgios D.; Botsas, George; Padeliadu, Susana – Assessment for Effective Intervention, 2012
The purpose of the present study was to assess self-regulation of students with visual impairments across two academic subjects, language and math. The participants were 46 Greek students with visual impairments who completed self-regulation measures across the subject matters of language and math. Initially, the factorial validity of the scale…
Descriptors: Visual Impairments, Self Evaluation (Individuals), Mathematics, Metacognition
Kwon, Hyungil Harry; Pyun, Do Young; Han, Siwan; Ogasawara, Etsuko – Asia Pacific Journal of Education, 2011
The objective of this study was to provide empirical evidence to support psychometric properties of a modified four-dimensional model of the Leadership Scale for Sports (LSS). The study tested invariance of all parameters (i.e., factor loadings, error variances, and factor variances-covariances) in the four-dimensional measurement model between…
Descriptors: Feedback (Response), Testing, Athletes, Factor Structure