Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 0 |
Since 2016 (last 10 years) | 5 |
Since 2006 (last 20 years) | 6 |
Descriptor
Source
Applied Measurement in… | 8 |
Author
Publication Type
Journal Articles | 8 |
Reports - Research | 7 |
Reports - Evaluative | 1 |
Tests/Questionnaires | 1 |
Education Level
Elementary Education | 3 |
Secondary Education | 2 |
Early Childhood Education | 1 |
Elementary Secondary Education | 1 |
Grade 1 | 1 |
Grade 2 | 1 |
Grade 3 | 1 |
Grade 8 | 1 |
Junior High Schools | 1 |
Middle Schools | 1 |
Primary Education | 1 |
More ▼ |
Audience
Location
Canada | 2 |
Iran (Tehran) | 1 |
Netherlands | 1 |
Tennessee | 1 |
Laws, Policies, & Programs
Assessments and Surveys
Program for International… | 2 |
Stanford Achievement Tests | 1 |
Trends in International… | 1 |
What Works Clearinghouse Rating
Oliveri, Maria Elena; Ercikan, Kadriye; Lyons-Thomas, Juliette; Holtzman, Steven – Applied Measurement in Education, 2016
Differential item functioning (DIF) analyses have been used as the primary method in large-scale assessments to examine fairness for subgroups. Currently, DIF analyses are conducted utilizing manifest methods using observed characteristics (gender and race/ethnicity) for grouping examinees. Homogeneity of item responses is assumed denoting that…
Descriptors: Test Bias, Language Minorities, Effect Size, Foreign Countries
Grover, Raman K.; Ercikan, Kadriye – Applied Measurement in Education, 2017
In gender differential item functioning (DIF) research it is assumed that all members of a gender group have similar item response patterns and therefore generalizations from group level to subgroup and individual levels can be made accurately. However DIF items do not necessarily disadvantage every member of a gender group to the same degree,…
Descriptors: Gender Differences, Test Bias, Socioeconomic Status, Reading Achievement
Hawley, Leslie R.; Bovaird, James A.; Wu, ChaoRong – Applied Measurement in Education, 2017
Value-added assessment methods have been criticized by researchers and policy makers for a number of reasons. One issue includes the sensitivity of model results across different outcome measures. This study examined the utility of incorporating multivariate latent variable approaches within a traditional value-added framework. We evaluated the…
Descriptors: Value Added Models, Reliability, Multivariate Analysis, Scaling
Fagginger Auer, Marije F.; Hickendorff, Marian; Van Putten, Cornelis M.; Béguin, Anton A.; Heiser, Willem J. – Applied Measurement in Education, 2016
A first application of multilevel latent class analysis (MLCA) to educational large-scale assessment data is demonstrated. This statistical technique addresses several of the challenges that assessment data offers. Importantly, MLCA allows modeling of the often ignored teacher effects and of the joint influence of teacher and student variables.…
Descriptors: Educational Assessment, Multivariate Analysis, Classification, Data
Kabiri, Masoud; Ghazi-Tabatabaei, Mahmood; Bazargan, Abbas; Shokoohi-Yekta, Mohsen; Kharrazi, Kamal – Applied Measurement in Education, 2017
Numerous diagnostic studies have been conducted on large-scale assessments to illustrate the students' mastery profile in the areas of math and reading; however, for science a limited number of investigations are reported. This study investigated Iranian eighth graders' competency mastery of science and examined the utility of the General…
Descriptors: Elementary Secondary Education, Achievement Tests, International Assessment, Foreign Countries
Mroch, Andrew A.; Bolt, Daniel M. – Applied Measurement in Education, 2006
Recently, nonparametric methods have been proposed that provide a dimensionally based description of test structure for tests with dichotomous items. Because such methods are based on different notions of dimensionality than are assumed when using a psychometric model, it remains unclear whether these procedures might lead to a different…
Descriptors: Simulation, Comparative Analysis, Psychometrics, Methods Research

Schaefer, Lyn; And Others – Applied Measurement in Education, 1992
Studied methods for structuring a performance domain for a certification test in emergency nursing based on task frequency ratings from 659 emergency nurses or task similarity ratings from 21 experts. A 125-job analysis survey was used. Similarity judgment results are more easily interpreted and adequately modeled by multivariate analysis. (SLD)
Descriptors: Certification, Comparative Testing, Job Analysis, Licensing Examinations (Professions)

Witt, Elizabeth A.; And Others – Applied Measurement in Education, 1994
Differential prediction of course grades (language arts, mathematics, social studies, and science) from achievement test scores was studied for 634 female and 623 male eighth and ninth graders through multivariate analysis. Female performance was overpredicted except in language arts, but differences were seldom large enough to have practical…
Descriptors: Academic Achievement, Adolescents, Elementary Secondary Education, Females