Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 1 |
Since 2016 (last 10 years) | 2 |
Since 2006 (last 20 years) | 25 |
Descriptor
Testing Programs | 25 |
Test Bias | 17 |
Test Items | 8 |
Academic Achievement | 7 |
Item Response Theory | 7 |
Equated Scores | 6 |
Racial Bias | 6 |
Mathematics Tests | 5 |
Models | 5 |
Sample Size | 5 |
Scoring | 5 |
More ▼ |
Source
Author
Dorans, Neil J. | 3 |
Akour, Mutasem | 1 |
Albano, Anthony D. | 1 |
Ariel, Adelaide | 1 |
Benítez, Isabel | 1 |
Biddanda, Haley C. | 1 |
Bilir, Mustafa Kuzey | 1 |
Blatt, Jessica | 1 |
Bolt, Sara E. | 1 |
Doorey, Nancy A. | 1 |
Eastwell, Peter | 1 |
More ▼ |
Publication Type
Journal Articles | 15 |
Reports - Research | 13 |
Numerical/Quantitative Data | 6 |
Reports - Descriptive | 6 |
Reports - Evaluative | 4 |
Dissertations/Theses -… | 2 |
Opinion Papers | 1 |
Education Level
Grade 7 | 7 |
Grade 4 | 6 |
Grade 5 | 6 |
Grade 6 | 6 |
Grade 8 | 6 |
Elementary Secondary Education | 5 |
Early Childhood Education | 3 |
Elementary Education | 3 |
Grade 10 | 3 |
Grade 11 | 3 |
Grade 12 | 3 |
More ▼ |
Audience
Laws, Policies, & Programs
Assessments and Surveys
Program for International… | 3 |
California Achievement Tests | 1 |
Graduate Record Examinations | 1 |
Law School Admission Test | 1 |
SAT (College Admission Test) | 1 |
What Works Clearinghouse Rating
Biddanda, Haley C. – Communique, 2022
Race-based traumatic stress, also called racial trauma, refers to "mental and emotional injury caused by encounters with racial bias and ethnic discrimination, racism, and hate crimes" (Mental Health America, n.d.). While much research on racism-based stress in schools focuses on teachers, school psychologists can just as easily cause…
Descriptors: Racial Discrimination, Anxiety, African American Students, Testing Programs
Akour, Mutasem; Sabah, Saed; Hammouri, Hind – Journal of Psychoeducational Assessment, 2015
The purpose of this study was to apply two types of Differential Item Functioning (DIF), net and global DIF, as well as the framework of Differential Step Functioning (DSF) to real testing data to investigate measurement invariance related to test language. Data from the Program for International Student Assessment (PISA)-2006 polytomously scored…
Descriptors: Test Bias, Science Tests, Test Items, Scoring
Benítez, Isabel; Padilla, José-Luis – Journal of Mixed Methods Research, 2014
Differential item functioning (DIF) can undermine the validity of cross-lingual comparisons. While a lot of efficient statistics for detecting DIF are available, few general findings have been found to explain DIF results. The objective of the article was to study DIF sources by using a mixed method design. The design involves a quantitative phase…
Descriptors: Foreign Countries, Mixed Methods Research, Test Bias, Cross Cultural Studies
Keller, Lisa A.; Keller, Robert R. – Educational and Psychological Measurement, 2011
This article investigates the accuracy of examinee classification into performance categories and the estimation of the theta parameter for several item response theory (IRT) scaling techniques when applied to six administrations of a test. Previous research has investigated only two administrations; however, many testing programs equate tests…
Descriptors: Item Response Theory, Scaling, Sustainability, Classification
Albano, Anthony D. – Journal of Educational Measurement, 2013
In many testing programs it is assumed that the context or position in which an item is administered does not have a differential effect on examinee responses to the item. Violations of this assumption may bias item response theory estimates of item and person parameters. This study examines the potentially biasing effects of item position. A…
Descriptors: Test Items, Item Response Theory, Test Format, Questioning Techniques
Paek, Insu; Guo, Hongwen – Applied Psychological Measurement, 2011
This study examined how much improvement was attainable with respect to accuracy of differential item functioning (DIF) measures and DIF detection rates in the Mantel-Haenszel procedure when employing focal and reference groups with notably unbalanced sample sizes where the focal group has a fixed small sample which does not satisfy the minimum…
Descriptors: Test Bias, Accuracy, Reference Groups, Investigations
New York State Education Department, 2016
This technical report provides detailed information regarding the technical, statistical, and measurement attributes of the New York State Testing Program (NYSTP) for the Grades 3-8 Common Core English Language Arts (ELA) and Mathematics 2016 Operational Tests. This report includes information about test content and test development, item (i.e.,…
Descriptors: Testing Programs, English, Language Arts, Mathematics Tests
French, Brian F.; Finch, W. Holmes – Journal of Educational Measurement, 2010
The purpose of this study was to examine the performance of differential item functioning (DIF) assessment in the presence of a multilevel structure that often underlies data from large-scale testing programs. Analyses were conducted using logistic regression (LR), a popular, flexible, and effective tool for DIF detection. Data were simulated…
Descriptors: Test Bias, Testing Programs, Evaluation, Measurement
New York State Education Department, 2015
This technical report provides detailed information regarding the technical, statistical, and measurement attributes of the New York State Testing Program (NYSTP) for the Grades 3-8 Common Core English Language Arts (ELA) and Mathematics 2015 Operational Tests. This report includes information about test content and test development, item (i.e.,…
Descriptors: Testing Programs, English, Language Arts, Mathematics Tests
Sinharay, Sandip; Dorans, Neil J.; Liang, Longjuan – Educational Measurement: Issues and Practice, 2011
Over the past few decades, those who take tests in the United States have exhibited increasing diversity with respect to native language. Standard psychometric procedures for ensuring item and test fairness that have existed for some time were developed when test-taking groups were predominantly native English speakers. A better understanding of…
Descriptors: Test Bias, Testing Programs, Psychometrics, Language Proficiency
Pang, Valerie Ooka; Han, Peggy P.; Pang, Jennifer M. – Educational Researcher, 2011
The authors studied more than 1 million Asian American and Pacific Islander (AAPI) and White seventh graders in a statewide California testing program between 2003 and 2008, examining their reading and math achievement. AAPI student performance is often reported as an aggregate in discussions of the success of schoolchildren and issues of racial…
Descriptors: Achievement Gap, Testing Programs, Pacific Islanders, Grade 7
New York State Education Department, 2014
This technical report provides detailed information regarding the technical, statistical, and measurement attributes of the New York State Testing Program (NYSTP) for the Grades 3-8 Common Core English Language Arts (ELA) and Mathematics 2014 Operational Tests. This report includes information about test content and test development, item (i.e.,…
Descriptors: Testing Programs, English, Language Arts, Mathematics Tests
Doorey, Nancy A. – Council of Chief State School Officers, 2011
The work reported in this paper reflects a collaborative effort of many individuals representing multiple organizations. It began during a session at the October 2008 meeting of TILSA when a representative of a member state asked the group if any of their programs had experienced unexpected fluctuations in the annual state assessment scores, and…
Descriptors: Testing, Sampling, Expertise, Testing Programs
Bilir, Mustafa Kuzey – ProQuest LLC, 2009
This study uses a new psychometric model (mixture item response theory-MIMIC model) that simultaneously estimates differential item functioning (DIF) across manifest groups and latent classes. Current DIF detection methods investigate DIF from only one side, either across manifest groups (e.g., gender, ethnicity, etc.), or across latent classes…
Descriptors: Test Items, Testing Programs, Markov Processes, Psychometrics
Puhan, Gautam; Moses, Timothy P.; Yu, Lei; Dorans, Neil J. – Journal of Educational Measurement, 2009
This study examined the extent to which log-linear smoothing could improve the accuracy of differential item functioning (DIF) estimates in small samples of examinees. Examinee responses from a certification test were analyzed using White examinees in the reference group and African American examinees in the focal group. Using a simulation…
Descriptors: Test Items, Reference Groups, Testing Programs, Raw Scores
Previous Page | Next Page »
Pages: 1 | 2