Schmelkes, Silvia – Education Policy Analysis Archives, 2018
The National Institute for the Evaluation of Education (INEE) in Mexico has begun to meet the challenges in evaluating indigenous children and teachers and the educational programs and policies targeted to them. Several evaluation projects are described in this paper. One is the "Previous, Free and Informed Consultation of Indigenous…
Descriptors: Foreign Countries, Educational Assessment, Indigenous Populations, Educational Quality
Ercikan, Kadriye; Roth, Wolff-Michael; Simon, Marielle; Sandilands, Debra; Lyons-Thomas, Juliette – Applied Measurement in Education, 2014
Diversity and heterogeneity among language groups have been well documented. Yet most fairness research that focuses on measurement comparability considers linguistic minority students such as English language learners (ELLs) or Francophone students living in minority contexts in Canada as a single group. Our focus in this research is to examine…
Descriptors: Test Bias, Language Minorities, French Canadians, Measurement
Steiner, Peter M.; Kim, Yongnam – Society for Research on Educational Effectiveness, 2014
In contrast to randomized experiments, the estimation of unbiased treatment effects from observational data requires an analysis that conditions on all confounding covariates. Conditioning on covariates can be done via standard parametric regression techniques or nonparametric matching like propensity score (PS) matching. The regression or…
Descriptors: Observation, Research Methodology, Test Bias, Regression (Statistics)
Liu, Ou Lydia; Mao, Liyang; Zhao, Tingting; Yang, Yi; Xu, Jun; Wang, Zhen – ETS Research Report Series, 2016
Chinese higher education is experiencing rapid development and growth. With tremendous resources invested in higher education, policy makers have requested more direct evidence of student learning. However, assessment tools that can be used to measure college-level learning are scarce in China. To mitigate this situation, we translated the…
Descriptors: Foreign Countries, Higher Education, Critical Thinking, College Students
Yu, Guoxing; Zhang, Jing – Language Assessment Quarterly, 2017
In this special issue on high-stakes English language testing in China, the two articles on computer-based testing (Jin & Yan; He & Min) highlight a number of consistent, ongoing challenges and concerns in the development and implementation of the nationwide IB-CET (Internet Based College English Test) and institutional computer-adaptive…
Descriptors: Foreign Countries, Computer Assisted Testing, English (Second Language), Language Tests
Oberski, Daniel L.; Vermunt, Jeroen K. – Measurement: Interdisciplinary Research and Perspectives, 2013
These authors congratulate Albert Maydeu-Olivares on his lucid and timely overview of goodness-of-fit assessment in IRT models, a field to which he himself has contributed considerably in the form of limited information statistics. In this commentary, Oberski and Vermunt focus on two aspects of model fit: (1) what causes there may be of misfit;…
Descriptors: Goodness of Fit, Item Response Theory, Models, Test Bias
Yalcin, Seher – Eurasian Journal of Educational Research, 2018
Purpose: Studies in the literature have generally demonstrated that the causes of differential item functioning (DIF) are complex and not directly related to defined groups. The purpose of this study is to determine the DIF according to the mixture item response theory (MixIRT) model, based on the latent group approach, as well as the…
Descriptors: Item Response Theory, Test Items, Test Bias, Error of Measurement
Oon, Pey Tee; Subramaniam, R. – International Journal of Science Education, 2018
We report here on a comparative study of middle school students' attitudes towards science involving three countries: England, Singapore and the U.S.A. Complete attitudinal data sets from TIMSS (Trends in International Mathematics and Science Study) 2011 were used, thus giving a very large sample size (N = 20,246), compared to other studies in the…
Descriptors: Foreign Countries, Comparative Education, Middle School Students, Student Attitudes
Mao, Liyang; Liu, Ou Lydia; Roohr, Katrina; Belur, Vinetha; Mulholland, Matthew; Lee, Hee-Sun; Pallant, Amy – Educational Assessment, 2018
Scientific argumentation is one of the core practices for teachers to implement in science classrooms. We developed a computer-based formative assessment to support students' construction and revision of scientific arguments. The assessment is built upon automated scoring of students' arguments and provides feedback to students and teachers.…
Descriptors: Computer Assisted Testing, Science Tests, Scoring, Automation
Terzi, Ragip; Suh, Youngsuk – Journal of Educational Measurement, 2015
An odds ratio approach (ORA) under the framework of a nested logit model was proposed for evaluating differential distractor functioning (DDF) in multiple-choice items and was compared with an existing ORA developed under the nominal response model. The performances of the two ORAs for detecting DDF were investigated through an extensive…
Descriptors: Test Bias, Multiple Choice Tests, Test Items, Comparative Analysis
Ravand, Hamdollah – Practical Assessment, Research & Evaluation, 2015
Multilevel models (MLMs) are flexible in that they can be employed to obtain item and person parameters, test for differential item functioning (DIF) and capture both local item and person dependence. Papers on the MLM analysis of item response data have focused mostly on theoretical issues where applications have been add-ons to simulation…
Descriptors: Item Response Theory, Hierarchical Linear Modeling, Educational Testing, Reading Comprehension
Kun, András István – Assessment & Evaluation in Higher Education, 2015
This paper introduces an empirical study testing three kinds of bias in higher education student assessment. All of them are connected to the repetitive use of the same test questions which may facilitate academic cheating. The "same tests effect" may appear if two or more groups of students are writing the same test one after the other…
Descriptors: Test Bias, College Students, Student Evaluation, Evaluation Methods
Federer, Meghan Rector; Nehm, Ross H.; Opfer, John E.; Pearl, Dennis – Research in Science Education, 2015
A large body of work has been devoted to reducing assessment biases that distort inferences about students' science understanding, particularly in multiple-choice instruments (MCI). Constructed-response instruments (CRI), however, have invited much less scrutiny, perhaps because of their reputation for avoiding many of the documented biases of…
Descriptors: Science Education, Science Process Skills, Scientific Concepts, Concept Formation
Garcia, Ernest – Multicultural Education, 2015
Other than being African American, little is known of Larry, the lead plaintiff in the legal case known as "Larry P. v. Riles" in 1971, which banned the use of standardized intelligence testing on African-American students in the State of California. As a result of such intelligence testing, Larry was diagnosed as being mildly mentally…
Descriptors: Court Litigation, Intelligence Tests, African American Students, Clinical Diagnosis
Farrington, Amber L.; Lonigan, Christopher J. – Journal of Learning Disabilities, 2015
Children's emergent literacy skills are highly predictive of later reading abilities. To determine which children have weaker emergent literacy skills and are in need of intervention, it is necessary to assess emergent literacy skills accurately and reliably. In this study, 1,351 children were administered the "Revised Get Ready to…
Descriptors: Emergent Literacy, Preschool Children, Reading Tests, Item Response Theory

