Ruairc, Gerry Mac – Irish Educational Studies, 2009
The recent decision by the Department of Education and Science in the Republic of Ireland to introduce the mandatory testing of children in Irish primary schools provides the broad context for this paper. This decision has particular implications for schools designated as disadvantaged. The main focus of this study is on identifying the strategies…
Descriptors: Foreign Countries, Disadvantaged Schools, Standardized Tests, Disadvantaged Youth
Janus, Magdalena; Hertzman, Clyde; Guhn, Martin; Brinkman, Sally; Goldfeld, Sharon – Early Years: An International Journal of Research and Development, 2009
This article presents a response to the paper by Li, D'Angiulli and Kendall (2007). The authors address two key aspects of this paper. The first concerns a number of errors and misconceptions in the paper that the authors think are important to clarify and correct. The second issue relates to the significant amount of research and effort that has…
Descriptors: Foreign Countries, Misconceptions, Student Diversity, Indigenous Populations
Echternacht, Gary – 1976
This paper proposes a method of transforming item p-values (the proportion answering a test item correctly) to what are termed "delta" values. First used by Conrad in 1948, deltas are routine statistics computed in all analyses at Educational Testing Service. Using this approach, one would conclude no test bias if differences in resulting deltas…
Descriptors: Item Analysis, Test Bias
Garcia Laborda, Jesus; Magal-Royo, Teresa; de Siqueira Rocha, Jose Macario; Alvarez, Miguel Fernandez – Computers & Education, 2010
Although much has been said about ergonomics in interface and in computer tools and interface design, very few articles in major journals have addressed this topic in relation to language testing. This article describes an experiment carried out at the Polytechnic University of Valencia, Spain, in which 27 Media and Communication students provided…
Descriptors: Educational Testing, Language Tests, Foreign Countries, Internet
Tristan, Agustin; Vidal, Rafael – Online Submission, 2007
Wright and Stone proposed three features for assessing the quality of the distribution of item difficulties in a test on the so-called "most probable response map": line, stack and gap. Once a line is accepted as a design model for a test, gaps and stacks are practically eliminated, producing evidence of the "scale…
Descriptors: Test Validity, Models, Difficulty Level, Test Items
PEPNet, 2009
PEPNet's "Perspectives" is the collaborative newsletter of the four PEPNet regional centers. This newsletter combines each center's individual strengths into a single resource that can be used on a national level. The issue focuses on the following topics: (1) Web Tool Locates Needed Resources; (2) Family Center on Technology and Disability (Ana…
Descriptors: Disabilities, Deafness, Web Sites, Access to Information
Koedel, Cory – Economics of Education Review, 2009
This paper examines whether educational production in secondary school involves joint production among teachers across subjects. In doing so, it also provides insights into the reliability of value-added modeling. Teacher value-added to reading test scores is estimated for four different teacher types: English, math, science and social-studies.…
Descriptors: Teacher Role, Reading Tests, English Teachers, Secondary School Teachers
Kato, Kentaro; Moen, Ross E.; Thurlow, Martha L. – Educational Measurement: Issues and Practice, 2009
Large data sets from a state reading assessment for third and fifth graders were analyzed to examine differential item functioning (DIF), differential distractor functioning (DDF), and differential omission frequency (DOF) between students with particular categories of disabilities (speech/language impairments, learning disabilities, and emotional…
Descriptors: Learning Disabilities, Language Impairments, Behavior Disorders, Affective Behavior
Klenowski, Val – Teaching Education, 2009
This article provides the background and context to the important issue of assessment and equity in relation to Indigenous students in Australia. Questions about the validity and fairness of assessment are raised and ways forward are suggested by attending to assessment questions in relation to equity and culture-fair assessment. Patterns of…
Descriptors: Indigenous Populations, Foreign Countries, Culturally Relevant Education, Test Bias
Carle, Adam C. – Hispanic Journal of Behavioral Sciences, 2008
Confirmatory factor analyses for ordered-categorical measures probed for differential item functioning on a standardized measure of alcohol dependence across Hispanics (n = 834) and non-Hispanic Caucasians (n = 14,001) in a nationally representative survey of alcohol use in the United States conducted in 1992. Analyses investigated whether 30…
Descriptors: Test Bias, Validity, Drinking, Factor Analysis
Brennan, David J. – Higher Education Research and Development, 2008
This paper provides an overview of the issue of student anonymity in the summative assessment of student work in higher education. It considers both theoretical literature pertaining to bias in the evaluation of the work of others and the limited empirical work undertaken on this issue in higher education. It then describes the experience of three…
Descriptors: Higher Education, Student Evaluation, Interrater Reliability, Test Bias
Lamprianou, Iasonas – International Journal of Testing, 2008
This study investigates the effect of reporting the unadjusted raw scores in a high-stakes language exam when raters differ significantly in severity and self-selected questions differ significantly in difficulty. More sophisticated models, introducing meaningful facets and parameters, are successively used to investigate the characteristics of…
Descriptors: High Stakes Tests, Raw Scores, Item Response Theory, Language Tests
Liu, Ou Lydia; Schedl, Mary; Malloy, Jeanne; Kong, Nan – Educational Testing Service, 2009
The TOEFL iBT™ has increased the length of the reading passages in the reading section compared to the passages on the TOEFL® computer-based test (CBT) to better approximate academic reading in North American universities, resulting in a reduced number of passages in the reading test. A concern arising from this change is whether the decrease…
Descriptors: English (Second Language), Language Tests, Internet, Computer Assisted Testing
Bond, Lloyd – Carnegie Foundation for the Advancement of Teaching, 2007
The writer examines a variety of reasons why test performance may not always be a valid measure of a person's competence or potential. Noting that a sizable percentage of students perform well in their schoolwork but poorly on standardized, multiple-choice tests, Bond defines and discusses four candidates as source factors for the phenomenon: (1)…
Descriptors: Test Bias, Test Anxiety, Standardized Tests, Multiple Choice Tests
Assessment and Accountability Comprehensive Center, 2007
This body of evidence summary reports the results of the evaluation of technical evidence in support of the California English Language Development Test (CELDT), as analyzed against a validated list of technical adequacy criteria. The table presented in this paper outlines the types of validity, reliability, and bias and sensitivity evidence…
Descriptors: Evidence, Validity, Language Acquisition, Language Proficiency

