Traditional vs Intersectional DIF Analysis: Considerations and a Comparison Using State Testing Data
Tony Albano; Brian F. French; Thao Thu Vo – Applied Measurement in Education, 2024
Recent research has demonstrated an intersectional approach to the study of differential item functioning (DIF). This approach expands DIF to account for the interactions between what have traditionally been treated as separate grouping variables. In this paper, we compare traditional and intersectional DIF analyses using data from a state testing…
Descriptors: Test Items, Item Analysis, Data Use, Standardized Tests
Doorey, Nancy; Polikoff, Morgan – Thomas B. Fordham Institute, 2016
Approximately one-third of American freshmen at two-year and four-year colleges require remedial coursework and over 40 percent of employers rate new hires with a high school diploma as "deficient" in their overall preparation for entry-level jobs. Yet, over the past decade, as these students marched through America's public education…
Descriptors: Standardized Tests, State Standards, Test Items, Evaluation Criteria
Albano, Anthony D. – Journal of Educational Measurement, 2013
In many testing programs it is assumed that the context or position in which an item is administered does not have a differential effect on examinee responses to the item. Violations of this assumption may bias item response theory estimates of item and person parameters. This study examines the potentially biasing effects of item position. A…
Descriptors: Test Items, Item Response Theory, Test Format, Questioning Techniques
Cresswell, John; Schwantner, Ursula; Waters, Charlotte – OECD Publishing, 2015
This report reviews the major international and regional large-scale educational assessments, including international surveys, school-based surveys and household-based surveys. The report compares and contrasts the cognitive and contextual data collection instruments and implementation methods used by the different assessments in order to identify…
Descriptors: International Assessment, Educational Assessment, Data Collection, Comparative Analysis
Cabrera, Nolan L.; Cabrera, George A. – Educational Horizons, 2011
Just like all the high-stakes tests that determine students' futures nowadays, The Chorizo Test is a standardized test rooted in the culture of the test makers. It was originally created to be used with students in teacher training programs to sensitize them to the pitfalls inherent in standardized pencil-and-paper tests, such as linguistic bias…
Descriptors: Test Use, Standardized Tests, Social Sciences, High Stakes Tests
Puhan, Gautam – Applied Measurement in Education, 2009
The purpose of this study is to determine the extent of scale drift on a test that employs cut scores. It was essential to examine scale drift for this testing program because new forms in this testing program are often put on scale through a series of intermediate equatings (known as equating chains). This process may cause equating error to…
Descriptors: Testing Programs, Testing, Measurement Techniques, Item Response Theory
Wyse, Adam E.; Mapuranga, Raymond – International Journal of Testing, 2009
Differential item functioning (DIF) analysis is a statistical technique used for ensuring the equity and fairness of educational assessments. This study formulates a new DIF analysis method using the information similarity index (ISI). ISI compares item information functions when data fits the Rasch model. Through simulations and an international…
Descriptors: Test Bias, Evaluation Methods, Test Items, Educational Assessment
Sadler, Troy D.; Zeidler, Dana L. – Journal of Research in Science Teaching, 2009
In this article, we explore the Programme for International Student Assessment (PISA) with a lens informed by the socioscientific issues (SSI) movement. We consider the PISA definition of scientific literacy and how it is situated with respect to broader discussions of the aims of science education. We also present an overview of the SSI framework…
Descriptors: Test Items, Scientific Literacy, Science Education, Science Process Skills
Ajuonuma, Juliet O. – African Higher Education Review, 2008
This study was designed to carry out a survey of the implementation of continuous assessment (CA) in Nigerian universities. Two research questions and one hypothesis were formulated to guide the study. The sample for the study consisted of 1,340 respondents. A 24-item self-report instrument was used for the study. The data generated were analyzed…
Descriptors: Foreign Countries, Program Implementation, Testing Programs, Test Items
Shorey, Leonard – 1991
Tests in social studies and integrated science given in Saint Vincent, Saint Lucia, Grenada, and Dominica were analyzed by the Organization for Co-operation in Overseas Development (OCOD) Comprehensive Teacher Training Program (CTTP) for discrimination, difficulty, and reliability, as well as other characteristics. There were 767 examinees for the…
Descriptors: Difficulty Level, Elementary Secondary Education, Evaluation Methods, Foreign Countries
Fenton, Ray; Straugh, Tom; Stofflet, Fred – 1997
Writing assessment began in Alaska in the 1970s, and the Alaska Writing Assessment (AWA) that was piloted in 1997 built on previous efforts. The 1997 AWA involved more than 20,000 students in grades 5, 7, and 10 from 43 school districts, and the mandatory assessment planned for 1998 will include approximately 28,000 students. This review of the…
Descriptors: Elementary Secondary Education, Evaluation Methods, Program Implementation, Resource Allocation
Yen, Shu Jing; Ochieng, Charles; Michaels, Hillary; Friedman, Greg – Online Submission, 2005
Year-to-year rater variation may result in constructed response (CR) parameter changes, making CR items inappropriate to use in anchor sets for linking or equating. This study demonstrates how rater severity affected the writing and reading scores. Rater adjustments were made to statewide results using an item response theory (IRT) methodology…
Descriptors: Test Items, Writing Tests, Reading Tests, Measures (Individuals)
Keating, Xiaofen Deng – Quest, 2003
This paper aims to examine current nationwide youth fitness test programs, address problems embedded in the programs, and propose possible solutions. The current Fitnessgram, President's Challenge, and YMCA youth fitness test programs were selected to represent nationwide youth fitness test programs. Sponsors of the nationwide youth fitness test programs…
Descriptors: Physical Education, Test Items, Physical Fitness, Youth Programs
Cook, Linda L.; Petersen, Nancy S. – 1986
This paper examines how various equating methods are affected by: (1) sampling error; (2) sample characteristics; and (3) characteristics of anchor test items. It reviews empirical studies that investigated the invariance of equating transformations, and it discusses empirical and simulation studies that focus on how the properties of anchor tests…
Descriptors: Educational Research, Equated Scores, Error of Measurement, Evaluation Methods
Valette, Rebecca M. – 1977
This handbook, intended for language teachers at all levels, is an introduction to foreign language testing. It is a revision and expansion of the edition that appeared ten years ago. This edition reflects contemporary concerns in measurement and evaluation and contemporary changes in teaching aims, particularly toward communicative competence.…
Descriptors: Achievement Tests, Affective Objectives, Bilingual Education, Communicative Competence (Languages)