NotesFAQContact Us
Collection
Advanced
Search Tips
Laws, Policies, & Programs
What Works Clearinghouse Rating
Showing 1 to 15 of 30 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
Tony Albano; Brian F. French; Thao Thu Vo – Applied Measurement in Education, 2024
Recent research has demonstrated an intersectional approach to the study of differential item functioning (DIF). This approach expands DIF to account for the interactions between what have traditionally been treated as separate grouping variables. In this paper, we compare traditional and intersectional DIF analyses using data from a state testing…
Descriptors: Test Items, Item Analysis, Data Use, Standardized Tests
Peer reviewed Peer reviewed
Direct linkDirect link
Raker, Jeffrey R.; Holme, Thomas A. – Journal of Chemical Education, 2013
Standardized examinations, such as those developed and disseminated by the ACS Examinations Institute, are artifacts of the teaching of a course and over time may provide a historical perspective on how curricula have changed and evolved. This study investigated changes in organic chemistry curricula across a 60-year period by evaluating 18 ACS…
Descriptors: Organic Chemistry, Science Education History, Curriculum Research, Educational Development
Doorey, Nancy; Polikoff, Morgan – Thomas B. Fordham Institute, 2016
Approximately one-third of American freshmen at two-year and four-year colleges require remedial coursework and over 40 percent of employers rate new hires with a high school diploma as "deficient" in their overall preparation for entry-level jobs. Yet, over the past decade, as these students marched through America's public education…
Descriptors: Standardized Tests, State Standards, Test Items, Evaluation Criteria
Peer reviewed Peer reviewed
Direct linkDirect link
Cabrera, Nolan L.; Cabrera, George A. – Educational Horizons, 2011
Just like all the high-stakes tests that determine students' futures nowadays, The Chorizo Test is a standardized test rooted in the culture of the test makers. It was originally created to be used with students in teacher training programs to sensitize them to the pitfalls inherent in standardized pencil-and-paper tests, such as linguistic bias…
Descriptors: Test Use, Standardized Tests, Social Sciences, High Stakes Tests
Kelley, Ronald Scott – ProQuest LLC, 2012
Scope and Method of Study: This study focused on the development and use of the AT-SAT test battery and the Initial En Route Qualification training course for the selection, training, and evaluation of air traffic controller candidates. The Pearson product moment correlation coefficient was used to measure the linear relationship between the…
Descriptors: Traffic Safety, Scores, Equated Scores, Multiple Regression Analysis
Irvin, P. Shawn; Alonzo, Julie; Lai, Cheng-Fei; Park, Bitnara Jasmine; Tindal, Gerald – Behavioral Research and Teaching, 2012
In this technical report, we present the results of a reliability study of the seventh-grade multiple choice reading comprehension measures available on the easyCBM learning system conducted in the spring of 2011. Analyses include split-half reliability, alternate form reliability, person and item reliability as derived from Rasch analysis,…
Descriptors: Reading Comprehension, Testing Programs, Statistical Analysis, Grade 7
Meyers, Jason L.; Murphy, Stephen; Goodman, Joshua; Turhan, Ahmet – Pearson, 2012
Operational testing programs employing item response theory (IRT) applications benefit from of the property of item parameter invariance whereby item parameter estimates obtained from one sample can be applied to other samples (when the underlying assumptions are satisfied). In theory, this feature allows for applications such as computer-adaptive…
Descriptors: Equated Scores, Test Items, Test Format, Item Response Theory
Peer reviewed Peer reviewed
Direct linkDirect link
Lowrie, Tom; Diezmann, Carmel M. – Australian Journal of Education, 2009
Mandatory numeracy tests have become commonplace in many countries, heralding a new era in school assessment. New forms of accountability and an increased emphasis on national and international standards (and benchmarks) have the potential to reshape mathematics curricula. It is noteworthy that the mathematics items used in these tests are rich in…
Descriptors: Testing Programs, Numeracy, Foreign Countries, Standardized Tests
Peer reviewed Peer reviewed
Direct linkDirect link
Cohen, Jon; Chan, Tsze; Jiang, Tao; Seburn, Mary – Applied Psychological Measurement, 2008
U.S. state educational testing programs administer tests to track student progress and hold schools accountable for educational outcomes. Methods from item response theory, especially Rasch models, are usually used to equate different forms of a test. The most popular method for estimating Rasch models yields inconsistent estimates and relies on…
Descriptors: Testing Programs, Educational Testing, Item Response Theory, Computation
Peer reviewed Peer reviewed
Wadkins, J. R. Jefferson – American Mathematical Monthly, 1978
Some background information is given about the GRE. A detailed account of its construction, its recent history, and some of the thinking that has gone into it is related. (MP)
Descriptors: College Mathematics, Graduate Study, Higher Education, Standardized Tests
Bassler, Otto C.; Caulkins, Thomas G. – 1984
A model for summarizing test scores and using them to modify instructional programs is presented. The proposed model consists of two types of summaries of the data gathered through standardized tests. The first summary contains individual and single class results. Information in a "Class Item Response Record" chart provides individual student…
Descriptors: Elementary Secondary Education, Instructional Improvement, Models, Scores
Kingston, Neal M.; Dorans, Neil J. – 1982
The feasibility of using item response theory (IRT) as a psychometric model for the Graduate Record Examination (GRE) Aptitude Test was addressed by assessing the reasonableness of the assumptions of item response theory for GRE item types and examinee populations. Items from four forms and four administrations of the GRE Aptitude Test were…
Descriptors: Aptitude Tests, Graduate Study, Higher Education, Latent Trait Theory
Green, Donald Ross – 1985
The use of item banks and item response theory has resulted in new ways to misinterpret and misuse tests through customized, yet standardized, achievement test batteries. The new test batteries create the possibility of serious misunderstandings based on the idea that any subset of items from the pool with a proper range of difficulties will…
Descriptors: Academic Achievement, Achievement Gains, Item Banks, Latent Trait Theory
Colvin, Stephen S. – Bureau of Education, Department of the Interior, 1924
A decade ago intelligence testing was in its beginnings in the United States. There were no standardized tests available except those of the Binet-Simon scale. These tests had been used but little, and chiefly for the detection and classification of the backward and the feeble-minded. Goddard had just begun pioneer work in this field, while…
Descriptors: Intelligence Tests, Intelligence, Performance Tests, Testing
Shorey, Leonard – 1991
Tests in social studies and integrated science given in Saint Vincent, Saint Lucia, Grenada, and Dominica were analyzed by the Organization for Co-operation in Overseas Development (OCOD) Comprehensive Teacher Training Program (CTTP) for discrimination, difficulty, and reliability, as well as other characteristics. There were 767 examinees for the…
Descriptors: Difficulty Level, Elementary Secondary Education, Evaluation Methods, Foreign Countries
Previous Page | Next Page ยป
Pages: 1  |  2