Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 4 |
Since 2016 (last 10 years) | 10 |
Since 2006 (last 20 years) | 25 |
Descriptor
Comparative Analysis | 43 |
Test Items | 43 |
Test Construction | 13 |
Foreign Countries | 10 |
Computer Assisted Testing | 9 |
Difficulty Level | 9 |
Evaluation Methods | 8 |
Item Analysis | 8 |
Scores | 8 |
Academic Achievement | 7 |
Test Format | 7 |
Author
Foy, Pierre, Ed. | 2 |
Abedi, Jamal | 1 |
Acquaye, Rosemary | 1 |
Arora, Alka, Ed. | 1 |
Arth, Thomas O. | 1 |
Baldwin, Peter | 1 |
Belur, Madhu N. | 1 |
Benderson, Albert, Ed. | 1 |
Beretvas, S. Natasha | 1 |
Briggs, Derek C. | 1 |
Brusco, Michael J. | 1 |
Publication Type
Reports - Descriptive | 43 |
Journal Articles | 27 |
Speeches/Meeting Papers | 5 |
Guides - Non-Classroom | 4 |
Opinion Papers | 2 |
Collected Works - General | 1 |
Collected Works - Serials | 1 |
Reports - Evaluative | 1 |
Education Level
Elementary Secondary Education | 7 |
Elementary Education | 3 |
Grade 4 | 2 |
Higher Education | 2 |
Grade 6 | 1 |
Grade 8 | 1 |
High School Equivalency… | 1 |
Intermediate Grades | 1 |
Postsecondary Education | 1 |
Secondary Education | 1 |
Audience
Practitioners | 2 |
Administrators | 1 |
Location
Canada | 2 |
Botswana | 1 |
Colorado (Boulder) | 1 |
Czech Republic | 1 |
Florida | 1 |
Hong Kong | 1 |
Israel | 1 |
Japan | 1 |
South Africa | 1 |
Tennessee | 1 |
United Kingdom (Great Britain) | 1 |
Laws, Policies, & Programs
No Child Left Behind Act 2001 | 2 |
Race to the Top | 1 |
Gyamfi, Abraham; Acquaye, Rosemary – Acta Educationis Generalis, 2023
Introduction: Item response theory (IRT) has received much attention in the validation of assessment instruments because it allows the estimation of students' ability from any set of the items. Item response theory allows the difficulty and discrimination levels of each item on the test to be estimated. In the framework of IRT, item characteristics are…
Descriptors: Item Response Theory, Models, Test Items, Difficulty Level
Baldwin, Peter; Clauser, Brian E. – Journal of Educational Measurement, 2022
While score comparability across test forms typically relies on common (or randomly equivalent) examinees or items, innovations in item formats, test delivery, and efforts to extend the range of score interpretation may require a special data collection before examinees or items can be used in this way--or may be incompatible with common examinee…
Descriptors: Scoring, Testing, Test Items, Test Format
Fuchimoto, Kazuma; Ishii, Takatoshi; Ueno, Maomi – IEEE Transactions on Learning Technologies, 2022
Educational assessments often require uniform test forms, for which each test form has equivalent measurement accuracy but with a different set of items. For uniform test assembly, an important issue is the increase of the number of assembled uniform tests. Although many automatic uniform test assembly methods exist, the maximum clique algorithm…
Descriptors: Simulation, Efficiency, Test Items, Educational Assessment
Puhan, Gautam; Kim, Sooyeon – Journal of Educational Measurement, 2022
As a result of the COVID-19 pandemic, at-home testing has become a popular delivery mode in many testing programs. When programs offer at-home testing to expand their service, the score comparability between test takers testing remotely and those testing in a test center is critical. This article summarizes statistical procedures that could be…
Descriptors: Scores, Scoring, Comparative Analysis, Testing
Li, Jie; van der Linden, Wim J. – Journal of Educational Measurement, 2018
The final step of the typical process of developing educational and psychological tests is to place the selected test items in a formatted form. The step involves the grouping and ordering of the items to meet a variety of formatting constraints. As this activity tends to be time-intensive, the use of mixed-integer programming (MIP) has been…
Descriptors: Programming, Automation, Test Items, Test Format
Sinharay, Sandip – Educational Measurement: Issues and Practice, 2018
The choice of anchor tests is crucial in applications of the nonequivalent groups with anchor test design of equating. Sinharay and Holland (2006, 2007) suggested "miditests," which are anchor tests that are content-representative and have the same mean item difficulty as the total test but have a smaller spread of item difficulties.…
Descriptors: Test Content, Difficulty Level, Test Items, Test Construction
Moothedath, Shana; Chaporkar, Prasanna; Belur, Madhu N. – Perspectives in Education, 2016
In recent years, the computerised adaptive test (CAT) has gained popularity over conventional exams in evaluating student capabilities with desired accuracy. However, the key limitation of CAT is that it requires a large pool of pre-calibrated questions. In the absence of such a pre-calibrated question bank, offline exams with uncalibrated…
Descriptors: Guessing (Tests), Computer Assisted Testing, Adaptive Testing, Maximum Likelihood Statistics
Guskey, Thomas R. – Journal of Staff Development, 2016
Effective professional learning evaluation requires consideration of five critical stages or levels of information. These five levels, which are presented in this article, represent an adaptation of an evaluation model developed by Kirkpatrick (1959, 1998) for judging the value of supervisory training programs in business and industry.…
Descriptors: Hierarchical Linear Modeling, Outcomes of Education, Supervisory Training, Faculty Development
Pelánek, Radek; Řihák, Jiří – International Educational Data Mining Society, 2016
In online educational systems we can easily collect and analyze extensive data about student learning. Current practice, however, focuses only on some aspects of these data, particularly on correctness of students' answers. When a student answers incorrectly, the submitted wrong answer can give us valuable information. We provide an overview of…
Descriptors: Foreign Countries, Online Systems, Geography, Anatomy
Mitchell, Alison M.; Truckenmiller, Adrea; Petscher, Yaacov – Communique, 2015
As part of the Race to the Top initiative, the United States Department of Education made nearly 1 billion dollars available in State Educational Technology grants with the goal of ramping up school technology. One result of this effort is that states, districts, and schools across the country are using computerized assessments to measure their…
Descriptors: Computer Assisted Testing, Educational Technology, Testing, Efficiency
GED Testing Service, 2016
This guide is designed to help adult educators and administrators better understand the content of the GED® test. This guide is tailored to each test subject and highlights the test's item types, assessment targets, and guidelines for how items will be scored. This 2016 edition has been updated to include the most recent information about the…
Descriptors: Guidelines, Teaching Guides, High School Equivalency Programs, Test Items
Maydeu-Olivares, Alberto – Measurement: Interdisciplinary Research and Perspectives, 2013
In this rejoinder, Maydeu-Olivares states that, in item response theory (IRT) measurement applications, the application of goodness-of-fit (GOF) methods informs researchers of the discrepancy between the model and the data being fitted (the room for improvement). By routinely reporting the GOF of IRT models, together with the substantive results…
Descriptors: Goodness of Fit, Models, Evaluation Methods, Item Response Theory
Lyons, Douglas; Niblock, Andrew W. – Independent School, 2014
Independent schools are, for the most part, exempt from mandatory participation in standardized tests designed for state and federal comparisons, nor are they required to take part in comparative international assessments. The anxiety in the broader culture, however, is driving a growing interest among independent school parents (and prospective…
Descriptors: Global Approach, Comparative Analysis, Comparative Education, Educational Practices
Reeves, Cheryl; Major, Thenjiwe – Prospects: Quarterly Review of Comparative Education, 2012
This article describes how a detailed notebook analysis was used to assess and compare the opportunity to learn of a sample of grade 6 students from 126 classes in South East Botswana and North West Province, South Africa. Students' mathematics notebooks provided the main data source for estimating how much time is spent on the subject during the…
Descriptors: Academic Achievement, Test Items, Foreign Countries, Grade 6
Soh, Kay Cheng – Higher Education Review, 2012
Three university ranking systems in vogue have been shown in the previous issue of "Higher Education Review" to be capable of modifications to make them more parsimonious by using only about half of the number of predictors currently in use. This makes some of the predictors "redundant" as they contributed little to the overall ranking. It is…
Descriptors: Higher Education, Predictor Variables, Profiles, Test Items