Publication Date
| In 2026 | 0 |
| Since 2025 | 0 |
| Since 2022 (last 5 years) | 0 |
| Since 2017 (last 10 years) | 2 |
| Since 2007 (last 20 years) | 19 |
Descriptor
| Difficulty Level | 41 |
| Educational Testing | 41 |
| Test Items | 16 |
| Higher Education | 9 |
| Test Construction | 9 |
| Test Interpretation | 9 |
| Achievement Tests | 8 |
| Educational Assessment | 8 |
| Item Analysis | 8 |
| Multiple Choice Tests | 8 |
| Elementary Secondary Education | 7 |
| More ▼ | |
Source
Author
| Cahen, Leonard S. | 3 |
| Al-A'ali, Mansoor | 1 |
| Arhin, Ato Kwamina | 1 |
| Barden, Tiffannie M. | 1 |
| Blasius, Jorg | 1 |
| Buckendahl, Chad W. | 1 |
| Camara, Wayne | 1 |
| Chen, Deng-Jyi | 1 |
| Chen, Shu-Ling | 1 |
| Chu, Hui-Chun | 1 |
| Davis-Becker, Susan L. | 1 |
| More ▼ | |
Publication Type
Education Level
| Elementary Secondary Education | 6 |
| Higher Education | 5 |
| Elementary Education | 4 |
| Postsecondary Education | 4 |
| Grade 4 | 2 |
| Grade 6 | 2 |
| High Schools | 2 |
| Secondary Education | 2 |
| Grade 12 | 1 |
| Grade 5 | 1 |
| Grade 8 | 1 |
| More ▼ | |
Audience
Location
| Taiwan | 2 |
| Arizona | 1 |
| Australia | 1 |
| Ghana | 1 |
| New York | 1 |
| North Carolina | 1 |
| Pennsylvania | 1 |
| Tennessee | 1 |
| United Kingdom | 1 |
Laws, Policies, & Programs
| No Child Left Behind Act 2001 | 2 |
| Elementary and Secondary… | 1 |
Assessments and Surveys
| National Assessment of… | 3 |
| Program for International… | 2 |
| SAT (College Admission Test) | 2 |
| Graduate Record Examinations | 1 |
What Works Clearinghouse Rating
Metsämuuronen, Jari – International Journal of Educational Methodology, 2020
Pearson product-moment correlation coefficient between item g and test score X, known as item-test or item-total correlation ("Rit"), and item-rest correlation ("Rir") are two of the most used classical estimators for item discrimination power (IDP). Both "Rit" and "Rir" underestimate IDP caused by the…
Descriptors: Correlation, Test Items, Scores, Difficulty Level
Veldkamp, Bernard P. – Journal of Educational Measurement, 2016
Many standardized tests are now administered via computer rather than paper-and-pencil format. The computer-based delivery mode brings with it certain advantages. One advantage is the ability to adapt the difficulty level of the test to the ability level of the test taker in what has been termed computerized adaptive testing (CAT). A second…
Descriptors: Computer Assisted Testing, Reaction Time, Standardized Tests, Difficulty Level
DeMars, Christine E.; Jurich, Daniel P. – Educational and Psychological Measurement, 2015
In educational testing, differential item functioning (DIF) statistics must be accurately estimated to ensure the appropriate items are flagged for inspection or removal. This study showed how using the Rasch model to estimate DIF may introduce considerable bias in the results when there are large group differences in ability (impact) and the data…
Descriptors: Test Bias, Guessing (Tests), Ability, Differences
Quaigrain, Kennedy; Arhin, Ato Kwamina – Cogent Education, 2017
Item analysis is essential in improving items which will be used again in later tests; it can also be used to eliminate misleading items in a test. The study focused on item and test quality and explored the relationship between difficulty index (p-value) and discrimination index (DI) with distractor efficiency (DE). The study was conducted among…
Descriptors: Item Analysis, Teacher Developed Materials, Test Reliability, Educational Assessment
Zwick, Rebecca; Ye, Lei; Isham, Steven – ETS Research Report Series, 2013
Differential item functioning (DIF) analysis is a key component in the evaluation of the fairness and validity of educational tests. Although it is often assumed that refinement of the matching criterion always provides more accurate DIF results, the actual situation proves to be more complex. To explore the effectiveness of refinement, we…
Descriptors: Test Bias, Statistical Analysis, Simulation, Educational Testing
Liu, Hsin-min – ProQuest LLC, 2014
One of the fundamental problems in language testing is the lack of adequate generalizability between what a test is measuring and what fulfills the learners' real world language use needs. It is important to recognize that no matter how precise a test measures a construct, if the way that a construct is defined and the way that test tasks are…
Descriptors: Reading Tests, Language Tests, Task Analysis, Generalizability Theory
Davis-Becker, Susan L.; Buckendahl, Chad W.; Gerrow, Jack – International Journal of Testing, 2011
Throughout the world, cut scores are an important aspect of a high-stakes testing program because they are a key operational component of the interpretation of test scores. One method for setting standards that is prevalent in educational testing programs--the Bookmark method--is intended to be a less cognitively complex alternative to methods…
Descriptors: Standard Setting (Scoring), Cutting Scores, Educational Testing, Licensing Examinations (Professions)
Camara, Wayne – College Board, 2011
This presentation was presented at the 2011 National Conference on Student Assessment (CCSSO). The focus of this presentation is how to validate the common core state standards (CCSS) in math and ELA and the subsequent assessments that will be developed by state consortia. The CCSS specify the skills students need to be ready for post-secondary…
Descriptors: College Readiness, Career Readiness, Benchmarking, Student Evaluation
Liekar, Christine Y. – ProQuest LLC, 2012
Since the time of Sputnik, American educators and policymakers have recognized the need to raise expectations by increasing rigor in high schools across the United States. Copious studies attest to the fact that students who take Advanced Placement coursework experience success in college (Adelman, 1999; Camara, 2003; College Board, 2005;…
Descriptors: High School Students, Advanced Placement Programs, Educational Policy, Educational Practices
Schutz, Dick – Education Policy Analysis Archives, 2013
The commentary (1) uses the U. S. National Assessment of Educational Progress (NAEP) as a prototype for examining standardized reading achievement tests at the item level, and (2) sketches an alternative based on an initiative underway in the United Kingdom.
Descriptors: Educational Testing, Educational Change, Achievement Tests, Reading Achievement
Wheeler, Edward W. – ProQuest LLC, 2012
In early 1995, the University of Tennessee at Martin (UTM) sought permission to terminate three existing engineering technology degree programs and replace them with a single Bachelor of Science in Engineering (BSE) degree. As part of the requirements to proceed with the implementation of an engineering program, the University of Tennessee system…
Descriptors: Engineering, Engineering Education, Models, Prediction
Han, Kyung T. – Practical Assessment, Research & Evaluation, 2012
For several decades, the "three-parameter logistic model" (3PLM) has been the dominant choice for practitioners in the field of educational measurement for modeling examinees' response data from multiple-choice (MC) items. Past studies, however, have pointed out that the c-parameter of 3PLM should not be interpreted as a guessing…
Descriptors: Statistical Analysis, Models, Multiple Choice Tests, Guessing (Tests)
Chu, Hui-Chun; Hwang, Gwo-Jen; Huang, Yueh-Min – Innovations in Education and Teaching International, 2010
Conventional testing systems usually give students a score as their test result, but do not show them how to improve their learning performance. Researchers have indicated that students would benefit more if individual learning guidance could be provided. However, most of the existing learning diagnosis models ignore the fact that one concept…
Descriptors: Test Results, Teaching Methods, Elementary School Students, Elementary School Teachers
Isler, Tesha – ProQuest LLC, 2012
The problem examined in this study: Does the majority of teachers use rigorous teaching and testing practices? The purpose of this qualitative exploratory case study was to explore the classroom techniques of six effective teachers who use rigorous teaching and testing practices. The hypothesis for this study is that the examination of the…
Descriptors: Difficulty Level, Teaching Methods, Educational Testing, Achievement Gap
Al-A'ali, Mansoor – Educational Technology & Society, 2007
Computer adaptive testing is the study of scoring tests and questions based on assumptions concerning the mathematical relationship between examinees' ability and the examinees' responses. Adaptive student tests, which are based on item response theory (IRT), have many advantages over conventional tests. We use the least square method, a…
Descriptors: Educational Testing, Higher Education, Elementary Secondary Education, Student Evaluation

Peer reviewed
Direct link
