Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 10 |
Since 2016 (last 10 years) | 21 |
Since 2006 (last 20 years) | 59 |
Descriptor
Source
Author
Tindal, Gerald | 4 |
Alonzo, Julie | 3 |
Kubinger, Klaus D. | 3 |
Camilli, Gregory | 2 |
Cawthon, Stephanie W. | 2 |
De Boeck, Paul | 2 |
Linacre, John M. | 2 |
Liu, Kimy | 2 |
Prowker, Adam | 2 |
Revuelta, Javier | 2 |
Acquaye, Rosemary | 1 |
More ▼ |
Publication Type
Reports - Descriptive | 82 |
Journal Articles | 58 |
Speeches/Meeting Papers | 6 |
Numerical/Quantitative Data | 3 |
Tests/Questionnaires | 3 |
Computer Programs | 2 |
Collected Works - Serials | 1 |
Education Level
Audience
Teachers | 4 |
Policymakers | 3 |
Practitioners | 1 |
Location
Canada | 2 |
Australia | 1 |
Austria | 1 |
Belgium | 1 |
California | 1 |
Florida | 1 |
Greece | 1 |
Ireland | 1 |
Japan | 1 |
Norway | 1 |
Saudi Arabia | 1 |
More ▼ |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Gyamfi, Abraham; Acquaye, Rosemary – Acta Educationis Generalis, 2023
Introduction: Item response theory (IRT) has received much attention in validation of assessment instrument because it allows the estimation of students' ability from any set of the items. Item response theory allows the difficulty and discrimination levels of each item on the test to be estimated. In the framework of IRT, item characteristics are…
Descriptors: Item Response Theory, Models, Test Items, Difficulty Level
Bolt, Daniel M.; Liao, Xiangyi – Journal of Educational Measurement, 2021
We revisit the empirically observed positive correlation between DIF and difficulty studied by Freedle and commonly seen in tests of verbal proficiency when comparing populations of different mean latent proficiency levels. It is shown that a positive correlation between DIF and difficulty estimates is actually an expected result (absent any true…
Descriptors: Test Bias, Difficulty Level, Correlation, Verbal Tests
Haladyna, Thomas M.; Rodriguez, Michael C. – Educational Assessment, 2021
Full-information item analysis provides item developers and reviewers comprehensive empirical evidence of item quality, including option response frequency, point-biserial index (PBI) for distractors, mean-scores of respondents selecting each option, and option trace lines. The multi-serial index (MSI) is introduced as a more informative…
Descriptors: Test Items, Item Analysis, Reading Tests, Mathematics Tests
Dahl, Laura S.; Staples, B. Ashley; Mayhew, Matthew J.; Rockenbach, Alyssa N. – Innovative Higher Education, 2023
Surveys with rating scales are often used in higher education research to measure student learning and development, yet testing and reporting on the longitudinal psychometric properties of these instruments is rare. Rasch techniques allow scholars to map item difficulty and individual aptitude on the same linear, continuous scale to compare…
Descriptors: Surveys, Rating Scales, Higher Education, Educational Research
Wellberg, Sarah – Assessment in Education: Principles, Policy & Practice, 2023
Classroom assessment research in the United States has shifted away from the examination of teacher-made tests, but such tests are still widely used and have an enormous impact on students' educational experiences. Given the major shifts in educational policy in the United States, including the widespread adoption of the Common Core State…
Descriptors: Teacher Made Tests, Mathematics Tests, Common Core State Standards, Test Items
Pelanek, Radek – Journal of Learning Analytics, 2021
In this work, we consider learning analytics for primary and secondary schools from the perspective of the designer of a learning system. We provide an overview of practically useful analytics techniques with descriptions of their applications and specific illustrations. We highlight data biases and caveats that complicate the analysis and its…
Descriptors: Learning Analytics, Elementary Schools, Secondary Schools, Educational Technology
Achieve, Inc., 2019
In 2013, the Council of Chief State School Officers (CCSSO), working collaboratively with state education agencies, released a set of criteria for states to use to evaluate and procure high-quality assessments. The mathematics section of the document included five content-specific criteria to evaluate alignment of assessments to college- and…
Descriptors: Mathematics Tests, Difficulty Level, Evaluation Criteria, Cognitive Processes
DeCarlo, Lawrence T. – Journal of Educational Measurement, 2021
In a signal detection theory (SDT) approach to multiple choice exams, examinees are viewed as choosing, for each item, the alternative that is perceived as being the most plausible, with perceived plausibility depending in part on whether or not an item is known. The SDT model is a process model and provides measures of item difficulty, item…
Descriptors: Perception, Bias, Theories, Test Items
Krzic, Maja; Brown, Sandra – Natural Sciences Education, 2022
The transition of our large ([approximately]300 student) introductory soil science course to the online setting created several challenges, including engaging first- and second-year students, providing meaningful hands-on learning activities, and setting up online exams. The objective of this paper is to describe the development and use of…
Descriptors: Introductory Courses, Social Sciences, Online Courses, Educational Change
Item Order and Speededness: Implications for Test Fairness in Higher Educational High-Stakes Testing
Becker, Benjamin; van Rijn, Peter; Molenaar, Dylan; Debeer, Dries – Assessment & Evaluation in Higher Education, 2022
A common approach to increase test security in higher educational high-stakes testing is the use of different test forms with identical items but different item orders. The effects of such varied item orders are relatively well studied, but findings have generally been mixed. When multiple test forms with different item orders are used, we argue…
Descriptors: Information Security, High Stakes Tests, Computer Security, Test Items
Stewart, Gail; Strachan, Andrea – TESL Canada Journal, 2022
Since its implementation in 2004, the Canadian English Language Benchmark Assessment for Nurses (CELBAN) has been accepted as evidence of language ability for licensure of internationally educated nurses (IENs) in Canada. This article focuses on the complexities of sustaining an occupation-specific assessment over time. The authors reference the…
Descriptors: Language Tests, English for Special Purposes, Benchmarking, Nurses
Sinharay, Sandip – Educational Measurement: Issues and Practice, 2018
The choice of anchor tests is crucial in applications of the nonequivalent groups with anchor test design of equating. Sinharay and Holland (2006, 2007) suggested "miditests," which are anchor tests that are content-representative and have the same mean item difficulty as the total test but have a smaller spread of item difficulties.…
Descriptors: Test Content, Difficulty Level, Test Items, Test Construction
El Rahman, Sahar Abd; Zolait, Ali Hussein – International Journal of Web-Based Learning and Teaching Technologies, 2019
This article describes how with the advent of computer-based technology, there is movement from manual to automated systems for different aspects of the education system. Testing is an essential part of teaching process that helps academics in classifying the level of students and evaluating the outcomes of their teaching process. The testing…
Descriptors: Test Items, Computer Uses in Education, Computers, Web Based Instruction
Shoufan, Abdulhadi – IEEE Transactions on Education, 2017
The concept of intrinsic complexity explains why different problems of the same type, tackled by the same problem solver, can require different times to solve and yield solutions of different quality. This paper proposes a general four-step approach that can be used to establish a model for the intrinsic complexity of a problem class in terms of…
Descriptors: Test Items, Difficulty Level, Problem Solving, Models
Achieve, Inc., 2019
Assessment is a key lever for educational improvement. Assessments can be used to monitor, signal, and influence science teaching and learning -- provided that they are of high quality, reflect the rigor and intent of academic standards, and elicit meaningful student performances. Since the release of "A Framework for K-12 Science…
Descriptors: Difficulty Level, Evaluation Criteria, Cognitive Processes, Test Items