Zavala, Genaro; Tejeda, Santa; Barniol, Pablo; Beichner, Robert J. – Physical Review Physics Education Research, 2017
In this article, we present several modifications to the Test of Understanding Graphs in Kinematics. The most significant changes are (i) the addition and removal of items to achieve parallelism in the objectives (dimensions) of the test, thus allowing comparisons of students' performance that were not possible with the original version, and (ii)…
Descriptors: Graphs, Test Items, Test Construction, Science Tests
Chen, Guanhua – ProQuest LLC, 2018
This study is part of a larger design study that iteratively improves a robotics programming curriculum as well as a computational thinking (CT) instrument. Its focus was mainly on CT assessment and particularly on an online CT instrument with logging functionality that can store a student's problem-solving process by recording interactions…
Descriptors: Elementary School Students, Test Construction, Cognitive Tests, Computer Assisted Testing
Stoffel, Heather; Raymond, Mark R.; Bucak, S. Deniz; Haist, Steven A. – Practical Assessment, Research & Evaluation, 2014
Previous research on the impact of text and formatting changes on test-item performance has produced mixed results. This matter is important because it is generally acknowledged that "any" change to an item requires that it be recalibrated. The present study investigated the effects of seven classes of stylistic changes on item…
Descriptors: Test Construction, Test Items, Standardized Tests, Physicians
Sadler, Philip M.; Coyle, Harold; Cook Smith, Nancy; Miller, Jaimie; Mintzes, Joel; Tanner, Kimberly; Murray, John – CBE - Life Sciences Education, 2013
We report on the development of an item test bank and associated instruments based on the National Research Council (NRC) K-8 life sciences content standards. Utilizing hundreds of studies in the science education research literature on student misconceptions, we constructed 476 unique multiple-choice items that measure the degree to which test…
Descriptors: National Standards, Knowledge Level, Biological Sciences, Item Banks
Bristow, M.; Erkorkmaz, K.; Huissoon, J. P.; Jeon, Soo; Owen, W. S.; Waslander, S. L.; Stubley, G. D. – IEEE Transactions on Education, 2012
Any meaningful initiative to improve the teaching and learning in introductory control systems courses needs a clear test of student conceptual understanding to determine the effectiveness of proposed methods and activities. The authors propose a control systems concept inventory. Development of the inventory was collaborative and iterative. The…
Descriptors: Diagnostic Tests, Concept Formation, Undergraduate Students, Engineering Education
Razi, Salim – Online Submission, 2012
This study presents the processes of developing and establishing reliability and validity of a reading test by administering an integrative approach, as conventional reliability and validity measures only superficially reveal the difficulty of a reading test. In this respect, analysing vocabulary frequency of the test is regarded as a more eligible way…
Descriptors: Foreign Countries, Undergraduate Students, Reading Tests, Test Validity
Kolloffel, Bas; Eysink, Tessa H. S.; de Jong, Ton; Wilhelm, Pascal – Instructional Science: An International Journal of the Learning Sciences, 2009
The current study investigated the effects of different external representational formats on learning combinatorics and probability theory in an inquiry based learning environment. Five conditions were compared in a pre-test post-test design: three conditions each using a single external representational format (Diagram, Arithmetic, or Text), and…
Descriptors: Computer Simulation, Inquiry, Active Learning, Cognitive Processes
DiBartolomeo, Matthew – ProQuest LLC, 2010
Multiple factors have influenced testing agencies to more carefully consider the manner and frequency in which pretest item data are collected and analyzed. One potentially promising development is judges' estimates of item difficulty. Accurate estimates of item difficulty may be used to reduce pretest sample sizes, supplement insufficient…
Descriptors: Test Items, Group Discussion, Athletics, Pretests Posttests

Haladyna, Tom; Roid, Gale – Journal of Educational Measurement, 1981
The rationale for use of instructional sensitivity in the empirical review of test items is examined, and the results of a study that distinguishes instructional sensitivity from other item concepts are presented. Research is reviewed which indicates the existence of instructional sensitivity as a unique criterion-referenced test item concept. (RL)
Descriptors: Criterion Referenced Tests, Difficulty Level, Evaluation Criteria, Pretests Posttests
Lievens, Filip; Sackett, Paul R. – Journal of Applied Psychology, 2007
This study used principles underlying item generation theory to posit competing perspectives about which features of situational judgment tests might enhance or impede consistent measurement across repeat test administrations. This led to 3 alternate-form development approaches (random assignment, incident isomorphism, and item isomorphism). The…
Descriptors: Validity, High Stakes Tests, Test Construction, Testing
Sheehan, Kathleen; Mislevy, Robert J. – 1994
The operating characteristics of 114 mathematics pretest items from the Praxis I: Computer Based Test were analyzed in terms of item attributes and test developers' judgments of item difficulty. Item operating characteristics were defined as the difficulty, discrimination, and asymptote parameters of a three parameter logistic item response theory…
Descriptors: Basic Skills, Computer Assisted Testing, Difficulty Level, Educational Assessment
Roid, Gale; And Others – 1980
Using informal, objectives-based, or linguistic methods, three elementary school teachers and three experienced item writers developed criterion-referenced pretests-posttests to accompany a prose passage. Item difficulties were tabulated on the responses of 364 elementary students. The informal-subjective method, used by many achievement test…
Descriptors: Criterion Referenced Tests, Difficulty Level, Elementary Education, Elementary School Teachers
Haladyna, Tom; Roid, Gale – 1980
An empirical review of test items is described as an essential step in criterion-referenced test development. The concept of test items' instructional sensitivity is introduced, and research is briefly reviewed which describes four theoretical contexts in which instructional sensitivity indexes have been observed: criterion-referenced; classical…
Descriptors: Achievement Tests, Bayesian Statistics, Course Objectives, Criterion Referenced Tests