Publication Date
In 2025 | 1 |
Since 2024 | 1 |
Since 2021 (last 5 years) | 5 |
Since 2016 (last 10 years) | 13 |
Since 2006 (last 20 years) | 24 |
Descriptor
Comparative Analysis | 40 |
Difficulty Level | 40 |
Test Reliability | 40 |
Test Items | 27 |
Test Validity | 18 |
Foreign Countries | 15 |
Correlation | 10 |
Test Format | 10 |
Multiple Choice Tests | 9 |
Higher Education | 7 |
Item Analysis | 7 |
More ▼ |
Source
Author
Bauer, Daniel | 2 |
Fischer, Martin R. | 2 |
Hansen, Duncan N. | 2 |
Lubiano, Michael Leonard D. | 2 |
Magpantay, Marife S. | 2 |
Winke, Paula | 2 |
Ahn, Jieun Irene | 1 |
Aktas, Elif | 1 |
Aleyna Altan | 1 |
Alpayar, Cagla | 1 |
Arth, Thomas O. | 1 |
More ▼ |
Publication Type
Reports - Research | 32 |
Journal Articles | 24 |
Speeches/Meeting Papers | 8 |
Tests/Questionnaires | 5 |
Reports - Descriptive | 4 |
Reports - Evaluative | 3 |
Collected Works - Proceedings | 1 |
Collected Works - Serials | 1 |
Education Level
Audience
Practitioners | 1 |
Teachers | 1 |
Location
Philippines | 3 |
United Kingdom | 3 |
United States | 3 |
Florida | 2 |
Germany | 2 |
Japan | 2 |
South Korea | 2 |
Turkey | 2 |
Asia | 1 |
Australia | 1 |
Brazil | 1 |
More ▼ |
Laws, Policies, & Programs
Assessments and Surveys
Defining Issues Test | 1 |
Embedded Figures Test | 1 |
Graduate Record Examinations | 1 |
Rosenberg Self Esteem Scale | 1 |
SAT (College Admission Test) | 1 |
Test of English as a Foreign… | 1 |
What Works Clearinghouse Rating
Reza Shahi; Hamdollah Ravand; Golam Reza Rohani – International Journal of Language Testing, 2025
The current paper intends to exploit the Many Facet Rasch Model to investigate and compare the impact of situations (items) and raters on test takers' performance on the Written Discourse Completion Test (WDCT) and Discourse Self-Assessment Tests (DSAT). In this study, the participants were 110 English as a Foreign Language (EFL) students at…
Descriptors: Comparative Analysis, English (Second Language), Second Language Learning, Second Language Instruction
Benton, Tom – Research Matters, 2021
Computer adaptive testing is intended to make assessment more reliable by tailoring the difficulty of the questions a student has to answer to their level of ability. Most commonly, this benefit is used to justify the length of tests being shortened whilst retaining the reliability of a longer, non-adaptive test. Improvements due to adaptive…
Descriptors: Risk, Item Response Theory, Computer Assisted Testing, Difficulty Level
Aleyna Altan; Zehra Taspinar Sener – Online Submission, 2023
This research aimed to develop a valid and reliable test to be used to detect sixth grade students' misconceptions and errors regarding the subject of fractions. A misconception diagnostic test has been developed that includes the concept of fractions, different representations of fractions, ordering and comparing fractions, equivalence of…
Descriptors: Diagnostic Tests, Mathematics Tests, Fractions, Misconceptions
Lubiano, Michael Leonard D.; Magpantay, Marife S. – International Journal of Research in Education and Science, 2021
This study enhanced the 7E instructional model towards enriching the science inquiry skills of senior high school learners in General Chemistry 1. A total of 136 Grade 12 learners enrolled in the Science, Technology, Engineering, and Mathematics (STEM) strand participated in the study. The study was composed of three phases. In Phase I, the…
Descriptors: Science Instruction, Teaching Methods, Inquiry, High School Students
Lubiano, Michael Leonard D.; Magpantay, Marife S. – International Society for Technology, Education, and Science, 2021
This study enhanced the 7E instructional model towards enriching the science inquiry skills of senior high school learners in General Chemistry 1. A total of 136 Grade 12 learners enrolled in the Science, Technology, Engineering, and Mathematics (STEM) strand participated in the study. The study was composed of three phases. In Phase I, the…
Descriptors: Foreign Countries, Science Instruction, Teaching Methods, Inquiry
Lahner, Felicitas-Maria; Lörwald, Andrea Carolin; Bauer, Daniel; Nouns, Zineb Miriam; Krebs, René; Guttormsen, Sissel; Fischer, Martin R.; Huwendiek, Sören – Advances in Health Sciences Education, 2018
Multiple true-false (MTF) items are a widely used supplement to the commonly used single-best answer (Type A) multiple choice format. However, an optimal scoring algorithm for MTF items has not yet been established, as existing studies yielded conflicting results. Therefore, this study analyzes two questions: What is the optimal scoring algorithm…
Descriptors: Scoring Formulas, Scoring Rubrics, Objective Tests, Multiple Choice Tests
Yang, Eunbae B.; Lee, Myung Ae; Park, Yoon Soo – Advances in Health Sciences Education, 2018
In 2012, the National Health Personnel Licensing Examination Board of Korea decided to publicly disclose all test items and answers to satisfy the test takers' right to know and enhance the transparency of tests administered by the government. This study investigated the effects of item disclosure on the medical licensing examination (MLE),…
Descriptors: Certification, Foreign Countries, Test Items, Disclosure
Asikainen, Mervi A. – EURASIA Journal of Mathematics, Science & Technology Education, 2017
The study investigated the use of Quantum Physics Conceptual Survey (QPCS) in probing student understanding of quantum physics. Altogether 103 Finnish university students responded to QPCS. The mean scores of the student responses were calculated and the test was evaluated using common five indices: Item difficulty index, Item discrimination…
Descriptors: Quantum Mechanics, Physics, College Students, Student Surveys
McColgan, Michele W.; Finn, Rose A.; Broder, Darren L.; Hassel, George E. – Physical Review Physics Education Research, 2017
We present the Electricity and Magnetism Conceptual Assessment (EMCA), a new assessment aligned with second-semester introductory physics courses. Topics covered include electrostatics, electric fields, circuits, magnetism, and induction. We have two motives for writing a new assessment. First, we find other assessments such as the Brief…
Descriptors: Energy, Magnets, Scientific Concepts, Student Evaluation
Hamby, Tyler; Taylor, Wyn – Educational and Psychological Measurement, 2016
This study examined the predictors and psychometric outcomes of survey satisficing, wherein respondents provide quick, "good enough" answers (satisficing) rather than carefully considered answers (optimizing). We administered surveys to university students and respondents--half of whom held college degrees--from a for-pay survey website,…
Descriptors: Surveys, Test Reliability, Test Validity, Comparative Analysis
Winke, Paula; Lee, Shinhye; Ahn, Jieun Irene; Choi, Ina; Cui, Yaqiong; Yoon, Hyung-Jo – TESOL Quarterly: A Journal for Teachers of English to Speakers of Other Languages and of Standard English as a Second Dialect, 2018
This study investigated the cognitive validity of two child English language tests. Some teachers maintain that these types of tests may be cognitively invalid because native-English-speaking children would not do well on them (Winke, 2011). So the researchers had native speakers and learners of English aged 7 to 9 take sample versions of two…
Descriptors: Language Tests, English, English (Second Language), Second Language Learning
Culligan, Brent – Language Testing, 2015
This study compared three common vocabulary test formats, the Yes/No test, the Vocabulary Knowledge Scale (VKS), and the Vocabulary Levels Test (VLT), as measures of vocabulary difficulty. Vocabulary difficulty was defined as the item difficulty estimated through Item Response Theory (IRT) analysis. Three tests were given to 165 Japanese students,…
Descriptors: Language Tests, Test Format, Comparative Analysis, Vocabulary
Alpayar, Cagla; Gulleroglu, H. Deniz – Educational Research and Reviews, 2017
The aim of this research is to determine whether students' test performance and approaches to test questions change based on the type of mathematics questions (visual or verbal) administered to them. This research is based on a mixed-design model. The quantitative data are gathered from 297 seventh grade students, attending seven different middle…
Descriptors: Foreign Countries, Middle School Students, Grade 7, Student Evaluation
Schroeders, Ulrich; Robitzsch, Alexander; Schipolowski, Stefan – Journal of Educational Measurement, 2014
C-tests are a specific variant of cloze tests that are considered time-efficient, valid indicators of general language proficiency. They are commonly analyzed with models of item response theory assuming local item independence. In this article we estimated local interdependencies for 12 C-tests and compared the changes in item difficulties,…
Descriptors: Comparative Analysis, Psychometrics, Cloze Procedure, Language Tests
Mitchell, Alison M.; Truckenmiller, Adrea; Petscher, Yaacov – Communique, 2015
As part of the Race to the Top initiative, the United States Department of Education made nearly 1 billion dollars available in State Educational Technology grants with the goal of ramping up school technology. One result of this effort is that states, districts, and schools across the country are using computerized assessments to measure their…
Descriptors: Computer Assisted Testing, Educational Technology, Testing, Efficiency