Alpizar, David; Li, Tongyun; Norris, John M.; Gu, Lixiong – Language Testing, 2023
The C-test is a type of gap-filling test designed to efficiently measure second language proficiency. The typical C-test consists of several short paragraphs with the second half of every second word deleted. The words with deleted parts are considered as items nested within the corresponding paragraph. Given this testlet structure, it is commonly…
Descriptors: Psychometrics, Language Tests, Second Language Learning, Test Items
Abu-Ghazalah, Rashid M.; Dubins, David N.; Poon, Gregory M. K. – Applied Measurement in Education, 2023
Multiple choice results are inherently probabilistic outcomes, as correct responses reflect a combination of knowledge and guessing, while incorrect responses additionally reflect blunder, a confidently committed mistake. To objectively resolve knowledge from responses in an MC test structure, we evaluated probabilistic models that explicitly…
Descriptors: Guessing (Tests), Multiple Choice Tests, Probability, Models
Mardiana – Eurasian Journal of Applied Linguistics, 2023
Written inquiries, which are more frequent and have less of a focus on complex thinking, are issues at school. Students are not taught how to respond to questions found in Higher-Order Thinking Skills (HOTS) tests; hence, their thinking abilities are generally weak. The issue for teachers is that neither they nor anyone else has been able to create…
Descriptors: Skill Development, Thinking Skills, Check Lists, Models
Tatarinova, Galiya; Neamah, Nour Raheem; Mohammed, Aisha; Hassan, Aalaa Yaseen; Obaid, Ali Abdulridha; Ismail, Ismail Abdulwahhab; Maabreh, Hatem Ghaleb; Afif, Al Khateeb Nashaat Sultan; Viktorovna, Shvedova Irina – International Journal of Language Testing, 2023
Unidimensionality is an important assumption of measurement but it is violated very often. Most of the time, tests are deliberately constructed to be multidimensional to cover all aspects of the intended construct. In such situations, the application of unidimensional item response theory (IRT) models is not justified due to poor model fit and…
Descriptors: Item Response Theory, Test Items, Language Tests, Correlation
Joo, Seang-Hwane; Lee, Philseok; Stark, Stephen – Journal of Educational Measurement, 2018
This research derived information functions and proposed new scalar information indices to examine the quality of multidimensional forced choice (MFC) items based on the RANK model. We also explored how GGUM-RANK information, latent trait recovery, and reliability varied across three MFC formats: pairs (two response alternatives), triplets (three…
Descriptors: Item Response Theory, Models, Item Analysis, Reliability
Inal, Ebru; Altintas, Kerim Hakan; Dogan, Nuri – International Journal of Assessment Tools in Education, 2018
The Health Belief Model (HBM) is one of the oldest and most recognized conceptual frameworks of health behavior and can be applied to disaster preparedness efforts which focus predominantly on human behavior. The study aims to develop and test the psychometric properties of the General Disaster Preparedness Belief (GDPB) scale based on the HBM. A…
Descriptors: Natural Disasters, Emergency Programs, Health Behavior, Models
Johnson, Martin; Rushton, Nicky – Educational Research, 2019
Background: The development of a set of questions is a central element of examination development, with the validity of an examination resting to a large extent on the quality of the questions that it comprises. This paper reports on the methods and findings of a project that explores how educational examination question writers engage in the…
Descriptors: Writing (Composition), Test Construction, Specialists, Protocol Analysis
Cook, Robert J.; Durning, Steven J. – AERA Online Paper Repository, 2016
In an effort to better align item development to goals of assessing higher-order tasks and decision making, complex decision trees were developed to follow clinical reasoning scripts and used as models on which multiple-choice questions could be built. This approach is compatible with best-practice assessment frameworks like Evidence Centered…
Descriptors: Multiple Choice Tests, Decision Making, Models, Task Analysis
Gierl, Mark J.; Lai, Hollis; Pugh, Debra; Touchie, Claire; Boulais, André-Philippe; De Champlain, André – Applied Measurement in Education, 2016
Item development is a time- and resource-intensive process. Automatic item generation integrates cognitive modeling with computer technology to systematically generate test items. To date, however, items generated using cognitive modeling procedures have received limited use in operational testing situations. As a result, the psychometric…
Descriptors: Psychometrics, Multiple Choice Tests, Test Items, Item Analysis
Kiray, Seyit Ahmet – International Journal of Research in Education and Science, 2016
Today, it is of great importance that teachers have pedagogical and technological knowledge in addition to content knowledge. For this reason, the present study aims to develop a TPACK self-efficacy scale for preservice science teachers by following the theoretical framework of technological pedagogical and content knowledge (TPACK), as suggested…
Descriptors: Technological Literacy, Pedagogical Content Knowledge, Self Efficacy, Test Construction
Bringula, Rex P. – Education and Information Technologies, 2015
This study attempted to develop valid and reliable Capstone Project Attitude Scales (CPAS). Among the scales reviewed, the Modified Fennema-Sherman Mathematics Attitude Scales was adapted in the construction of the CPAS. Usefulness, Confidence, and Gender View were the three subscales of the CPAS. Four hundred sixty-three students answered the…
Descriptors: Program Attitudes, Attitude Measures, Questionnaires, Test Construction
Braeken, Johan – Psychometrika, 2011
Conditional independence is a fundamental principle in latent variable modeling and item response theory. Violations of this principle, commonly known as local item dependencies, are put in a test information perspective, and sharp bounds on these violations are defined. A modeling approach is proposed that makes use of a mixture representation of…
Descriptors: Test Construction, Item Response Theory, Models, Tests
Albano, Anthony D. – Journal of Educational Measurement, 2013
In many testing programs it is assumed that the context or position in which an item is administered does not have a differential effect on examinee responses to the item. Violations of this assumption may bias item response theory estimates of item and person parameters. This study examines the potentially biasing effects of item position. A…
Descriptors: Test Items, Item Response Theory, Test Format, Questioning Techniques
Marek, Lydia I.; Brock, Donna-Jean P.; Savla, Jyoti – American Journal of Evaluation, 2015
Although collaboration is recognized as an effective means to address multifaceted community issues, successful collaboration is difficult to achieve and failure is prevalent. To effectively collaborate, collaborators must recognize the strengths and weaknesses within their own efforts. Using Mattessich and colleagues' work as a springboard, a…
Descriptors: Cooperative Programs, Cooperation, Teamwork, Group Dynamics
Karim, Aidah Abdul; Shah, Parilah M.; Din, Rosseni; Ahmad, Mazalah; Lubis, Maimun Aqhsa – International Education Studies, 2014
This study explored the psychometric properties of a locally developed information skills test for youth students in Malaysia using Rasch analysis. The test was a combination of 24 structured and multiple choice items with a 4-point grading scale. The test was administered to 72 technical college students and 139 secondary school students. The…
Descriptors: Foreign Countries, Information Skills, Item Response Theory, Psychometrics