Publication Date
In 2025 | 1 |
Since 2024 | 1 |
Since 2021 (last 5 years) | 12 |
Since 2016 (last 10 years) | 27 |
Since 2006 (last 20 years) | 38 |
Descriptor
Difficulty Level | 57 |
Scores | 57 |
Test Construction | 57 |
Test Items | 41 |
Foreign Countries | 20 |
Test Validity | 17 |
Test Reliability | 16 |
Multiple Choice Tests | 15 |
Item Analysis | 13 |
Item Response Theory | 11 |
Statistical Analysis | 10 |
More ▼ |
Source
Author
Lord, Frederic M. | 2 |
Adriana J. Lagier | 1 |
Anderson, Paul S. | 1 |
Barak, Moshe | 1 |
Bejar, Isaac I. | 1 |
Berrin Köseoglu | 1 |
Bishop, Pamela R. | 1 |
Boyer, Michelle | 1 |
Bristow, M. | 1 |
Chen, Jing | 1 |
Cizek, Gregory J. | 1 |
More ▼ |
Publication Type
Reports - Research | 42 |
Journal Articles | 37 |
Dissertations/Theses -… | 5 |
Reports - Evaluative | 5 |
Speeches/Meeting Papers | 5 |
Tests/Questionnaires | 5 |
Numerical/Quantitative Data | 3 |
Reports - Descriptive | 2 |
Guides - General | 1 |
Education Level
Higher Education | 17 |
Postsecondary Education | 15 |
Secondary Education | 5 |
Elementary Education | 4 |
Elementary Secondary Education | 3 |
Middle Schools | 3 |
Grade 12 | 2 |
Grade 4 | 2 |
Grade 5 | 2 |
Grade 6 | 2 |
Grade 8 | 2 |
More ▼ |
Audience
Researchers | 3 |
Policymakers | 1 |
Teachers | 1 |
Location
Indonesia | 2 |
Turkey | 2 |
Alabama | 1 |
Belgium | 1 |
Canada | 1 |
China | 1 |
China (Beijing) | 1 |
Colorado | 1 |
Germany | 1 |
Indiana | 1 |
Iran | 1 |
More ▼ |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Onur Dönmez; Yavuz Akbulut; Gözde Zabzun; Berrin Köseoglu – Applied Cognitive Psychology, 2025
This study investigates the effect of survey order in measuring self-reported cognitive load. Understanding how survey order influences responses is crucial, but it has been largely overlooked in the context of cognitive load. Using a 2 × 2 experimental design with 319 high school students, the study manipulated intrinsic cognitive load (ICL)…
Descriptors: Surveys, Test Construction, Measurement, Cognitive Processes
Thompson, Kathryn N. – ProQuest LLC, 2023
It is imperative to collect validity evidence prior to interpreting and using test scores. During the process of collecting validity evidence, test developers should consider whether test scores are contaminated by sources of extraneous information. This is referred to as construct irrelevant variance, or the "degree to which test scores are…
Descriptors: Test Wiseness, Test Items, Item Response Theory, Scores
Rodriguez, Rebekah M.; Silvia, Paul J.; Kaufman, James C.; Reiter-Palmon, Roni; Puryear, Jeb S. – Creativity Research Journal, 2023
The original 90-item Creative Behavior Inventory (CBI) was a landmark self-report scale in creativity research, and the 28-item brief form developed nearly 20 years ago continues to be a popular measure of everyday creativity. Relatively little is known, however, about the psychometric properties of this widely used scale. In the current research,…
Descriptors: Creativity Tests, Creativity, Creative Thinking, Psychometrics
Saepuzaman, Duden; Istiyono, Edi; Haryanto – Pegem Journal of Education and Instruction, 2022
HOTS is one part of the skills that need to be developed in the 21st Century . This study aims to determine the characteristics of the Fundamental Physics Higher-order Thinking Skill (FundPhysHOTS) test for prospective physics teachers using Item Response Theory (IRT) analysis. This study uses a quantitative approach. 254 prospective physics…
Descriptors: Thinking Skills, Physics, Science Process Skills, Cognitive Tests
Mohammed Ambusaidi – ProQuest LLC, 2022
There is an increased demand on nursing faculty to provide quality teaching and assessment. Nursing faculty are required to ensure accurate assessment of learning through testing and outcome measurement that are critical elements of the evaluation process. Likewise, nursing faculty should implement a logical evaluation system. However, the…
Descriptors: Nursing Education, College Faculty, Test Construction, Test Validity
Rahmawati, Laili Etika; Sulistyono, Yunus – Asian Journal of University Education, 2021
Nowadays, text readability is of great importance. Simple but very often ignored, readability statistics can provide information about the level of difficulty of the readability of particular documents and increase an evaluator's credibility. Hence, this research aims to examine the readability index of the test instrument for BIPA (Bahasa…
Descriptors: Reading Tests, Readability, Reading Achievement, Test Construction
Adriana J. Lagier – Bioscene: Journal of College Biology Teaching, 2022
In courses with a heterogeneous student population, instructors are often challenged to balance successful course completion with rigor. This difficult task can be confounded in foundational, gateway courses, such as introductory biology, which serves a mix of freshman majors at various levels of preparedness. Research suggests that changes in…
Descriptors: Introductory Courses, Biology, Science Instruction, Science Tests
Designing Computer-Based Tests: Design Guidelines from Multimedia Learning Studied with Eye Tracking
Dirkx, K. J. H.; Skuballa, I.; Manastirean-Zijlstra, C. S.; Jarodzka, H. – Instructional Science: An International Journal of the Learning Sciences, 2021
The use of computer-based tests (CBTs), for both formative and summative purposes, has greatly increased over the past years. One major advantage of CBTs is the easy integration of multimedia. It is unclear, though, how to design such CBT environments with multimedia. The purpose of the current study was to examine whether guidelines for designing…
Descriptors: Test Construction, Computer Assisted Testing, Multimedia Instruction, Eye Movements
Guo, Hongwen; Zu, Jiyun; Kyllonen, Patrick – ETS Research Report Series, 2018
For a multiple-choice test under development or redesign, it is important to choose the optimal number of options per item so that the test possesses the desired psychometric properties. On the basis of available data for a multiple-choice assessment with 8 options, we evaluated the effects of changing the number of options on test properties…
Descriptors: Multiple Choice Tests, Test Items, Simulation, Test Construction
Keng, Leslie; Boyer, Michelle – National Center for the Improvement of Educational Assessment, 2020
ACT requested assistance from the National Center for the Improvement of Educational Assessment (Center for Assessment) to investigate declines of scores for states administering the ACT to its 11th grade students in 2018. This request emerged from conversations among state leaders, the Center for Assessment, and ACT in trying to understand the…
Descriptors: College Entrance Examinations, Scores, Test Score Decline, Educational Trends
Lina Anaya; Nagore Iriberri; Pedro Rey-Biel; Gema Zamarro – Annenberg Institute for School Reform at Brown University, 2021
Standardized assessments are widely used to determine access to educational resources with important consequences for later economic outcomes in life. However, many design features of the tests themselves may lead to psychological reactions influencing performance. In particular, the level of difficulty of the earlier questions in a test may…
Descriptors: Test Construction, Test Wiseness, Test Items, Difficulty Level
Lindner, Marlit A.; Schult, Johannes; Mayer, Richard E. – Journal of Educational Psychology, 2022
This classroom experiment investigates the effects of adding representational pictures to multiple-choice and constructed-response test items to understand the role of the response format for the multimedia effect in testing. Participants were 575 fifth- and sixth-graders who answered 28 science test items--seven items in each of four experimental…
Descriptors: Elementary School Students, Grade 5, Grade 6, Multimedia Materials
Yuksel, Ibrahim; Savas, Muhammed Ali – Asian Journal of Education and Training, 2019
In this research, it is aimed to develop a valid and reliable test to determine the drawing a shape-schema and making a table levels of prospective teachers at Mathematics and Science Education, Turkish and Social Sciences Education and Basic Education Departments. In this process, a comprehensive item pool has been prepared with the table of…
Descriptors: Preservice Teachers, Item Banks, Test Validity, Foreign Countries
Yunjiu, Luo; Wei, Wei; Zheng, Ying – SAGE Open, 2022
Artificial intelligence (AI) technologies have the potential to reduce the workload for the second language (L2) teachers and test developers. We propose two AI distractor-generating methods for creating Chinese vocabulary items: semantic similarity and visual similarity. Semantic similarity refers to antonyms and synonyms, while visual similarity…
Descriptors: Chinese, Vocabulary Development, Artificial Intelligence, Undergraduate Students
Liao, Linyu – English Language Teaching, 2020
As a high-stakes standardized test, IELTS is expected to have comparable forms of test papers so that test takers from different test administration on different dates receive comparable test scores. Therefore, this study examined the text difficulty and task characteristics of four parallel academic IELTS reading tests to reveal to what extent…
Descriptors: Second Language Learning, English (Second Language), Language Tests, High Stakes Tests