Publication Date
In 2025 | 1 |
Since 2024 | 1 |
Since 2021 (last 5 years) | 6 |
Since 2016 (last 10 years) | 10 |
Since 2006 (last 20 years) | 15 |
Descriptor
Difficulty Level | 177 |
Higher Education | 177 |
Test Items | 177 |
Multiple Choice Tests | 64 |
Test Construction | 56 |
Item Analysis | 50 |
College Students | 43 |
Test Format | 38 |
Test Reliability | 31 |
Computer Assisted Testing | 27 |
Test Validity | 24 |
More ▼ |
Source
Author
Plake, Barbara S. | 10 |
Wise, Steven L. | 5 |
Tollefson, Nona | 4 |
Douglass, James B. | 3 |
Green, Kathy | 3 |
Rocklin, Thomas | 3 |
Weiten, Wayne | 3 |
Finney, Sara J. | 2 |
Freedle, Roy | 2 |
Green, Kathy E. | 2 |
Halpin, Glennelle | 2 |
More ▼ |
Publication Type
Education Level
Higher Education | 16 |
Postsecondary Education | 12 |
Elementary Secondary Education | 2 |
Secondary Education | 2 |
Elementary Education | 1 |
Junior High Schools | 1 |
Middle Schools | 1 |
Audience
Researchers | 11 |
Practitioners | 4 |
Teachers | 2 |
Location
Canada | 2 |
Germany | 2 |
Africa | 1 |
Australia | 1 |
Brazil | 1 |
California | 1 |
Ethiopia | 1 |
Florida | 1 |
Georgia | 1 |
Germany (Berlin) | 1 |
Indonesia | 1 |
More ▼ |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Dahl, Laura S.; Staples, B. Ashley; Mayhew, Matthew J.; Rockenbach, Alyssa N. – Innovative Higher Education, 2023
Surveys with rating scales are often used in higher education research to measure student learning and development, yet testing and reporting on the longitudinal psychometric properties of these instruments is rare. Rasch techniques allow scholars to map item difficulty and individual aptitude on the same linear, continuous scale to compare…
Descriptors: Surveys, Rating Scales, Higher Education, Educational Research
Ferrara, Steve; Steedle, Jeffrey T.; Frantz, Roger S. – Applied Measurement in Education, 2022
Item difficulty modeling studies involve (a) hypothesizing item features, or item response demands, that are likely to predict item difficulty with some degree of accuracy; and (b) entering the features as independent variables into a regression equation or other statistical model to predict difficulty. In this review, we report findings from 13…
Descriptors: Reading Comprehension, Reading Tests, Test Items, Item Response Theory
Liotino, Marica; Fedeli, Monica; Garone, Anja; Knorn, Steffi; Varagnolo, Damiano; Garone, Emanuele – Commission for International Adult Education, 2021
Formally describing and assessing the difficulty of learning and teaching material is important for quality assurance in university teaching, for aligning teaching and learning activities, and for easing communications among stakeholders such as teachers and students. This paper proposes a novel taxonomy to describe and quantify the difficulty…
Descriptors: Taxonomy, Student Evaluation, Engineering Education, Student Projects
Ika Zenita Ratnaningsih; Unika Prihatsanti; Anggun Resdasari Prasetyo; Bambang Sumintono – Journal of Applied Research in Higher Education, 2025
Purpose: The present study aimed to validate the Indonesian-language version of the psychological capital questionnaire (PCQ), specifically within the context of higher education, by utilising Rasch analysis to evaluate the reliability and validity aspect such as item-fit statistics, rating scale function, and differential item functioning of the…
Descriptors: Foreign Countries, Indonesian Languages, Test Validity, Psychological Characteristics
COVID-19 Lockdown Effects on Student Grades of a University Engineering Course: A Psychometric Study
Santos, Hernan – IEEE Transactions on Education, 2022
Contribution: This article is centered on effects that the COVID-19 lockdown has produced on the student performance in specific engineering course. The study treats to evaluate if the changes in the teaching and student assessment have been suited. Background: Most of higher education courses have had to adapt to this situation and made quick…
Descriptors: COVID-19, Pandemics, Outcomes of Education, Educational Change
Omarov, Nazarbek Bakytbekovich; Mohammed, Aisha; Alghurabi, Ammar Muhi Khleel; Alallo, Hajir Mahmood Ibrahim; Ali, Yusra Mohammed; Hassan, Aalaa Yaseen; Demeuova, Lyazat; Viktorovna, Shvedova Irina; Nazym, Bekenova; Al Khateeb, Nashaat Sultan Afif – International Journal of Language Testing, 2023
The Multiple-choice (MC) item format is commonly used in educational assessments due to its economy and effectiveness across a variety of content domains. However, numerous studies have examined the quality of MC items in high-stakes and higher-education assessments and found many flawed items, especially in terms of distractors. These faulty…
Descriptors: Test Items, Multiple Choice Tests, Item Response Theory, English (Second Language)
Wright, Christian D.; Eddy, Sarah L.; Wenderoth, Mary Pat; Abshire, Elizabeth; Blankenbiller, Margaret; Brownell, Sara E. – CBE - Life Sciences Education, 2016
Recent reform efforts in undergraduate biology have recommended transforming course exams to test at more cognitively challenging levels, which may mean including more cognitively challenging and more constructed-response questions on assessments. However, changing the characteristics of exams could result in bias against historically underserved…
Descriptors: Introductory Courses, Biology, Undergraduate Students, Higher Education
Stiller, Jurik; Hartmann, Stefan; Mathesius, Sabrina; Straube, Philipp; Tiemann, Rüdiger; Nordmeier, Volkhard; Krüger, Dirk; Upmeier zu Belzen, Annette – Assessment & Evaluation in Higher Education, 2016
The aim of this study was to improve the criterion-related test score interpretation of a text-based assessment of scientific reasoning competencies in higher education by evaluating factors which systematically affect item difficulty. To provide evidence about the specific demands which test items of various difficulty make on pre-service…
Descriptors: Logical Thinking, Scientific Concepts, Difficulty Level, Test Items
Kirschner, Sophie; Borowski, Andreas; Fischer, Hans E.; Gess-Newsome, Julie; von Aufschnaiter, Claudia – International Journal of Science Education, 2016
Teachers' professional knowledge is assumed to be a key variable for effective teaching. As teacher education has the goal to enhance professional knowledge of current and future teachers, this knowledge should be described and assessed. Nevertheless, only a limited number of studies quantitatively measures physics teachers' professional…
Descriptors: Evaluation Methods, Tests, Test Format, Science Instruction
Uzuner Yurt, Serap; Aktas, Elif – Educational Research and Reviews, 2016
In this study, the effects of the use of peer tutoring in Effective and Good Speech Course on students' success, perception of speech self-efficacy and speaking skills were examined. The study, designed as a mixed pattern in which quantitative and qualitative research approaches were combined, was carried out together with 57 students in 2014 to…
Descriptors: Peer Teaching, Tutoring, Higher Education, College Students
Fugard, Andrew J. B.; Stewart, Mary E.; Stenning, Keith – Autism: The International Journal of Research and Practice, 2011
People with autism spectrum condition (ASC) perform well on Raven's matrices, a test which loads highly on the general factor in intelligence. However, the mechanisms supporting enhanced performance on the test are poorly understood. Evidence is accumulating that milder variants of the ASC phenotype are present in typically developing individuals,…
Descriptors: Evidence, College Students, Autism, Prediction
Ariel, Robert; Dunlosky, John; Bailey, Heather – Journal of Experimental Psychology: General, 2009
Theories of self-regulated study assume that learners monitor item difficulty when making decisions about which items to select for study. To complement such theories, the authors propose an agenda-based regulation (ABR) model in which learners' study decisions are guided by an agenda that learners develop to prioritize items for study, given…
Descriptors: Test Items, Time Management, Item Analysis, Rewards

Plake, Barbara S. – Journal of Experimental Education, 1980
Three-item orderings and two levels of knowledge of ordering were used to study differences in test results, student's perception of the test's fairness and difficulty, and student's estimation of test performance. No significant order effect was found. (Author/GK)
Descriptors: Difficulty Level, Higher Education, Scores, Test Format
Content Characteristics of GRE Analytical Reasoning Items. GRE Board Professional Report No. 84-14P.
Chalifour, Clark; Powers, Donald E. – 1988
In actual test development practice, the number of test items that must be developed and pretested is typically greater, and sometimes much greater, than the number eventually judged suitable for use in operational test forms. This has proven to be especially true for analytical reasoning items, which currently form the bulk of the analytical…
Descriptors: Coding, Difficulty Level, Higher Education, Test Construction

Vidler, Derek; Hansen, Richard – Journal of Experimental Education, 1980
Relationships among patterns of answer changing and item characteristics on multiple-choice tests are discussed. Results obtained were similar to those found in previous studies but pointed to further relationships among these variables. (Author/GK)
Descriptors: College Students, Difficulty Level, Higher Education, Multiple Choice Tests