Inga Laukaityte; Marie Wiberg – Practical Assessment, Research & Evaluation, 2024
The overall aim was to examine effects of differences in group ability and features of the anchor test form on equating bias and the standard error of equating (SEE) using both real and simulated data. Chained kernel equating, Poststratification kernel equating, and Circle-arc equating were studied. A college admissions test with four different…
Descriptors: Ability Grouping, Test Items, College Entrance Examinations, High Stakes Tests
Lozano, José H.; Revuelta, Javier – Applied Measurement in Education, 2021
The present study proposes a Bayesian approach for estimating and testing the operation-specific learning model, a variant of the linear logistic test model that allows for the measurement of the learning that occurs during a test as a result of the repeated use of the operations involved in the items. The advantages of using a Bayesian framework…
Descriptors: Bayesian Statistics, Computation, Learning, Testing
Luke G. Eglington; Philip I. Pavlik – Grantee Submission, 2020
Decades of research has shown that spacing practice trials over time can improve later memory, but there are few concrete recommendations concerning how to optimally space practice. We show that existing recommendations are inherently suboptimal due to their insensitivity to time costs and individual- and item-level differences. We introduce an…
Descriptors: Scheduling, Drills (Practice), Memory, Testing
Luke G. Eglington; Philip I. Pavlik Jr. – npj Science of Learning, 2020
Decades of research has shown that spacing practice trials over time can improve later memory, but there are few concrete recommendations concerning how to optimally space practice. We show that existing recommendations are inherently suboptimal due to their insensitivity to time costs and individual- and item-level differences. We introduce an…
Descriptors: Scheduling, Drills (Practice), Memory, Testing
Tullis, Jonathan G.; Fiechter, Joshua L.; Benjamin, Aaron S. – Journal of Experimental Psychology: Learning, Memory, and Cognition, 2018
Practice tests provide large mnemonic benefits over restudying, but learners judge practice tests as less effective than restudying. Consequently, learners infrequently utilize testing when controlling their study and often choose to be tested only on well-learned items. In 5 experiments, we examined whether learners' choices about testing and…
Descriptors: Testing, Review (Reexamination), Selection, Memory
Pawade, Yogesh R.; Diwase, Dipti S. – Journal of Educational Technology, 2016
Item analysis of Multiple Choice Questions (MCQs) is the process of collecting, summarizing and utilizing information from students' responses to evaluate the quality of test items. Difficulty Index (p-value), Discrimination Index (DI) and Distractor Efficiency (DE) are the parameters which help to evaluate the quality of MCQs used in an…
Descriptors: Test Items, Item Analysis, Multiple Choice Tests, Curriculum Development
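The three indices named in the abstract above can be illustrated with a minimal sketch. It is not taken from the article: the function name `item_analysis`, the 27% upper/lower grouping, and the 5% cutoff for a "functional" distractor are assumptions based on common item-analysis conventions, and the response data in the usage note are made up for illustration.

```python
from typing import Dict, List, Sequence


def item_analysis(
    responses: Sequence[Sequence[str]],
    key: Sequence[str],
    options: str = "ABCD",
    group_frac: float = 0.27,      # assumed size of the upper and lower groups
    nfd_threshold: float = 0.05,   # assumed cutoff for a functional distractor
) -> List[Dict[str, float]]:
    """Return difficulty, discrimination, and distractor efficiency per item."""
    n = len(responses)
    # Total score per student: number of answers that match the key.
    scores = [sum(r == k for r, k in zip(row, key)) for row in responses]
    # Rank students from highest to lowest score (stable for ties).
    order = sorted(range(n), key=lambda i: scores[i], reverse=True)
    g = max(1, round(group_frac * n))
    upper, lower = order[:g], order[-g:]

    results = []
    for j, correct in enumerate(key):
        chosen = [row[j] for row in responses]
        # Difficulty index: proportion of all students answering correctly.
        difficulty = sum(c == correct for c in chosen) / n
        # Discrimination index: upper-group minus lower-group proportion correct.
        u = sum(responses[i][j] == correct for i in upper)
        low = sum(responses[i][j] == correct for i in lower)
        discrimination = (u - low) / g
        # Distractor efficiency: share of wrong options chosen often enough
        # to count as functional under the assumed threshold.
        distractors = [o for o in options if o != correct]
        functional = sum(chosen.count(o) / n >= nfd_threshold for o in distractors)
        results.append(
            {
                "difficulty": difficulty,
                "discrimination": discrimination,
                "distractor_efficiency": functional / len(distractors),
            }
        )
    return results
```

As a usage example, calling `item_analysis([["A", "B"], ["A", "C"], ["B", "B"], ["C", "D"]], key=["A", "B"], group_frac=0.5)` returns one dictionary per item. A real analysis would use far more than four students, and cutoffs for acceptable values vary between sources.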
Patterson, Michael C. – Teaching of Psychology, 2017
The present study investigated the use of multiple digital media technologies, including social networking platforms, by students while preparing for an examination (media multitasking) and the subsequent effects on exam performance. The level of media multitasking (number of simultaneous media technologies) and duration of study were used as…
Descriptors: Testing, Performance, Study Habits, Study Skills
Herrmann-Abell, Cari F.; DeBoer, George E. – Grantee Submission, 2016
Energy is a core concept in the teaching of science. Therefore, it is important to know how students' thinking about energy develops so that elementary, middle, and high school students can be appropriately supported in their understanding of energy. This study tests the validity of a proposed theoretical model of students' growth of understanding…
Descriptors: Item Response Theory, Science Tests, Scientific Concepts, Energy
Izmirli, Serkan; Kurt, Adile Askim – Journal of Educational Computing Research, 2016
The purpose of the study was to examine the effects of instruction given with different multimedia modalities (written text + animation or narration + animation) on the academic achievement, cognitive load, and positive affect in different paces (learner-paced or system-paced); 97 freshmen university students divided into four groups taught in…
Descriptors: Cognitive Processes, Difficulty Level, Academic Achievement, Educational Environment
Warne, Russell T.; Doty, Kristine J.; Malbica, Anne Marie; Angeles, Victor R.; Innes, Scott; Hall, Jared; Masterson-Nixon, Kelli – Journal of Psychoeducational Assessment, 2016
"Above-level testing" (also called "above-grade testing," "out-of-level testing," and "off-level testing") is the practice of administering to a child a test that is designed for an examinee population that is older or in a more advanced grade. Above-level testing is frequently used to help educators design…
Descriptors: Test Items, Testing, Academically Gifted, Talent Identification
Crossley, Scott; Clevinger, Amanda; Kim, YouJin – Language Assessment Quarterly, 2014
There has been a growing interest in the use of integrated tasks in the field of second language testing to enhance the authenticity of language tests. However, the role of text integration in test takers' performance has not been widely investigated. The purpose of the current study is to examine the effects of text-based relational (i.e.,…
Descriptors: Language Proficiency, Connected Discourse, Language Tests, English (Second Language)

Albanese, Mark A. – Journal of Educational Measurement, 1988
Estimates of the effects of use of formula scoring on the individual examinee's score are presented. Results for easy, moderate, and hard tests are examined. Using test characteristics from several studies shows that some examinees would increase scores substantially if they were to answer items omitted under formula directions. (SLD)
Descriptors: Difficulty Level, Guessing (Tests), Scores, Scoring Formulas

Perkins, Kyle; And Others – Language Testing, 1995
This article reports the results of using a three-layer back propagation artificial neural network to predict item difficulty in a reading comprehension test. Three classes of variables were examined: text structure, propositional analysis, and cognitive demand. Results demonstrate that the networks can consistently predict item difficulty. (JL)
Descriptors: Artificial Intelligence, Difficulty Level, English (Second Language), Language Tests
Betz, Nancy E.; Weiss, David J. – 1976
The effects of providing immediate knowledge of results (KR) and adaptive testing on test anxiety and test-taking motivation were investigated. Also studied was the accuracy of student perceptions of the difficulty of adaptive and conventional tests administered with or without immediate knowledge of results. Testees were 350 college students…
Descriptors: Ability, Achievement Tests, Anxiety, Branching
Rippey, Robert M. – 1971
Technical improvements, which may be made in the reliability and validity of tests through confidence scores, are discussed. However, studies indicate that subjects do not handle their confidence uniformly. (MS)
Descriptors: Computer Programs, Confidence Testing, Correlation, Difficulty Level