NotesFAQContact Us
Collection
Advanced
Search Tips
Audience
Practitioners1
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Showing all 15 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
Sparks, Jesse R.; van Rijn, Peter W.; Deane, Paul – Educational Assessment, 2021
Effectively evaluating the credibility and accuracy of multiple sources is critical for college readiness. We developed 24 source evaluation tasks spanning four predicted difficulty levels of a hypothesized learning progression (LP) and piloted these tasks to evaluate the utility of an LP-based approach to designing formative literacy assessments.…
Descriptors: Middle School Students, Information Sources, Grade 6, Grade 7
Peer reviewed Peer reviewed
Direct linkDirect link
Papenberg, Martin; Diedenhofen, Birk; Musch, Jochen – Journal of Experimental Education, 2021
Testwiseness may introduce construct-irrelevant variance to multiple-choice test scores. Presenting response options sequentially has been proposed as a potential solution to this problem. In an experimental validation, we determined the psychometric properties of a test based on the sequential presentation of response options. We created a strong…
Descriptors: Test Wiseness, Test Validity, Test Reliability, Multiple Choice Tests
Peer reviewed Peer reviewed
Direct linkDirect link
Zhang, Mo; van Rijn, Peter W.; Deane, Paul; Bennett, Randy E. – Educational Assessment, 2019
Writing from source text is critical for developing college-and-career readiness because it is required in advanced academic environments and many vocations. Scenario-based assessment (SBA) represents one approach to measuring this ability. In such assessment, the scenario presents an issue that the student is to read and write about. Before…
Descriptors: Writing Evaluation, Vignettes, Essays, Scores
Peer reviewed Peer reviewed
Direct linkDirect link
Wang, Shiyu; Lin, Haiyan; Chang, Hua-Hua; Douglas, Jeff – Journal of Educational Measurement, 2016
Computerized adaptive testing (CAT) and multistage testing (MST) have become two of the most popular modes in large-scale computer-based sequential testing. Though most designs of CAT and MST exhibit strength and weakness in recent large-scale implementations, there is no simple answer to the question of which design is better because different…
Descriptors: Computer Assisted Testing, Adaptive Testing, Test Format, Sequential Approach
Peer reviewed Peer reviewed
Direct linkDirect link
Fuchs, Douglas; Fuchs, Lynn S. – Exceptional Children, 2017
In 2010, the Institute of Education Sciences commissioned a much-needed national evaluation of response to intervention (RTI). The evaluators defined their task very narrowly, asking "Does the use of universal screening, including a cut-point for designating students for more intensive Tier 2 and Tier 3 interventions, increase children's…
Descriptors: Criticism, Response to Intervention, National Programs, Program Effectiveness
Peer reviewed Peer reviewed
Direct linkDirect link
Davis, Doris Bitler – Teaching of Psychology, 2017
Providing two or more versions of multiple-choice exams has long been a popular strategy for reducing the opportunity for students to engage in academic dishonesty. While the results of studies comparing exam scores under different question-order conditions have been inconclusive, the potential importance of contextual cues to aid student recall…
Descriptors: Test Construction, Multiple Choice Tests, Sequential Approach, Cues
Peer reviewed Peer reviewed
Direct linkDirect link
Furtak, Erin Marie; Ruiz-Primo, Maria Araceli; Bakeman, Roger – Educational Measurement: Issues and Practice, 2017
Formative assessment is a classroom practice that has received much attention in recent years for its established potential at increasing student learning. A frequent analytic approach for determining the quality of formative assessment practices is to develop a coding scheme and determine frequencies with which the codes are observed; however,…
Descriptors: Sequential Approach, Formative Evaluation, Alternative Assessment, Incidence
Millett, Catherine M.; Payne, David G.; Dwyer, Carol A.; Stickler, Leslie M.; Alexiou, Jon J. – Educational Testing Service, 2008
This paper presents a framework that institutions of higher education can use to improve, revise and introduce comprehensive systems for the collection and dissemination of information on student learning outcomes. For faculty and institutional leaders grappling with the many issues and nuances inherent in assessing student learning, the framework…
Descriptors: Higher Education, Educational Testing, Accountability, Outcomes of Education
Peer reviewed Peer reviewed
Direct linkDirect link
Allalouf, Avi – Educational Measurement: Issues and Practice, 2007
There is significant potential for error in long production processes that consist of sequential stages, each of which is heavily dependent on the previous stage, such as the SER (Scoring, Equating, and Reporting) process. Quality control procedures are required in order to monitor this process and to reduce the number of mistakes to a minimum. In…
Descriptors: Scoring, Quality Control, Sequential Approach, Error Correction
Johnstone, Christopher; Altman, Jason; Thurlow, Martha – National Center on Educational Outcomes, University of Minnesota, 2006
Universal design for assessments is an approach to educational assessment based on principles of accessibility for a wide variety of end users. Elements of universal design include inclusive test population; precisely defined constructs; accessible, non-biased items; tests that are amenable to accommodations; simple, clear and intuitive…
Descriptors: Educational Assessment, Test Construction, Testing Accommodations, Design Requirements
Smith, Donald M. – 1974
The concept of scaled achievement tests is discussed and a method of selecting those items of a test that form the most scalable (i.e., having the highest coefficient of reproducibility) subset is presented. Sometimes called a monotonic-deterministic model, this type of test assumes that the test items may be sequentially ordered. To determine the…
Descriptors: Achievement Tests, Arithmetic, Difficulty Level, Item Analysis
HOLLAND, JAMES G. – 1967
IN AN ATTEMPT TO PROVIDE AN OBJECTIVE MEANS FOR IDENTIFYING THE DEGREE TO WHICH MATERIAL CAN BE TECHNICALLY TERMED "PROGRAMMED", THE SO-CALLED "BLACKOUT" TECHNIQUE HAS BEEN DEVELOPED. ALL WORDS IN A PROGRAM WHICH ARE NOT DIRECTLY NEEDED IN ORDER TO PROVIDE THE REQUIRED ANSWERS ARE COVERED WITH BLACK CRAYON, AND THIS EDITED…
Descriptors: Covert Response, Diagnostic Tests, Measurement Instruments, Overt Response
Larkin, Kevin C.; Weiss, David J. – 1974
Three pyramidal adaptive tests and a conventional peaked test were constructed and administered by computer to two groups of students enrolled in undergraduate psychology courses. Six methods of scoring pyramidal tests were evaluated with respect to score distributions, stability, and the degree of relationship among scoring methods and between…
Descriptors: Adaptive Testing, Aptitude Tests, College Students, Computer Assisted Testing
Thomas, David B. – 1975
Individualized instruction programs have imposed an increased reliance on tests as a means of selecting and routing students through sometimes complex programs. Testing which occurs within the training sequence is particularly vulnerable to inefficient use of both trainee and instructor time, but computer-based instruction system can provide a…
Descriptors: Computer Assisted Instruction, Computer Oriented Programs, Discriminant Analysis, Educational Technology
Peer reviewed Peer reviewed
Wilcox, Rand R. – Journal of Experimental Education, 1982
A closed sequential procedure for estimating true score is proposed for use with answer-until-correct tests. The accuracy of determining true score is the same as in conventional sequential solutions, but the possibility of using an unnecessarily large number of items is eliminated. (Author/CM)
Descriptors: Answer Sheets, Guessing (Tests), Item Banks, Measurement Techniques