NotesFAQContact Us
Collection
Advanced
Search Tips
Publication Date
In 20250
Since 20240
Since 2021 (last 5 years)0
Since 2016 (last 10 years)1
Since 2006 (last 20 years)15
What Works Clearinghouse Rating
Showing 1 to 15 of 30 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
Wright, Daniel B. – Educational Measurement: Issues and Practice, 2019
There is much discussion about and many policies to address achievement gaps in education among groups of students. The focus here is on a different gap and it is argued that it also should be of concern. Speed gaps are differences in how quickly different groups of students answer the questions on academic assessments. To investigate some speed…
Descriptors: Academic Achievement, Achievement Gap, Reaction Time, Educational Testing
Peer reviewed Peer reviewed
Direct linkDirect link
Ramineni, Chaitanya; Williamson, David M. – Assessing Writing, 2013
In this paper, we provide an overview of psychometric procedures and guidelines Educational Testing Service (ETS) uses to evaluate automated essay scoring for operational use. We briefly describe the e-rater system, the procedures and criteria used to evaluate e-rater, implications for a range of potential uses of e-rater, and directions for…
Descriptors: Educational Testing, Guidelines, Scoring, Psychometrics
Peer reviewed Peer reviewed
Direct linkDirect link
Condon, William – Assessing Writing, 2013
Automated Essay Scoring (AES) has garnered a great deal of attention from the rhetoric and composition/writing studies community since the Educational Testing Service began using e-rater[R] and the "Criterion"[R] Online Writing Evaluation Service as products in scoring writing tests, and most of the responses have been negative. While the…
Descriptors: Measurement, Psychometrics, Evaluation Methods, Educational Testing
Peer reviewed Peer reviewed
Direct linkDirect link
Alsubait, Tahani; Parsia, Bijan; Sattler, Uli – Research in Learning Technology, 2012
Different computational models for generating analogies of the form "A is to B as C is to D" have been proposed over the past 35 years. However, analogy generation is a challenging problem that requires further research. In this article, we present a new approach for generating analogies in Multiple Choice Question (MCQ) format that can be used…
Descriptors: Computer Assisted Testing, Programming, Computer Software, Computer Software Evaluation
Peer reviewed Peer reviewed
Direct linkDirect link
Chang, Yuan-chin Ivan; Lu, Hung-Yi – Psychometrika, 2010
Item calibration is an essential issue in modern item response theory based psychological or educational testing. Due to the popularity of computerized adaptive testing, methods to efficiently calibrate new items have become more important than that in the time when paper and pencil test administration is the norm. There are many calibration…
Descriptors: Test Items, Educational Testing, Adaptive Testing, Measurement
Gonzalez, Gabriella; Le, Vi-Nhuan; Broer, Markus; Mariano, Louis T.; Froemel, J. Enrique; Goldman, Charles A.; DaVanzo, Julie – RAND Corporation, 2009
Analysis of Qatar's standards-based student assessment system, the first in the region, offers several lessons for other nations instituting similar reforms. These include the need to coordinate on standards and assessment development, allow sufficient time for a fully aligned assessment, and communicate about the purposes and uses of testing.…
Descriptors: Academic Standards, Educational Testing, Foreign Countries, Special Needs Students
Quinlan, Thomas; Higgins, Derrick; Wolff, Susanne – Educational Testing Service, 2009
This report evaluates the construct coverage of the e-rater[R[ scoring engine. The matter of construct coverage depends on whether one defines writing skill, in terms of process or product. Originally, the e-rater engine consisted of a large set of components with a proven ability to predict human holistic scores. By organizing these capabilities…
Descriptors: Guides, Writing Skills, Factor Analysis, Writing Tests
Peer reviewed Peer reviewed
Direct linkDirect link
Frey, Andreas; Seitz, Nicki-Nils – Studies in Educational Evaluation, 2009
The paper gives an overview of multidimensional adaptive testing (MAT) and evaluates its applicability in educational and psychological testing. The approach of Segall (1996) is described as a general framework for MAT. The main advantage of MAT is its capability to increase measurement efficiency. In simulation studies conceptualizing situations…
Descriptors: Psychological Testing, Adaptive Testing, Simulation, Evaluation Methods
Peer reviewed Peer reviewed
Direct linkDirect link
Yin, Alexander C.; Volkwein, J. Fredericks – New Directions for Institutional Research, 2010
After surveying 1,827 students in their final year at eighty randomly selected two-year and four-year public and private institutions, American Institutes for Research (2006) reported that approximately 30 percent of students in two-year institutions and nearly 20 percent of students in four-year institutions have only basic quantitative…
Descriptors: Standardized Tests, Basic Skills, College Admission, Educational Testing
Rothman, Robert – Alliance for Excellent Education, 2010
Assessment has long been at the center of education policy debates, and for good reason. The goal of schooling is to maximize student learning, and assessments provide a picture of what students know and are able to do. Assessments also have a strong influence on what goes on in classrooms. The United States is now poised to make the most dramatic…
Descriptors: Foreign Countries, Comparative Education, Student Evaluation, Elementary Secondary Education
Behrens, John T.; Mislevy, Robert J.; DiCerbo, Kristen E.; Levy, Roy – National Center for Research on Evaluation, Standards, and Student Testing (CRESST), 2010
The world in which learning and assessment must take place is rapidly changing. The digital revolution has created a vast space of interconnected information, communication, and interaction. Functioning effectively in this environment requires so-called 21st century skills such as technological fluency, complex problem solving, and the ability to…
Descriptors: Evidence, Student Evaluation, Educational Assessment, Influence of Technology
Peer reviewed Peer reviewed
Direct linkDirect link
McPherson, Douglas – Interactive Technology and Smart Education, 2009
Purpose: The purpose of this paper is to describe how and why Texas A&M University at Qatar (TAMUQ) has developed a system aiming to effectively place students in freshman and developmental English programs. The placement system includes: triangulating data from external test scores, with scores from a panel-marked hand-written essay (HWE),…
Descriptors: Student Placement, Educational Testing, English (Second Language), Second Language Instruction
Altman, J. R.; Lazarus, S. S.; Quenemoen, R. F.; Kearns, J.; Quenemoen, M.; Thurlow, M. L. – National Center on Educational Outcomes, University of Minnesota, 2010
This report summarizes the twelfth survey of states by the National Center on Educational Outcomes (NCEO) at the University of Minnesota. Results are presented for all 50 states and 8 of the 11 unique states. The purpose of this report is to provide a snapshot of the new initiatives, trends, accomplishments, and emerging issues during this…
Descriptors: Alternative Assessment, Outcomes of Education, Academic Achievement, Second Language Learning
Peer reviewed Peer reviewed
Direct linkDirect link
Schultz, Marian C.; Schultz, James T.; Gallogly, James – Journal of College Teaching & Learning, 2007
In 2004 Embry-Riddle Aeronautical University transitioned from proctored examinations for distance learning courses to online examinations that are not proctored. The purpose of this study was to determine if there is a significant difference in the midterm and final examination grades between proctored and non-proctored online examinations. Three…
Descriptors: Distance Education, Tests, Grades (Scholastic), Student Evaluation
Lau, Che-Ming Allen; And Others – 1996
This study focused on the robustness of unidimensional item response theory (UIRT) models in computerized classification testing against violation of the unidimensionality assumption. The study addressed whether UIRT models remain acceptable under various testing conditions and dimensionality strengths. Monte Carlo simulation techniques were used…
Descriptors: Classification, Computer Assisted Testing, Educational Testing, Item Response Theory
Previous Page | Next Page ยป
Pages: 1  |  2