Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 4 |
Since 2016 (last 10 years) | 8 |
Since 2006 (last 20 years) | 27 |
Descriptor
Source
Author
Publication Type
Education Level
Higher Education | 27 |
Postsecondary Education | 22 |
Elementary Secondary Education | 6 |
Adult Education | 2 |
Early Childhood Education | 1 |
Junior High Schools | 1 |
Middle Schools | 1 |
Secondary Education | 1 |
Location
China | 1 |
Germany | 1 |
Japan | 1 |
Portugal | 1 |
Spain | 1 |
Tennessee | 1 |
Turkey | 1 |
United Kingdom | 1 |
United Kingdom (England) | 1 |
Uruguay | 1 |
West Germany | 1 |
More ▼ |
Laws, Policies, & Programs
No Child Left Behind Act 2001 | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
Saerys-Foy, Jeffrey E.; LoCasto, Paul C.; Burn, David; Ferranti, Daniella – Discourse Processes: A Multidisciplinary Journal, 2022
According to theories of validation, people routinely check incoming information against prior knowledge during comprehension. On these theories, information is validated if it fits with prior knowledge. Some researchers argue that information needs to be successfully validated before being incorporated into the situation model. We report five…
Descriptors: Fantasy, Reading Rate, Prior Learning, Reading Comprehension
Yasuda, Jun-ichiro; Hull, Michael M.; Mae, Naohiro – Physical Review Physics Education Research, 2022
This paper presents improvements made to a computerized adaptive testing (CAT)-based version of the FCI (FCI-CAT) in regards to test security and test efficiency. First, we will discuss measures to enhance test security by controlling for item overexposure, decreasing the risk that respondents may (i) memorize the content of a pretest for use on…
Descriptors: Adaptive Testing, Computer Assisted Testing, Test Items, Risk Management
Yörük, Tayfun – Shanlax International Journal of Education, 2021
The aim of this study is to reveal the views of the practitioners regarding the measurement and evaluation sub-system of distance education applications in higher education. 11 faculty members working at Akdeniz University, determined with an easily accessible sampling of purposeful sampling methods, participated in this study in which qualitative…
Descriptors: Student Evaluation, Computer Assisted Testing, Educational Technology, Technology Integration
Gyll, Sean P. – AERA Online Paper Repository, 2020
Simulated testing has become more prevalent in higher education, especially as competency-based institutions begin to incorporate micro-credentials and skills certificates into their curriculum. Competency-based assessment falls outside traditional norm-based testing practices used in K-12 education, and is largely focused on criterion referenced…
Descriptors: Competency Based Education, Evaluation Methods, Knowledge Level, Measurement
Vanek, Norbert; Tovalovich, Artem – International Journal of Bilingual Education and Bilingualism, 2022
To what extent does emotional reactivity differ when bilinguals process input in their native (L1) or non-native language (L2)? Does the L1 elicit a significantly stronger emotional arousal or can salient second language experience generate comparably strong associations between emotions and the L2? These questions were addressed through two…
Descriptors: Physiology, Vocabulary Development, Plagiarism, Russian
Acosta-Gonzaga, Elizabeth; Walet, Niels R. – Assessment & Evaluation in Higher Education, 2018
This study explores student attitudes to the use of substantive on-line assessments that require mathematical answers. Since there is limited guidance available for their use in a university setting, our goal is to learn what are the important aspects in student acceptance of e-assessments that support learning of mathematical subjects in higher…
Descriptors: Student Attitudes, Evaluation Methods, Computer Assisted Testing, Undergraduate Students
Ramineni, Chaitanya; Williamson, David M. – Assessing Writing, 2013
In this paper, we provide an overview of psychometric procedures and guidelines Educational Testing Service (ETS) uses to evaluate automated essay scoring for operational use. We briefly describe the e-rater system, the procedures and criteria used to evaluate e-rater, implications for a range of potential uses of e-rater, and directions for…
Descriptors: Educational Testing, Guidelines, Scoring, Psychometrics
Deane, Paul – Assessing Writing, 2013
This paper examines the construct measured by automated essay scoring (AES) systems. AES systems measure features of the text structure, linguistic structure, and conventional print form of essays; as such, the systems primarily measure text production skills. In the current state-of-the-art, AES provide little direct evidence about such matters…
Descriptors: Scoring, Essays, Text Structure, Writing (Composition)
Ramineni, Chaitanya – Assessing Writing, 2013
In this paper, I describe the design and evaluation of automated essay scoring (AES) models for an institution's writing placement program. Information was gathered on admitted student writing performance at a science and technology research university in the northeastern United States. Under timed conditions, first-year students (N = 879) were…
Descriptors: Validity, Comparative Analysis, Internet, Student Placement
Greiff, Samuel; Wustenberg, Sascha; Funke, Joachim – Applied Psychological Measurement, 2012
This article addresses two unsolved measurement issues in dynamic problem solving (DPS) research: (a) unsystematic construction of DPS tests making a comparison of results obtained in different studies difficult and (b) use of time-intensive single tasks leading to severe reliability problems. To solve these issues, the MicroDYN approach is…
Descriptors: Problem Solving, Tests, Measurement, Structural Equation Models
Condon, William – Assessing Writing, 2013
Automated Essay Scoring (AES) has garnered a great deal of attention from the rhetoric and composition/writing studies community since the Educational Testing Service began using e-rater[R] and the "Criterion"[R] Online Writing Evaluation Service as products in scoring writing tests, and most of the responses have been negative. While the…
Descriptors: Measurement, Psychometrics, Evaluation Methods, Educational Testing
Swerdzewski, Peter J.; Harmes, J. Christine; Finney, Sara J. – Applied Measurement in Education, 2011
Many universities rely on data gathered from tests that are low stakes for examinees but high stakes for the various programs being assessed. Given the lack of consequences associated with many collegiate assessments, the construct-irrelevant variance introduced by unmotivated students is potentially a serious threat to the validity of the…
Descriptors: Computer Assisted Testing, Student Motivation, Inferences, Universities
Livingston, Samuel A.; Antal, Judit – Applied Measurement in Education, 2010
A simultaneous equating of four new test forms to each other and to one previous form was accomplished through a complex design incorporating seven separate equating links. Each new form was linked to the reference form by four different paths, and each path produced a different score conversion. The procedure used to resolve these inconsistencies…
Descriptors: Measurement Techniques, Measurement, Educational Assessment, Educational Testing
Qian, Hong – ProQuest LLC, 2013
This dissertation includes three essays: one essay focuses on the effect of teacher preparation programs on teacher knowledge while the other two focus on test-takers' response times on test items. Essay One addresses the problem of how opportunities to learn in teacher preparation programs influence future elementary mathematics teachers'…
Descriptors: Teacher Education Programs, Pedagogical Content Knowledge, Preservice Teacher Education, Preservice Teachers
Wise, Steven L.; Pastor, Dena A.; Kong, Xiaojing J. – Applied Measurement in Education, 2009
Previous research has shown that rapid-guessing behavior can degrade the validity of test scores from low-stakes proficiency tests. This study examined, using hierarchical generalized linear modeling, examinee and item characteristics for predicting rapid-guessing behavior. Several item characteristics were found significant; items with more text…
Descriptors: Guessing (Tests), Achievement Tests, Correlation, Test Items
Previous Page | Next Page »
Pages: 1 | 2