Showing 1 to 15 of 42 results
Peer reviewed
Samah AlKhuzaey; Floriana Grasso; Terry R. Payne; Valentina Tamma – International Journal of Artificial Intelligence in Education, 2024
Designing and constructing pedagogical tests that contain items (i.e. questions) which measure various types of skills for different levels of students equitably is a challenging task. Teachers and item writers alike need to ensure that the quality of assessment materials is consistent, if student evaluations are to be objective and effective.…
Descriptors: Test Items, Test Construction, Difficulty Level, Prediction
Peer reviewed
Aditya Shah; Ajay Devmane; Mehul Ranka; Prathamesh Churi – Education and Information Technologies, 2024
Online learning has grown due to the advancement of technology and flexibility. Online examinations measure students' knowledge and skills. Traditional question papers include inconsistent difficulty levels, arbitrary question allocations, and poor grading. The suggested model calibrates question paper difficulty based on student performance to…
Descriptors: Computer Assisted Testing, Difficulty Level, Grading, Test Construction
Peer reviewed
Mardiana – Eurasian Journal of Applied Linguistics, 2023
Written inquiries, which are more frequent and have less of a focus on complex thinking, are issues at school. Students are not taught how to respond to questions found in High-Level Thinking Skills (HOTS) tests, hence, their thinking abilities are generally weak. The issue for teachers is that neither they nor anyone else has been able to create…
Descriptors: Skill Development, Thinking Skills, Check Lists, Models
Peer reviewed
Villarroel, Verónica; Bloxham, Susan; Bruna, Daniela; Bruna, Carola; Herrera-Seda, Constanza – Assessment & Evaluation in Higher Education, 2018
Authenticity has been identified as a key characteristic of assessment design which promotes learning. Authentic assessment aims to replicate the tasks and performance standards typically found in the world of work, and has been found to have a positive impact on student learning, autonomy, motivation, self-regulation and metacognition; abilities…
Descriptors: Performance Based Assessment, Barriers, Higher Education, Models
Peer reviewed
Andrich, David; Marais, Ida – Journal of Educational Measurement, 2018
Even though guessing biases difficulty estimates as a function of item difficulty in the dichotomous Rasch model, assessment programs with tests which include multiple-choice items often construct scales using this model. Research has shown that when all items are multiple-choice, this bias can largely be eliminated. However, many assessments have…
Descriptors: Multiple Choice Tests, Test Items, Guessing (Tests), Test Bias
Peer reviewed
Sarwanto; Fajari, Laksmi Evasufi Widi; Chumdari – International Journal of Instruction, 2021
Critical thinking skills are the 21st-century life skills that are needed by students. However, in elementary schools, there are no instruments that are truly effective and efficient to measure critical thinking skills. This research aims to develop an open-ended question assessment instrument to measure students' critical-thinking skills, to test…
Descriptors: Critical Thinking, Thinking Skills, Teaching Methods, Questioning Techniques
Beghetto, Ronald A. – ECNU Review of Education, 2019
Purpose: This article, based on an invited talk, aims to explore the relationship among large-scale assessments, creativity and personalized learning. Design/Approach/Methods: Starting with the working definition of large-scale assessments, creativity, and personalized learning, this article identified the paradox of combining these three…
Descriptors: Measurement, Creativity, Problem Solving, Artificial Intelligence
Peer reviewed
Sadhu, Satyu; Laksono, Endang W. – International Journal of Instruction, 2018
Although the development of critical thinking and chemical literacy is a major goal of science education, adequate emphasis has not been given to the measurement of both skills. This study reports the development and validation of an integrated assessment instrument to measure students' critical thinking skill and chemical literacy together in…
Descriptors: Critical Thinking, Item Response Theory, Chemistry, Construct Validity
Edward Paul Getman – Online Submission, 2020
Despite calls for engaging assessments targeting young language learners (YLLs) between 8 and 13 years old, what makes assessment tasks engaging and how such task characteristics affect measurement quality have not been well studied empirically. Furthermore, there has been a dearth of validity research about technology-enhanced speaking tests for…
Descriptors: English (Second Language), Language Tests, Second Language Learning, Learner Engagement
Peer reviewed
El Masri, Yasmine H.; Ferrara, Steve; Foltz, Peter W.; Baird, Jo-Anne – Curriculum Journal, 2017
Predicting item difficulty is highly important in education for both teachers and item writers. Despite identifying a large number of explanatory variables, predicting item difficulty remains a challenge in educational assessment with empirical attempts rarely exceeding 25% of variance explained. This paper analyses 216 science items of key stage…
Descriptors: Predictor Variables, Test Items, Difficulty Level, Test Construction
Peer reviewed
Bejar, Isaac I.; Deane, Paul D.; Flor, Michael; Chen, Jing – ETS Research Report Series, 2017
The report is the first systematic evaluation of the sentence equivalence item type introduced by the GRE® revised General Test. We adopt a validity framework to guide our investigation based on Kane's approach to validation whereby a hierarchy of inferences that should be documented to support score meaning and interpretation is…
Descriptors: College Entrance Examinations, Graduate Study, Generalization, Inferences
Peer reviewed
Luecht, Richard M. – Journal of Applied Testing Technology, 2013
Assessment engineering is a new way to design and implement scalable, sustainable and ideally lower-cost solutions to the complexities of designing and developing tests. It represents a merger of sorts between cognitive task modeling and engineering design principles--a merger that requires some new thinking about the nature of score scales, item…
Descriptors: Engineering, Test Construction, Test Items, Models
Peer reviewed
Hendrickson, Amy; Ewing, Maureen; Kaliski, Pamela; Huff, Kristen – Journal of Applied Testing Technology, 2013
Evidence-centered design (ECD) is an orientation towards assessment development. It differs from conventional practice in several ways and consists of multiple activities. Each of these activities results in a set of useful documentation: domain analysis, domain modeling, construction of the assessment framework, and assessment…
Descriptors: Evidence, Test Construction, Educational Assessment, Learning Theories
Peer reviewed
Ashford-Rowe, Kevin; Herrington, Janice; Brown, Christine – Assessment & Evaluation in Higher Education, 2014
This study sought to determine the critical elements of an authentic learning activity, design them into an applicable framework and then use this framework to guide the design, development and application of work-relevant assessment. Its purpose was to formulate an effective model of task design and assessment. The first phase of the study…
Descriptors: Performance Based Assessment, Models, Test Construction, Transfer of Training
Rao, Vasanthi – ProQuest LLC, 2012
In 1997, based on the amendments to the Individuals with Disabilities Education Act (IDEA), all states were faced with a statutory requirement to develop and implement alternate assessments for students with disabilities unable to participate in the statewide large-scale assessment. States were given the challenge of creating, implementing, and…
Descriptors: Alternative Assessment, Psychometrics, Item Response Theory, Models