NotesFAQContact Us
Collection
Advanced
Search Tips
Audience
Practitioners1
Laws, Policies, & Programs
What Works Clearinghouse Rating
Showing 1 to 15 of 23 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
Nan Xie; Zhengxu Li; Haipeng Lu; Wei Pang; Jiayin Song; Beier Lu – IEEE Transactions on Learning Technologies, 2025
Classroom engagement is a critical factor for evaluating students' learning outcomes and teachers' instructional strategies. Traditional methods for detecting classroom engagement, such as coding and questionnaires, are often limited by delays, subjectivity, and external interference. While some neural network models have been proposed to detect…
Descriptors: Learner Engagement, Artificial Intelligence, Technology Uses in Education, Educational Technology
Peer reviewed Peer reviewed
Direct linkDirect link
An, Lily Shiao; Ho, Andrew Dean; Davis, Laurie Laughlin – Educational Measurement: Issues and Practice, 2022
Technical documentation for educational tests focuses primarily on properties of individual scores at single points in time. Reliability, standard errors of measurement, item parameter estimates, fit statistics, and linking constants are standard technical features that external stakeholders use to evaluate items and individual scale scores.…
Descriptors: Documentation, Scores, Evaluation Methods, Longitudinal Studies
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Mohammed, Aisha; Dawood, Abdul Kareem Shareef; Alghazali, Tawfeeq; Kadhim, Qasim Khlaif; Sabti, Ahmed Abdulateef; Sabit, Shaker Holh – International Journal of Language Testing, 2023
Cognitive diagnostic models (CDMs) have received much interest within the field of language testing over the last decade due to their great potential to provide diagnostic feedback to all stakeholders and ultimately improve language teaching and learning. A large number of studies have demonstrated the application of CDMs on advanced large-scale…
Descriptors: Reading Comprehension, Reading Tests, Language Tests, English (Second Language)
Peer reviewed Peer reviewed
Direct linkDirect link
Tavares, Walter; Brydges, Ryan; Myre, Paul; Prpic, Jason; Turner, Linda; Yelle, Richard; Huiskamp, Maud – Advances in Health Sciences Education, 2018
Assessment of clinical competence is complex and inference based. Trustworthy and defensible assessment processes must have favourable evidence of validity, particularly where decisions are considered high stakes. We aimed to organize, collect and interpret validity evidence for a high stakes simulation based assessment strategy for certifying…
Descriptors: Competence, Simulation, Allied Health Personnel, Certification
Peer reviewed Peer reviewed
Direct linkDirect link
Zimmer, Ron; Engberg, John – Journal of School Choice, 2016
School choice programs continue to be controversial, spurring a number of researchers into evaluating them. When possible, researchers evaluate the effect of attending a school of choice using randomized designs to eliminate possible selection bias. Randomized designs are often thought of as the gold standard for research, but many circumstances…
Descriptors: Inferences, School Choice, Educational Vouchers, Charter Schools
Peer reviewed Peer reviewed
Direct linkDirect link
Martínez, José Felipe; Schweig, Jonathan; Goldschmidt, Pete – Educational Evaluation and Policy Analysis, 2016
A key question facing teacher evaluation systems is how to combine multiple measures of complex constructs into composite indicators of performance. We use data from the Measures of Effective Teaching (MET) study to investigate the measurement properties of composite indicators obtained under various conjunctive, disjunctive (or complementary),…
Descriptors: Teacher Evaluation, Outcome Measures, Evaluation Methods, Educational Policy
Peer reviewed Peer reviewed
Direct linkDirect link
Ercikan, Kadriye; Oliveri, María Elena – Applied Measurement in Education, 2016
Assessing complex constructs such as those discussed under the umbrella of 21st century constructs highlights the need for a principled assessment design and validation approach. In our discussion, we made a case for three considerations: (a) taking construct complexity into account across various stages of assessment development such as the…
Descriptors: Evaluation Methods, Test Construction, Design, Scaling
Peer reviewed Peer reviewed
Direct linkDirect link
Anderson, Daniel; Farley, Dan; Tindal, Gerald – Journal of Special Education, 2015
Students with significant cognitive disabilities present an assessment dilemma that centers on access and validity in large-scale testing programs. Typically, access is improved by eliminating construct-irrelevant barriers, while validity is improved, in part, through test standardization. In this article, one state's alternate assessment data…
Descriptors: Mental Retardation, Evaluation Methods, Student Evaluation, Standardized Tests
Peer reviewed Peer reviewed
Direct linkDirect link
Ruiz-Primo, Maria Araceli; Li, Min; Wills, Kellie; Giamellaro, Michael; Lan, Ming-Chih; Mason, Hillary; Sands, Deanna – Journal of Research in Science Teaching, 2012
The purpose of this article is to address a major gap in the instructional sensitivity literature on how to develop instructionally sensitive assessments. We propose an approach to developing and evaluating instructionally sensitive assessments in science and test this approach with one elementary life-science module. The assessment we developed…
Descriptors: Effect Size, Inferences, Student Centered Curriculum, Test Construction
Peer reviewed Peer reviewed
Direct linkDirect link
Sandilands, Debra; Oliveri, Maria Elena; Zumbo, Bruno D.; Ercikan, Kadriye – International Journal of Testing, 2013
International large-scale assessments of achievement often have a large degree of differential item functioning (DIF) between countries, which can threaten score equivalence and reduce the validity of inferences based on comparisons of group performances. It is important to understand potential sources of DIF to improve the validity of future…
Descriptors: Validity, Measures (Individuals), International Studies, Foreign Countries
Peer reviewed Peer reviewed
Direct linkDirect link
Mislevy, Robert J.; Haertel, Geneva; Cheng, Britte H.; Ructtinger, Liliana; DeBarger, Angela; Murray, Elizabeth; Rose, David; Gravel, Jenna; Colker, Alexis M.; Rutstein, Daisy; Vendlinski, Terry – Educational Research and Evaluation, 2013
Standardizing aspects of assessments has long been recognized as a tactic to help make evaluations of examinees fair. It reduces variation in irrelevant aspects of testing procedures that could advantage some examinees and disadvantage others. However, recent attention to making assessment accessible to a more diverse population of students…
Descriptors: Testing Accommodations, Access to Education, Testing, Psychometrics
Peer reviewed Peer reviewed
Direct linkDirect link
Pellegrino, James W. – Journal of Research in Science Teaching, 2012
Beginning with a reference to living in a time of both uncertainty and opportunity, this article presents a discussion of key areas where shared understanding is needed if we are to successfully realize the design and use of high quality, valid assessments of science. The key areas discussed are: (1) assessment purpose and use, (2) the nature of…
Descriptors: Science Education, Science and Society, Academic Standards, State Standards
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Almond, Patricia; Winter, Phoebe; Cameto, Renee; Russell, Michael; Sato, Edynn; Clarke-Midura, Jody; Torres, Chloe; Haertel, Geneva; Dolan, Robert; Beddow, Peter; Lazarus, Sheryl – Journal of Technology, Learning, and Assessment, 2010
This paper represents one outcome from the "Invitational Research Symposium on Technology-Enabled and Universally Designed Assessments," which examined technology-enabled assessments (TEA) and universal design (UD) as they relate to students with disabilities (SWD). It was developed to stimulate research into TEAs designed to make tests…
Descriptors: Disabilities, Inferences, Computer Assisted Testing, Alternative Assessment
Peer reviewed Peer reviewed
Direct linkDirect link
Harwell, Michael; LeBeau, Brandon – Educational Researcher, 2010
The use of eligibility for a free lunch as a measure of a student's socioeconomic status continues to be a fixture of quantitative education research. Despite its popularity, it is unclear that education researchers are familiar with what student eligibility for a free lunch does (and does not) represent. The authors examine the National School…
Descriptors: Socioeconomic Status, Eligibility, Lunch Programs, Nutrition
Millett, Catherine M.; Payne, David G.; Dwyer, Carol A.; Stickler, Leslie M.; Alexiou, Jon J. – Educational Testing Service, 2008
This paper presents a framework that institutions of higher education can use to improve, revise and introduce comprehensive systems for the collection and dissemination of information on student learning outcomes. For faculty and institutional leaders grappling with the many issues and nuances inherent in assessing student learning, the framework…
Descriptors: Higher Education, Educational Testing, Accountability, Outcomes of Education
Previous Page | Next Page »
Pages: 1  |  2