Showing 2,356 to 2,370 of 9,530 results
Kim, Weon H. – ProQuest LLC, 2017
The purpose of the present study is to apply the item response theory (IRT) and testlet response theory (TRT) models to a reading comprehension test. This study applied the TRT models and the traditional IRT model to a seventh-grade reading comprehension test (n = 8,815) with eight testlets. These three models were compared to determine the best…
Descriptors: Item Response Theory, Test Items, Correlation, Reading Tests
Nelson, Gena; Powell, Sarah R – Grantee Submission, 2017
Though proficiency with computation is highly emphasized in national mathematics standards, students with mathematics difficulty (MD) continue to struggle with computation. To learn more about the differences in computation error patterns between typically achieving students and students with MD, we assessed 478 3rd-grade students on a measure of…
Descriptors: Computation, Mathematics Instruction, Learning Problems, Mathematics Skills
Peer reviewed
Madya, Suwarsih; Retnawati, Heri; Purnawan, Ari; Putro, Nur Hidayanto Pancoro Setyo; Apino, Ezi – TEFLIN Journal: A publication on the teaching and learning of English, 2019
This explorative-descriptive study set out to examine the equivalence among Test of English Proficiency (TOEP) forms, developed by the Indonesian Testing Service Centre (ITSC) and co-founded by The Association for The Teaching of English as a Foreign Language in Indonesia (TEFLIN) and The Association of Psychology in Indonesia. Using a…
Descriptors: Language Tests, Language Proficiency, English (Second Language), Second Language Learning
Peer reviewed
PDF on ERIC
Dirlik, Ezgi Mor – International Journal of Progressive Education, 2019
Item response theory (IRT) has many advantages over its predecessor, Classical Test Theory (CTT), such as invariant item parameters and ability parameter estimates that do not depend on the items. However, to obtain these advantages, some assumptions must be met: unidimensionality, normality, and local independence. However, it is not…
Descriptors: Comparative Analysis, Nonparametric Statistics, Item Response Theory, Models
Peer reviewed
Cohen, Dale J.; Zhang, Jin; Wothke, Werner – Applied Measurement in Education, 2019
Construct-irrelevant cognitive complexity of some items in the statewide grade-level assessments may impose performance barriers for students with disabilities who are ineligible for alternate assessments based on alternate achievement standards. This has spurred research into whether items can be modified to reduce complexity without affecting…
Descriptors: Test Items, Accessibility (for Disabled), Students with Disabilities, Low Achievement
Peer reviewed
Deane, Paul; Song, Yi; van Rijn, Peter; O'Reilly, Tenaha; Fowles, Mary; Bennett, Randy; Sabatini, John; Zhang, Mo – Reading and Writing: An Interdisciplinary Journal, 2019
This paper presents a theoretical and empirical case for the value of scenario-based assessment (SBA) in the measurement of students' written argumentation skills. First, we frame the problem in terms of creating a reasonably efficient method of evaluating written argumentation skills, including for students at relatively low levels of competency.…
Descriptors: Vignettes, Writing Skills, Persuasive Discourse, Writing Evaluation
Achieve, Inc., 2019
Throughout the country, state assessments in reading, mathematics, and science continue to play important roles in instructional improvement and accountability. State assessments must align to academic standards that are significantly more challenging than previous standards, reflecting the current knowledge and skill demands of postsecondary…
Descriptors: State Standards, Academic Standards, Guidelines, Mathematics Achievement
Neuman, Susan B. – Educational Leadership, 2016
"Data-driven instruction can distort the way reading is taught, harming the students who need high-quality instruction the most," Susan B. Neuman concludes from her research team's two years of observation in nine low-income New York City schools. She describes how some students are reminded that they are "failures" every day by…
Descriptors: Data, Decision Making, Teaching Methods, Educational Theories
Peer reviewed
PDF on ERIC
Khoshaim, Heba Bakr; Rashid, Saima – International Journal of Instruction, 2016
Assessment is one of the vital steps in the teaching and learning process. The reported action research examines the effectiveness of an assessment process and inspects the validity of exam questions used for the assessment purpose. The instructors of a college-level mathematics course studied questions used in the final exams during the academic…
Descriptors: Item Analysis, Test Items, Mathematics Tests, Difficulty Level
Peer reviewed
Benítez, Isabel; Padilla, José-Luis; Hidalgo Montesinos, María Dolores; Sireci, Stephen G. – Applied Measurement in Education, 2016
Analysis of differential item functioning (DIF) is often used to determine if cross-lingual assessments are equivalent across languages. However, evidence on the causes of cross-lingual DIF is still elusive. Expert appraisal is a qualitative method useful for obtaining detailed information about problematic elements in the different linguistic…
Descriptors: Test Bias, Mixed Methods Research, Questionnaires, International Assessment
Peer reviewed
Bochner, Joseph H.; Samar, Vincent J.; Hauser, Peter C.; Garrison, Wayne M.; Searls, J. Matt; Sanders, Cynthia A. – Language Testing, 2016
American Sign Language (ASL) is one of the most commonly taught languages in North America. Yet, few assessment instruments for ASL proficiency have been developed, none of which have adequately demonstrated validity. We propose that the American Sign Language Discrimination Test (ASL-DT), a recently developed measure of learners' ability to…
Descriptors: American Sign Language, Test Validity, Language Proficiency, Phonological Awareness
Peer reviewed
PDF on ERIC
Zembat, Rengin; Turasli, Nalan Kuru; Güven, Gülçin; Sezer, Türker; Aksin, Ezgi; Yilmaz, Elif; Bayindir, Dilan – Journal of Education and Training Studies, 2016
The aim of this study is to investigate the reliability and validity of the DeMoulin Self-Concept Developmental Scale for children aged 36-72 months. In addition, the study examines the effects of age and gender on children's self-concept. The study uses the survey method. The sample consists of 810 children who attend…
Descriptors: Test Validity, Test Reliability, Self Concept Measures, Age Differences
Peer reviewed
PDF on ERIC
Guo, Hongwen; Zu, Jiyun; Kyllonen, Patrick; Schmitt, Neal – ETS Research Report Series, 2016
In this report, systematic applications of statistical and psychometric methods are used to develop and evaluate scoring rules in terms of test reliability. Data collected from a situational judgment test are used to facilitate the comparison. For a well-developed item with appropriate keys (i.e., the correct answers), agreement among various…
Descriptors: Scoring, Test Reliability, Statistical Analysis, Psychometrics
Peer reviewed
Andersson, Björn – Journal of Educational Measurement, 2016
In observed-score equipercentile equating, the goal is to make scores on two scales or tests measuring the same construct comparable by matching the percentiles of the respective score distributions. If the tests consist of different items with multiple categories for each item, a suitable model for the responses is a polytomous item response…
Descriptors: Equated Scores, Item Response Theory, Error of Measurement, Tests
Peer reviewed
PDF on ERIC
Seker, Hasan – Universal Journal of Educational Research, 2016
The present study investigated some of the criticisms pre-service teachers raised against their exams. It also examined the extent to which philosophical, romantic, and mythic questions could be used as an alternative. The study group consists of 117 pre-service teachers from a classroom teacher education program. In the study, it was…
Descriptors: Test Items, Test Content, Preservice Teachers, Criticism