Peer reviewed
Tengberg, Michael – Language Testing, 2017
Reading comprehension tests are often assumed to measure the same, or at least similar, constructs. Yet, reading is not a single but a multidimensional form of processing, which means that variations in terms of reading material and item design may emphasize one aspect of the construct at the cost of another. The educational systems in Denmark,…
Descriptors: Foreign Countries, National Competency Tests, Reading Tests, Comparative Analysis
Peer reviewed
Le Hebel, Florence; Montpied, Pascale; Tiberghien, Andrée; Fontanieu, Valérie – International Journal of Science Education, 2017
The understanding of what makes a question difficult is a crucial concern in assessment. To study the difficulty of test questions, we focus on the case of PISA, which assesses to what degree 15-year-old students have acquired knowledge and skills essential for full participation in society. Our research question is to identify PISA science item…
Descriptors: Achievement Tests, Foreign Countries, International Assessment, Secondary School Students
Kim, Weon H. – ProQuest LLC, 2017
The purpose of the present study is to apply the item response theory (IRT) and testlet response theory (TRT) models to a reading comprehension test. This study applied the TRT models and the traditional IRT model to a seventh-grade reading comprehension test (n = 8,815) with eight testlets. These three models were compared to determine the best…
Descriptors: Item Response Theory, Test Items, Correlation, Reading Tests
Nelson, Gena; Powell, Sarah R – Grantee Submission, 2017
Though proficiency with computation is highly emphasized in national mathematics standards, students with mathematics difficulty (MD) continue to struggle with computation. To learn more about the differences in computation error patterns between typically achieving students and students with MD, we assessed 478 3rd-grade students on a measure of…
Descriptors: Computation, Mathematics Instruction, Learning Problems, Mathematics Skills
Peer reviewed
Madya, Suwarsih; Retnawati, Heri; Purnawan, Ari; Putro, Nur Hidayanto Pancoro Setyo; Apino, Ezi – TEFLIN Journal: A publication on the teaching and learning of English, 2019
This explorative-descriptive study set out to examine the equivalence among Test of English Proficiency (TOEP) forms, developed by the Indonesian Testing Service Centre (ITSC) and co-founded by The Association for The Teaching of English as a Foreign Language in Indonesia (TEFLIN) and The Association of Psychology in Indonesia. Using a…
Descriptors: Language Tests, Language Proficiency, English (Second Language), Second Language Learning
Peer reviewed
Dirlik, Ezgi Mor – International Journal of Progressive Education, 2019
Item response theory (IRT) has many advantages over its predecessor, Classical Test Theory (CTT), such as invariant item parameters and ability parameter estimates that are independent of the items. However, to obtain these advantages, certain assumptions must be met: unidimensionality, normality, and local independence. However, it is not…
Descriptors: Comparative Analysis, Nonparametric Statistics, Item Response Theory, Models
Peer reviewed
Cohen, Dale J.; Zhang, Jin; Wothke, Werner – Applied Measurement in Education, 2019
Construct-irrelevant cognitive complexity of some items in the statewide grade-level assessments may impose performance barriers for students with disabilities who are ineligible for alternate assessments based on alternate achievement standards. This has spurred research into whether items can be modified to reduce complexity without affecting…
Descriptors: Test Items, Accessibility (for Disabled), Students with Disabilities, Low Achievement
Peer reviewed
Deane, Paul; Song, Yi; van Rijn, Peter; O'Reilly, Tenaha; Fowles, Mary; Bennett, Randy; Sabatini, John; Zhang, Mo – Reading and Writing: An Interdisciplinary Journal, 2019
This paper presents a theoretical and empirical case for the value of scenario-based assessment (SBA) in the measurement of students' written argumentation skills. First, we frame the problem in terms of creating a reasonably efficient method of evaluating written argumentation skills, including for students at relatively low levels of competency.…
Descriptors: Vignettes, Writing Skills, Persuasive Discourse, Writing Evaluation
Achieve, Inc., 2019
Throughout the country, state assessments in reading, mathematics, and science continue to play important roles in instructional improvement and accountability. State assessments must align to academic standards that are significantly more challenging than previous standards, reflecting the current knowledge and skill demands of postsecondary…
Descriptors: State Standards, Academic Standards, Guidelines, Mathematics Achievement
Neuman, Susan B. – Educational Leadership, 2016
"Data-driven instruction can distort the way reading is taught, harming the students who need high-quality instruction the most," Susan B. Neuman concludes from her research team's two years of observation in nine low-income New York City schools. She describes how some students are reminded that they are "failures" every day by…
Descriptors: Data, Decision Making, Teaching Methods, Educational Theories
Peer reviewed
Khoshaim, Heba Bakr; Rashid, Saima – International Journal of Instruction, 2016
Assessment is one of the vital steps in the teaching and learning process. The reported action research examines the effectiveness of an assessment process and inspects the validity of exam questions used for the assessment purpose. The instructors of a college-level mathematics course studied questions used in the final exams during the academic…
Descriptors: Item Analysis, Test Items, Mathematics Tests, Difficulty Level
Peer reviewed
Benítez, Isabel; Padilla, José-Luis; Hidalgo Montesinos, María Dolores; Sireci, Stephen G. – Applied Measurement in Education, 2016
Analysis of differential item functioning (DIF) is often used to determine if cross-lingual assessments are equivalent across languages. However, evidence on the causes of cross-lingual DIF is still evasive. Expert appraisal is a qualitative method useful for obtaining detailed information about problematic elements in the different linguistic…
Descriptors: Test Bias, Mixed Methods Research, Questionnaires, International Assessment
Peer reviewed
Bochner, Joseph H.; Samar, Vincent J.; Hauser, Peter C.; Garrison, Wayne M.; Searls, J. Matt; Sanders, Cynthia A. – Language Testing, 2016
American Sign Language (ASL) is one of the most commonly taught languages in North America. Yet, few assessment instruments for ASL proficiency have been developed, none of which have adequately demonstrated validity. We propose that the American Sign Language Discrimination Test (ASL-DT), a recently developed measure of learners' ability to…
Descriptors: American Sign Language, Test Validity, Language Proficiency, Phonological Awareness
Peer reviewed
Zembat, Rengin; Turasli, Nalan Kuru; Güven, Gülçin; Sezer, Türker; Aksin, Ezgi; Yilmaz, Elif; Bayindir, Dilan – Journal of Education and Training Studies, 2016
The aim of this study is to investigate the reliability and validity of the DeMoulin Self-Concept Developmental Scale for 36-72 month old children. In addition, the study attempts to examine the effects of age and gender on children's self-concept. The study uses the survey method. The sample consists of 810 children who attend…
Descriptors: Test Validity, Test Reliability, Self Concept Measures, Age Differences
Peer reviewed
Guo, Hongwen; Zu, Jiyun; Kyllonen, Patrick; Schmitt, Neal – ETS Research Report Series, 2016
In this report, systematic applications of statistical and psychometric methods are used to develop and evaluate scoring rules in terms of test reliability. Data collected from a situational judgment test are used to facilitate the comparison. For a well-developed item with appropriate keys (i.e., the correct answers), agreement among various…
Descriptors: Scoring, Test Reliability, Statistical Analysis, Psychometrics