NotesFAQContact Us
Collection
Advanced
Search Tips
Showing 2,341 to 2,355 of 9,533 results Save | Export
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Lu, Ru; Haberman, Shelby; Guo, Hongwen; Liu, Jinghua – ETS Research Report Series, 2015
In this study, we apply jackknifing to anchor items to evaluate the impact of anchor selection on equating stability. In an ideal world, the choice of anchor items should have little impact on equating results. When this ideal does not correspond to reality, selection of anchor items can strongly influence equating results. This influence does not…
Descriptors: Test Construction, Equated Scores, Test Items, Sampling
Peer reviewed Peer reviewed
Direct linkDirect link
Schmitz, Florian; Wilhelm, Oliver – Measurement: Interdisciplinary Research and Perspectives, 2015
The excellent paper by Goldhammer (this issue) deals with a most relevant and very pervasive problem of ability assessment: the evaluation of performance by considering speed and accuracy of performance. Goldhammer proposes item-level time limits as a possible remedy for individual differences in the speed-accuracy trade-off (SATO) to keep time…
Descriptors: Ability, Reaction Time, Accuracy, Performance
Peer reviewed Peer reviewed
Direct linkDirect link
Andrich, David; Hagquist, Curt – Educational and Psychological Measurement, 2015
Differential item functioning (DIF) for an item between two groups is present if, for the same person location on a variable, persons from different groups have different expected values for their responses. Applying only to dichotomously scored items in the popular Mantel-Haenszel (MH) method for detecting DIF in which persons are classified by…
Descriptors: Test Bias, Test Items, Item Response Theory, Statistical Analysis
Peer reviewed Peer reviewed
Direct linkDirect link
Tay-lim, Brenda Siok-Hoon; Zhang, Jinming – Applied Measurement in Education, 2015
To ensure the statistical result validity, model-data fit must be evaluated for each item. In practice, certain actions or treatments are needed for misfit items. If all misfit items are treated, much item information would be lost during calibration. On the other hand, if only severely misfit items are treated, the inclusion of misfit items may…
Descriptors: Test Items, Goodness of Fit, Classification, Item Response Theory
Peer reviewed Peer reviewed
Direct linkDirect link
Crisp, Victoria – Educational Studies, 2015
This research investigated the difficulty of examination questions for students with weaker reading skills. Item level performance data were obtained for all candidates who took a maths examination (for 16 year olds). A sub-group of students who had access to a reader was identified (students with proven reading difficulties are permitted to have…
Descriptors: Test Items, Difficulty Level, Mathematics Tests, Reading Difficulties
Peer reviewed Peer reviewed
Direct linkDirect link
Briggs, Derek C.; Dadey, Nathan – Educational Assessment, 2015
This study focuses on an instance in which the mean grade-to-grade scale scores on a vertical scale showed evidence of common test items that do not get easier from one grade to the next. The issue was examined as part of a 2-day workshop in which participants were asked to predict the growth on all linking items used in the construction of…
Descriptors: Test Items, Grading, Scores, Scaling
Peer reviewed Peer reviewed
Direct linkDirect link
Klein, Ariel; Badia, Toni – Journal of Creative Behavior, 2015
In this study we show how complex creative relations can arise from fairly frequent semantic relations observed in everyday language. By doing this, we reflect on some key cognitive aspects of linguistic and general creativity. In our experimentation, we automated the process of solving a battery of Remote Associates Test tasks. By applying…
Descriptors: Language Usage, Semantics, Natural Language Processing, Test Items
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Seipel, Ben; Biancarosa, Gina; Carlson, Sarah; Davison, Mark – Society for Research on Educational Effectiveness, 2015
Previous research has established two types of struggling readers: those who struggle with lower-level reading skills and those who struggle with higher-level reading skills (Cain & Oakhill, 2006; Perfetti, 2007). The latter group is commonly termed "poor comprehenders": readers who exhibit poor comprehension compared to peers with…
Descriptors: Reading Comprehension, Reading Tests, Cloze Procedure, Multiple Choice Tests
Peer reviewed Peer reviewed
Direct linkDirect link
Scott, Terry F.; Schumayer, Dániel – Physical Review Physics Education Research, 2017
The Force Concept Inventory is one of the most popular and most analyzed multiple-choice concept tests used to investigate students' understanding of Newtonian mechanics. The correct answers poll a set of underlying Newtonian concepts and the coherence of these underlying concepts has been found in the data. However, this inventory was constructed…
Descriptors: World Views, Scientific Concepts, Scientific Principles, Multiple Choice Tests
Peer reviewed Peer reviewed
Direct linkDirect link
Shangraw, Rebecca – Strategies: A Journal for Physical and Sport Educators, 2017
The Domain Five Observation Instrument (DFOI) is a competency-based observation instrument recommended for sport leaders or researchers who wish to evaluate coaches' instructional behaviors. The DFOI includes 10 behavior categories and four timed categories that encompass 34 observable instructional benchmarks outlined in domain five of the…
Descriptors: Competency Based Teacher Education, Coaching (Performance), Evaluation Methods, Teacher Behavior
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Çakir, Sinan – Journal of Language and Linguistic Studies, 2017
The present study is a follow-up study of Çakir (2016b) which focused on the wh-adverbial & which NP constructions asymmetry within island structures in Turkish. The characteristics of wh-adverbial nasil "how" is compared with the which-NP constructions "hangisekilde" "in what way" and "hangihalde"…
Descriptors: Nouns, Phrase Structure, Grammar, Turkish
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Yusof, Safiah Md; Lim, Tick Meng; Png, Leo; Khatab, Zainuriyah Abd; Singh, Harvinder Kaur Dharam – Journal of Learning for Development, 2017
Open University Malaysia (OUM) is progressively moving towards implementing assessment on demand and online assessment. This move is deemed necessary for OUM to continue to be the leading provider of flexible learning. OUM serves a very large number of students each semester and these students are vastly distributed throughout the country. As the…
Descriptors: Foreign Countries, Computer Assisted Testing, Computer Managed Instruction, Management Systems
Peer reviewed Peer reviewed
Direct linkDirect link
O'Keeffe, Lisa; O'Halloran, Kay L.; Wignell, Peter; Tan, Sabine – Australian Educational Researcher, 2017
In 2015, the Australian Council for Educational Research (ACER) was tasked with developing literacy and numeracy skills testing for pre-service teachers. All undergraduate and postgraduate trainee teachers are now required to pass these literacy and numeracy tests at some stage on their journey to becoming a teacher; for commencing students from…
Descriptors: Foreign Countries, Preservice Teachers, Numeracy, Mathematics Skills
Peer reviewed Peer reviewed
Direct linkDirect link
Golubovich, Juliya; Seybert, Jacob; Martin-Raugh, Michelle; Naemi, Bobby; Vega, Ronald P.; Roberts, Richard D. – International Journal of Testing, 2017
Accurate appraisal of others' behavior is critical for the production of skilled interpersonal behavior. We used an ecologically valid methodology, a video-based situational judgment test with true-false items, to assess the accuracy with which students (N = 947) perceive the interpersonal behavior of actors involved in workplace situations.…
Descriptors: Video Technology, Interpersonal Competence, Test Items, Accuracy
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Wheadon, Jacob; Wright, Geoff A.; West, Richard E.; Skaggs, Paul – Journal of Technology Education, 2017
This study discusses the need, development, and validation of the Innovation Test Instrument (ITI). This article outlines how the researchers identified the content domain of the assessment and created test items. Then, it describes initial validation testing of the instrument. The findings suggest that the ITI is a good first step in creating an…
Descriptors: Innovation, Program Validation, Evaluation Needs, Test Construction
Pages: 1  |  ...  |  153  |  154  |  155  |  156  |  157  |  158  |  159  |  160  |  161  |  ...  |  636