NotesFAQContact Us
Collection
Advanced
Search Tips
Showing 2,071 to 2,085 of 9,530 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
Wyse, Adam E. – Educational Measurement: Issues and Practice, 2017
This article illustrates five different methods for estimating Angoff cut scores using item response theory (IRT) models. These include maximum likelihood (ML), expected a priori (EAP), modal a priori (MAP), and weighted maximum likelihood (WML) estimators, as well as the most commonly used approach based on translating ratings through the test…
Descriptors: Cutting Scores, Item Response Theory, Bayesian Statistics, Maximum Likelihood Statistics
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Liu, Xiaolu; Keating, Xiaofen D.; Shangguan, Rulan – ICHPER-SD Journal of Research, 2017
This study examined changes in China's college student fitness test batteries since its inception in 1954. Using the constant content comparison method, the testing components, testing items and related cut-off values, testing methods, testing results utility, and testing material distribution were examined to identify the salient trends. The…
Descriptors: Foreign Countries, Physical Fitness, Tests, College Students
Susanti, Yuni; Tokunaga, Takenobu; Nishikawa, Hitoshi; Obari, Hiroyuki – Research and Practice in Technology Enhanced Learning, 2017
The present study investigates the best factor for controlling the item difficulty of multiple-choice English vocabulary questions generated by an automatic question generation system. Three factors are considered for controlling item difficulty: (1) reading passage difficulty, (2) semantic similarity between the correct answer and distractors,…
Descriptors: Test Items, Difficulty Level, Computer Assisted Testing, Vocabulary Development
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Teneqexhi, Romeo; Qirko, Margarita; Sharko, Genci; Vrapi, Fatmir; Kuneshka, Loreta – International Association for Development of the Information Society, 2017
Exams assessment is one of the most tedious work for university teachers all over the world. Multiple choice theses make exams assessment a little bit easier, but the teacher cannot prepare more than 3-4 variants; in this case, the possibility of students for cheating from one another becomes a risk for "objective assessment outcome." On…
Descriptors: Testing, Computer Assisted Testing, Test Items, Test Construction
Peer reviewed Peer reviewed
Direct linkDirect link
Cole, Ki Matlock; Turner, Ronna L.; Gitchel, Wallace D. – AERA Online Paper Repository, 2017
This study uses the nominal response model to investigate the effects of extreme response styles. The Zung Self-Rating Anxiety Scale (SAS) is a commonly used scale for the identification of anxiety disorders. In some cases, the response options are not extreme, ranging from "A little of the time" to "Most of the time;" in other…
Descriptors: Self Evaluation (Individuals), Depression (Psychology), Rating Scales, Response Style (Tests)
Yunxiao Chen; Xiaoou Li; Jingchen Liu; Gongjun Xu; Zhiliang Ying – Grantee Submission, 2017
Large-scale assessments are supported by a large item pool. An important task in test development is to assign items into scales that measure different characteristics of individuals, and a popular approach is cluster analysis of items. Classical methods in cluster analysis, such as the hierarchical clustering, K-means method, and latent-class…
Descriptors: Item Analysis, Classification, Graphs, Test Items
Peer reviewed Peer reviewed
Direct linkDirect link
Sinharay, Sandip – Journal of Educational and Behavioral Statistics, 2015
The maximum likelihood estimate (MLE) of the ability parameter of an item response theory model with known item parameters was proved to be asymptotically normally distributed under a set of regularity conditions for tests involving dichotomous items and a unidimensional ability parameter (Klauer, 1990; Lord, 1983). This article first considers…
Descriptors: Item Response Theory, Maximum Likelihood Statistics, Test Items, Ability
Peer reviewed Peer reviewed
Direct linkDirect link
Belov, Dmitry I. – Journal of Educational Measurement, 2015
The statistical analysis of answer changes (ACs) has uncovered multiple testing irregularities on large-scale assessments and is now routinely performed at testing organizations. However, AC data has an uncertainty caused by technological or human factors. Therefore, existing statistics (e.g., number of wrong-to-right ACs) used to detect examinees…
Descriptors: Statistical Analysis, Robustness (Statistics), Identification, Test Items
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Ali, Usama S.; Chang, Hua-Hua; Anderson, Carolyn J. – ETS Research Report Series, 2015
Polytomous items are typically described by multiple category-related parameters; situations, however, arise in which a single index is needed to describe an item's location along a latent trait continuum. Situations in which a single index would be needed include item selection in computerized adaptive testing or test assembly. Therefore single…
Descriptors: Item Response Theory, Test Items, Computer Assisted Testing, Adaptive Testing
Peer reviewed Peer reviewed
Direct linkDirect link
Tannenbaum, Richard J.; Kannan, Priya – Educational Assessment, 2015
Angoff-based standard setting is widely used, especially for high-stakes licensure assessments. Nonetheless, some critics have claimed that the judgment task is too cognitively complex for panelists, whereas others have explicitly challenged the consistency in (replicability of) standard-setting outcomes. Evidence of consistency in item judgments…
Descriptors: Standard Setting (Scoring), Reliability, Scores, Licensing Examinations (Professions)
Peer reviewed Peer reviewed
Direct linkDirect link
McIntosh, James – Scandinavian Journal of Educational Research, 2019
This article examines whether the way that PISA models item outcomes in mathematics affects the validity of its country rankings. As an alternative to PISA methodology a two-parameter model is applied to PISA mathematics item data from Canada and Finland for the year 2012. In the estimation procedure item difficulty and dispersion parameters are…
Descriptors: Foreign Countries, Achievement Tests, Secondary School Students, International Assessment
Peer reviewed Peer reviewed
Direct linkDirect link
Sibulkin, Amy E.; Butler, J. S. – Teaching of Psychology, 2019
After explicit instruction on how to give possible bidirectional (two-way) causality explanations for a correlation, 240 students from eight sections of social psychology and research methods courses wrote "reverse causality" explanations on various test questions, creating a total of 882 answers. Averaging across multiple graded…
Descriptors: Correlation, Causal Models, Research Methodology, Social Psychology
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Bengtsson, Lars – Education Sciences, 2019
This work describes a systematic review of the research on take-home exams in tertiary education. It was found that there is some disagreement in the community about the virtues of take-home exams but also a lot of agreement. It is concluded that take-home exams may be the preferred choice of assessment method on the higher taxonomy levels because…
Descriptors: Testing Problems, Cheating, Thinking Skills, Supervision
Peer reviewed Peer reviewed
Direct linkDirect link
Dutt, Anuradha; Tan, Marilyn; Alagumalai, Sivakumar; Nair, Rahul – Journal of Autism and Developmental Disorders, 2019
Functional Behavior Assessment (FBA) and behavior interventions have been effective in the management of challenging behavior among children with developmental disabilities including autism spectrum disorders. Research suggests the need for valid measurement instruments for verifying, calibrating and scoring competence in FBA and behavior…
Descriptors: Program Development, Program Validation, Functional Behavioral Assessment, Intervention
Peer reviewed Peer reviewed
Direct linkDirect link
Betts, Joe; Muntean, William; Kim, Doyoung; Jorion, Natalie; Dickison, Philip – Journal of Applied Testing Technology, 2019
Clinical judgment has become an increasingly important aspect of modern health service professionals. To ensure public safety, licensure exams must go beyond assessing only knowledge and skills when evaluating entry-level professions to evaluating clinical judgment. This importance necessitates licensure and certification examinations in these…
Descriptors: Decision Making, Licensing Examinations (Professions), Certification, Nursing Education
Pages: 1  |  ...  |  135  |  136  |  137  |  138  |  139  |  140  |  141  |  142  |  143  |  ...  |  636