Showing 4,066 to 4,080 of 9,552 results
Kim, Sooyeon; Walker, Michael E. – Educational Testing Service, 2011
This study examines the use of subpopulation invariance indices to evaluate the appropriateness of using a multiple-choice (MC) item anchor in mixed-format tests, which include both MC and constructed-response (CR) items. Linking functions were derived in the nonequivalent groups with anchor test (NEAT) design using an MC-only anchor set for 4…
Descriptors: Test Format, Multiple Choice Tests, Test Items, Gender Differences
Peer reviewed
Kettler, Ryan J.; Rodriguez, Michael C.; Bolt, Daniel M.; Elliott, Stephen N.; Beddow, Peter A.; Kurz, Alexander – Applied Measurement in Education, 2011
Federal policy on alternate assessment based on modified academic achievement standards (AA-MAS) inspired this research. Specifically, an experimental study was conducted to determine whether tests composed of modified items would have the same level of reliability as tests composed of original items, and whether these modified items helped reduce…
Descriptors: Multiple Choice Tests, Test Items, Alternative Assessment, Test Reliability
Peer reviewed
Wyse, Adam E. – Educational and Psychological Measurement, 2011
Standard setting is a method used to set cut scores on large-scale assessments. One of the most popular standard setting methods is the Bookmark method. In the Bookmark method, panelists are asked to envision a response probability (RP) criterion and move through a booklet of ordered items based on a RP criterion. This study investigates whether…
Descriptors: Testing Programs, Standard Setting (Scoring), Cutting Scores, Probability
Peer reviewed
Papinczak, Tracey; Babri, Awais Saleem; Peterson, Ray; Kippers, Vaughan; Wilkinson, David – Advances in Health Sciences Education, 2011
Assessment partnerships between staff and students are considered a vital component of the student-centred educational process. To enhance the development of this partnership in a problem-based learning curriculum, all first-year students were involved in generating a bank of formative assessment questions with answers, some of which were included…
Descriptors: Formative Evaluation, Problem Based Learning, Teaching Methods, Tests
Peer reviewed
Reimann, Nicola – Assessment & Evaluation in Higher Education, 2011
This case study explores students' perceptions of seen examination questions about topics not covered by the formal curriculum of a final-year economics module and of the associated group support sessions. Eight semi-structured interviews with a total of 13 students were analysed. Contrary to expectations, learners taking a strategic approach to…
Descriptors: College Seniors, Case Studies, Student Attitudes, Economics Education
Peer reviewed
Lee, Lay Wah; Wheldall, Kevin – Dyslexia, 2011
Malay is a consistent alphabetic orthography with complex syllable structures. The focus of this research was to investigate word recognition performance in order to inform reading interventions for low-progress early readers. Forty-six Grade 1 students were sampled and 11 were identified as low-progress readers. The results indicated that both…
Descriptors: Indonesian Languages, Word Recognition, Grade 1, Elementary School Students
Fu, Qiong – ProQuest LLC, 2010
This research investigated how the accuracy of person ability and item difficulty parameter estimation varied across five IRT models with respect to the presence of guessing, targeting, and varied combinations of sample sizes and test lengths. The data were simulated with 50 replications under each of the 18 combined conditions. Five IRT models…
Descriptors: Item Response Theory, Guessing (Tests), Accuracy, Computation
Laborda, Jesus Garcia; Bakieva, Margarita; Gonzalez-Such, Jose; Pavon, Ana Sevilla – Online Submission, 2010
Because the Spanish educational system is changing and promoting the use of online tests, it was necessary to study the transformation of test items in the "Spanish University Entrance Examination" (IB P.A.U.) so that the change in test delivery (computerization) would affect the current model as little as possible. The…
Descriptors: Foreign Countries, College Entrance Examinations, Computer Assisted Testing, Test Items
Peer reviewed
Miller, Tess; Chahine, Saad; Childs, Ruth A. – Practical Assessment, Research & Evaluation, 2010
This study illustrates the use of differential item functioning (DIF) and differential step functioning (DSF) analyses to detect differences in item difficulty that are related to experiences of examinees, such as their teachers' instructional practices, that are relevant to the knowledge, skill, or ability the test is intended to measure. This…
Descriptors: Test Bias, Difficulty Level, Test Items, Mathematics Tests
Peer reviewed
Wang, Wen-Chung; Jin, Kuan-Yu – Applied Psychological Measurement, 2010
In this study, all the advantages of slope parameters, random weights, and latent regression are acknowledged when dealing with component and composite items by adding slope parameters and random weights into the standard item response model with internal restrictions on item difficulty and formulating this new model within a multilevel framework…
Descriptors: Test Items, Difficulty Level, Regression (Statistics), Generalization
Peer reviewed
Tseng, Mei-Hui; Fu, Chung-Pei; Wilson, Brenda N.; Hu, Fu-Chang – Research in Developmental Disabilities: A Multidisciplinary Journal, 2010
The aim of this study was to adapt and evaluate the Developmental Coordination Disorder Questionnaire (DCDQ) for use in Chinese-speaking countries. A total of 1082 parents completed the DCDQ and 35 parents repeated it after 2 weeks for test-retest reliability. Two items were deleted after examination of test consistency. Cronbach's α for the…
Descriptors: Test Validity, Measures (Individuals), Psychometrics, Probability
Peer reviewed
Hirschel, Michael J.; Schulenberg, Stefan E. – Psychological Assessment, 2010
One measure commonly used to assess posttraumatic stress disorder is the PTSD Checklist (PCL). Lang and Stein (2005) extracted 4 subsets of PCL items, validating 2 of them for possible use in screening in primary care settings. The viability of the 4 item subsets was evaluated psychometrically in the present study with a sample of Hurricane…
Descriptors: Check Lists, Natural Disasters, Posttraumatic Stress Disorder, Psychometrics
Peer reviewed
Lin, Chuan-Ju – Journal of Technology, Learning, and Assessment, 2010
Assembling equivalent test forms with minimal test overlap across forms is important in ensuring test security. Chen and Lei (2009) suggested an exposure control technique to control test overlap-ordered item pooling on the fly based on the essence that test overlap rate--ordered item pooling for the first t examinees is a function of test overlap…
Descriptors: Test Length, Test Format, Evaluation Criteria, Psychometrics
Peer reviewed
Hill, Yao Zhang; Liu, Ou Lydia – ETS Research Report Series, 2012
This study investigated the effect of the interaction between test takers' background knowledge and language proficiency on their performance on the "TOEFL iBT"® reading section. Test takers with the target content background knowledge (the focal groups) and those without (the reference groups) were identified for each of the 5 selected…
Descriptors: Language Tests, Second Language Learning, English (Second Language), Internet
Peer reviewed
Jordan, Eoin – Language Testing in Asia, 2012
This article examines the issue of cognates in frequency-based vocabulary size testing. Data from a pilot study for a cognate-controlled English vocabulary size test was used to assess whether a group of Japanese university English learners (n = 60) were more successful at responding to cognate items than noncognate ones in three 1000 word…
Descriptors: English (Second Language), Second Language Learning, College Students, Foreign Countries