NotesFAQContact Us
Collection
Advanced
Search Tips
Showing all 11 results Save | Export
Sherwin E. Balbuena – Online Submission, 2024
This study introduces a new chi-square test statistic for testing the equality of response frequencies among distracters in multiple-choice tests. The formula uses the information from the number of correct answers and wrong answers, which becomes the basis of calculating the expected values of response frequencies per distracter. The method was…
Descriptors: Multiple Choice Tests, Statistics, Test Validity, Testing
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Aleyna Altan; Zehra Taspinar Sener – Online Submission, 2023
This research aimed to develop a valid and reliable test to be used to detect sixth grade students' misconceptions and errors regarding the subject of fractions. A misconception diagnostic test has been developed that includes the concept of fractions, different representations of fractions, ordering and comparing fractions, equivalence of…
Descriptors: Diagnostic Tests, Mathematics Tests, Fractions, Misconceptions
Parry, James R. – Online Submission, 2020
This paper presents research and provides a method to ensure that parallel assessments, that are generated from a large test-item database, maintain equitable difficulty and content coverage each time the assessment is presented. To maintain fairness and validity it is important that all instances of an assessment, that is intended to test the…
Descriptors: Culture Fair Tests, Difficulty Level, Test Items, Test Validity
Al-Jarf, Reima – Online Submission, 2021
The present study aimed to describe and evaluate the current assessment practices prevalent in the different translation courses offered at the College of Languages and Translation (COLT). A sample of specialized translation final exams in 18 translation subject areas was collected. Each final exam was analyzed in terms of the following: (1) # of…
Descriptors: Translation, Language Tests, Readability, Semitic Languages
Edward Paul Getman – Online Submission, 2020
Despite calls for engaging assessments targeting young language learners (YLLs) between 8 and 13 years old, what makes assessment tasks engaging and how such task characteristics affect measurement quality have not been well studied empirically. Furthermore, there has been a dearth of validity research about technology-enhanced speaking tests for…
Descriptors: English (Second Language), Language Tests, Second Language Learning, Learner Engagement
Toker, Turker; Green, Kathy – Online Submission, 2012
The least squares distance method (LSDM) was used in a cognitive diagnostic analysis of TIMSS (Trends in International Mathematics and Science Study) items administered to 4,498 8th-grade students from seven geographical regions of Turkey, extending analysis of attributes from content to process and skill attributes. Logit item positions were…
Descriptors: Foreign Countries, Least Squares Statistics, Grade 8, Mathematics Tests
Hamzah, Mohd Sahandri Gani; Abdullah, Saifuddin Kumar – Online Submission, 2011
The evaluation of learning is a systematic process involving testing, measuring and evaluation. In the testing step, a teacher needs to choose the best instrument that can test the minds of students. Testing will produce scores or marks with many variations either in homogeneous or heterogeneous forms that will be used to categorize the scores…
Descriptors: Test Items, Item Analysis, Difficulty Level, Testing
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Razi, Salim – Online Submission, 2012
This study presents the processes of developing and establishing reliability and validity of a reading test by administering an integrative approach as conventional reliability and validity measures superficially reveals the difficulty of a reading test. In this respect, analysing vocabulary frequency of the test is regarded as a more eligible way…
Descriptors: Foreign Countries, Undergraduate Students, Reading Tests, Test Validity
Tristan, Agustin; Vidal, Rafael – Online Submission, 2007
Wright and Stone had proposed three features to assess the quality of the distribution of the items difficulties in a test, on the so called "most probable response map": line, stack and gap. Once a line is accepted as a design model for a test, gaps and stacks are practically eliminated, producing an evidence of the "scale…
Descriptors: Test Validity, Models, Difficulty Level, Test Items
McCowan, Richard J.; McCowan, Sheila C. – Online Submission, 1999
This paper describes major concepts related to item analysis for criterion-referenced tests including validity, reliability, item difficulty, and item discrimination, particularly in relation to criterion-referenced tests. The paper discussed how these concepts can be used to revise and improve items and listed suggestions regarding general…
Descriptors: Criterion Referenced Tests, Standard Setting, Item Analysis, Item Response Theory
Abdellah, Antar Solhy – Online Submission, 2007
The study reviews translation validated tests and proposes a process-oriented translation test for assessing basic translation skills for freshmen English majors at the faculty of Education. The proposed test is developed based on the process approach to translating and translation teaching, and is confined to translation from English to Arabic.…
Descriptors: Majors (Students), Student Attitudes, Semitic Languages, Answer Keys