Publication Date
In 2025 | 0 |
Since 2024 | 1 |
Since 2021 (last 5 years) | 1 |
Since 2016 (last 10 years) | 9 |
Since 2006 (last 20 years) | 14 |
Descriptor
Scoring Formulas | 66 |
Test Reliability | 66 |
Test Validity | 66 |
Multiple Choice Tests | 21 |
Test Construction | 19 |
Guessing (Tests) | 16 |
Scoring | 15 |
Test Interpretation | 13 |
Weighted Scores | 13 |
Item Analysis | 12 |
Measurement Techniques | 12 |
More ▼ |
Source
Author
Echternacht, Gary | 3 |
Hambleton, Ronald K. | 3 |
Rippey, Robert M. | 3 |
Frary, Robert B. | 2 |
Hakstian, A. Ralph | 2 |
Kansup, Wanlop | 2 |
Reilly, Richard R. | 2 |
Traub, Ross E. | 2 |
Acar, Selcuk | 1 |
Aghbar, Ali-Asghar | 1 |
Ahmed, Ayesha | 1 |
More ▼ |
Publication Type
Reports - Research | 32 |
Journal Articles | 21 |
Tests/Questionnaires | 3 |
Guides - Non-Classroom | 2 |
Reports - Descriptive | 2 |
Reports - Evaluative | 2 |
Speeches/Meeting Papers | 2 |
Guides - Classroom - Teacher | 1 |
Guides - General | 1 |
Education Level
Higher Education | 4 |
Postsecondary Education | 4 |
Elementary Education | 2 |
Secondary Education | 2 |
Elementary Secondary Education | 1 |
High Schools | 1 |
Junior High Schools | 1 |
Middle Schools | 1 |
Audience
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Steven Holtzman; Jonathan Steinberg; Jonathan Weeks; Christopher Robertson; Jessica Findley; David Klieger – ETS Research Report Series, 2024
At a time when institutions of higher education are exploring alternatives to traditional admissions testing, institutions are also seeking to better support students and prepare them for academic success. Under such an engaged model, one may seek to measure not just the accumulated knowledge and skills that students would bring to a new academic…
Descriptors: Law Schools, College Applicants, Legal Education (Professions), College Entrance Examinations
Wagaman, John; Fletcher, Michael – Teaching Statistics: An International Journal for Teachers, 2018
This article considers how a handicapping system should be devised for squash. It looks at the American scoring system, and whether it is possible to have a fair system of handicapping. We consider "fair" from a perspective of expected number of rallies won and probability of winning.
Descriptors: Probability, Athletes, Athletics, Inhibition
Feldman, Jo – Educational Leadership, 2018
Have teachers become too dependent on points? This article explores educators' dependency on their points systems, and the ways that points can distract teachers from really analyzing students' capabilities and achievements. Feldman argues that using a more subjective grading system can help illuminate crucial information about students and what…
Descriptors: Grading, Evaluation Methods, Evaluation Criteria, Achievement Rating
Yun, Young Ho; Kim, Yaeji; Sim, Jin A.; Choi, Soo Hyuk; Lim, Cheolil; Kang, Joon-ho – Journal of School Health, 2018
Background: The objective of this study was to develop the School Health Score Card (SHSC) and validate its psychometric properties. Methods: The development of the SHSC questionnaire included 3 phases: item generation, construction of domains and items, and field testing with validation. To assess the instrument's reliability and validity, we…
Descriptors: School Health Services, Psychometrics, Test Construction, Test Validity
Lee, Minji K.; Sweeney, Kevin; Melican, Gerald J. – Educational Assessment, 2017
This study investigates the relationships among factor correlations, inter-item correlations, and the reliability estimates of subscores, providing a guideline with respect to psychometric properties of useful subscores. In addition, it compares subscore estimation methods with respect to reliability and distinctness. The subscore estimation…
Descriptors: Scores, Test Construction, Test Reliability, Test Validity
Lim, Chang Kuan; Eng, Lin Siew; Mohamed, Abdul Rashid; Ismail, Shaik Abdul Malik Mohamed – English Language Teaching, 2018
The purpose of the study is to have a relook at the ESL reading comprehension assessment system for Malaysian Year Five students. Traditionally, the ESL teachers have been assessing and reporting on their primary year's students by merely giving a composite grade with some vague remarks. This process has been used and is still being employed in…
Descriptors: Foreign Countries, Elementary Schools, English (Second Language), Second Language Instruction
Burger, Roland – Higher Education: The International Journal of Higher Education Research, 2017
The purpose of this study is to examine the effects of assessment method (essays vs. examinations) and instruction method (seminars vs. lectures) on student perceptions of the fairness of the assessment process. Department-specific combinations of these factors give a unique profile to the assessment process and to the way students interact with…
Descriptors: Educational Environment, Grading, Evaluation Criteria, Scoring Formulas
Docktor, Jennifer L.; Dornfeld, Jay; Frodermann, Evan; Heller, Kenneth; Hsu, Leonardo; Jackson, Koblar Alan; Mason, Andrew; Ryan, Qing X.; Yang, Jie – Physical Review Physics Education Research, 2016
Problem solving is a complex process valuable in everyday life and crucial for learning in the STEM fields. To support the development of problem-solving skills it is important for researchers and curriculum developers to have practical tools that can measure the difference between novice and expert problem-solving performance in authentic…
Descriptors: Introductory Courses, Physics, Problem Solving, Scoring Rubrics
Temel, Gülhan Orekici; Erdogan, Semra; Selvi, Hüseyin; Kaya, Irem Ersöz – Educational Sciences: Theory and Practice, 2016
Studies based on longitudinal data focus on the change and development of the situation being investigated and allow for examining cases regarding education, individual development, cultural change, and socioeconomic improvement in time. However, as these studies require taking repeated measures in different time periods, they may include various…
Descriptors: Investigations, Sample Size, Longitudinal Studies, Interrater Reliability
Gafoor, K. Abdul; Naseer, A. R. – Online Submission, 2015
With a view to support instruction, formative and summative assessment and to provide model handwriting performance for students to compare their own performance, a Malayalam handwriting scale is developed. Data from 2640 school students belonging to Malappuram, Palakkad and Kozhikode districts, sampled by taking 240 students per each grade…
Descriptors: Formative Evaluation, Summative Evaluation, Handwriting, Performance Based Assessment
Runco, Mark A.; Acar, Selcuk – Creativity Research Journal, 2012
Divergent thinking (DT) tests are very often used in creativity studies. Certainly DT does not guarantee actual creative achievement, but tests of DT are reliable and reasonably valid predictors of certain performance criteria. The validity of DT is described as reasonable because validity is not an all-or-nothing attribute, but is, instead, a…
Descriptors: Creativity, Creative Activities, Creative Thinking, Test Validity
Ahmed, Ayesha; Pollitt, Alastair – Assessment in Education: Principles, Policy & Practice, 2011
At the heart of most assessments lies a set of questions, and those who write them must achieve "two" things. Not only must they ensure that each question elicits the kind of performance that shows how "good" pupils are at the subject, but they must also ensure that each mark scheme gives more marks to those who are…
Descriptors: Academic Achievement, Classification, Educational Quality, Quality Assurance

Holmes, Roy A.; And Others – Educational and Psychological Measurement, 1974
Descriptors: Chemistry, Multiple Choice Tests, Scoring Formulas, Test Reliability
Stricker, Lawrence J.; Rock, Donald A. – ETS Research Report Series, 2008
This study assessed the invariance in the factor structure of the "Test of English as a Foreign Language"™ Internet-based test (TOEFL® iBT) across subgroups of test takers who differed in native language and exposure to the English language. The subgroups were defined by (a) Indo-European and Non-Indo-European language family, (b)…
Descriptors: Factor Structure, English (Second Language), Language Tests, Computer Assisted Testing

Reilly, Richard R. – Educational and Psychological Measurement, 1975
Because previous reports have suggested that the lowered validity of tests scored with empirical option weights might be explained by a capitalization of the keying procedures on omitting tendencies, a procedure was devised to key options empirically with a "correction-for-guessing" constraint. (Author)
Descriptors: Achievement Tests, Graduate Study, Guessing (Tests), Scoring Formulas