Pawade, Yogesh R.; Diwase, Dipti S. – Journal of Educational Technology, 2016
Item analysis of Multiple Choice Questions (MCQs) is the process of collecting, summarizing and utilizing information from students' responses to evaluate the quality of test items. Difficulty Index (p-value), Discrimination Index (DI) and Distractor Efficiency (DE) are the parameters which help to evaluate the quality of MCQs used in an…
Descriptors: Test Items, Item Analysis, Multiple Choice Tests, Curriculum Development
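As an illustration of the three parameters named in the abstract above, the following is a minimal sketch and not the authors' procedure. It assumes commonly used conventions that the abstract itself does not state: the difficulty index is the proportion of examinees answering correctly, the discrimination index compares the top and bottom 27% of examinees ranked by total score, and a distractor counts as functional when at least 5% of examinees choose it. The function name `item_analysis`, its parameters, and the sample data are hypothetical.

```python
from collections import Counter


def item_analysis(responses, totals, key, options="ABCD",
                  group_frac=0.27, functional_min=0.05):
    """Analyse one multiple-choice item.

    responses: option chosen by each examinee for this item
    totals:    each examinee's total test score (used for ranking)
    key:       the correct option
    The 27% grouping and 5% threshold are assumed conventions.
    """
    n = len(responses)

    # Difficulty index (p-value): proportion of examinees answering correctly.
    p = sum(r == key for r in responses) / n

    # Rank examinees by total score and take the upper and lower groups.
    k = max(1, round(group_frac * n))
    order = sorted(range(n), key=lambda i: totals[i], reverse=True)
    upper, lower = order[:k], order[-k:]

    # Discrimination index: difference in correct answers between the groups,
    # divided by the group size.
    high = sum(responses[i] == key for i in upper)
    low = sum(responses[i] == key for i in lower)
    di = (high - low) / k

    # Distractor efficiency: share of distractors chosen by at least
    # functional_min of all examinees.
    counts = Counter(responses)
    distractors = [o for o in options if o != key]
    functional = [o for o in distractors if counts[o] / n >= functional_min]
    de = len(functional) / len(distractors)

    return p, di, de


# Hypothetical data: ten examinees, already listed from highest to lowest score.
responses = ["A", "A", "A", "B", "A", "C", "A", "B", "C", "B"]
totals = [10, 9, 8, 7, 6, 5, 4, 3, 2, 1]
p, di, de = item_analysis(responses, totals, key="A")
print(f"difficulty={p:.2f} discrimination={di:.2f} distractor_efficiency={de:.2f}")
```

With this made-up data, half of the examinees answer correctly, all of the upper group and none of the lower group choose the key, and option D attracts nobody, so only two of the three distractors are functional.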
Bendulo, Hermabeth O.; Tibus, Erlinda D.; Bande, Rhodora A.; Oyzon, Voltaire Q.; Milla, Norberto E.; Macalinao, Myrna L. – International Journal of Evaluation and Research in Education, 2017
Testing or evaluation in an educational context is primarily used to measure or evaluate and authenticate the academic readiness, learning advancement, acquisition of skills, or instructional needs of learners. This study tried to determine whether the varied combinations of arrangements of options and letter cases in a Multiple-Choice Test (MCT)…
Descriptors: Test Format, Multiple Choice Tests, Test Construction, Eye Movements
Oliveri, María Elena; von Davier, Alina A. – International Journal of Testing, 2016
In this study, we propose that the unique needs and characteristics of linguistic minorities should be considered throughout the test development process. Unlike most measurement invariance investigations in the assessment of linguistic minorities, which typically are conducted after test administration, we propose strategies that focus on the…
Descriptors: Psychometrics, Linguistics, Test Construction, Testing
Golovachyova, Viktoriya N.; Menlibekova, Gulbakhyt Zh.; Abayeva, Nella F.; Ten, Tatyana L.; Kogaya, Galina D. – International Journal of Environmental and Science Education, 2016
Using computer-based monitoring systems that rely on tests could be the most effective way of knowledge evaluation. The problem of objective knowledge assessment by means of testing takes on a new dimension in the context of new paradigms in education. The analysis of the existing test methods enabled us to conclude that tests with selected…
Descriptors: Expertise, Computer Assisted Testing, Student Evaluation, Knowledge Level
Stichter, Janine Peck; Herzog, Melissa J.; O'Connor, Karen V.; Schmidt, Carla – Assessment for Effective Intervention, 2012
Individuals with Pervasive Developmental Disorders (PDD) have social competence impairments that can result in negative adult outcomes. Despite considerable research on social skills training, little is available to evaluate these programs. This study describes the development, administration, and utility of a progress-monitoring tool for…
Descriptors: Pervasive Developmental Disorders, Interpersonal Competence, Intervention, Progress Monitoring
Leighton, Jacqueline P.; Gokiert, Rebecca J.; Cor, M. Ken; Heffernan, Colleen – Assessment in Education: Principles, Policy & Practice, 2010
Classroom teachers are in the front line of introducing students to formal learning, including assessments, which can be assumed to continue for students should they extend their schooling past the expected mandatory 12 years. The purpose of the present investigation was to survey secondary teachers' beliefs of classroom and large-scale tests for…
Descriptors: Measures (Individuals), Learning Processes, Test Construction, Teacher Attitudes
Kirnan, Jean Powell; Edler, Erin; Carpenter, Allison – International Journal of Testing, 2007
The range of response options has been shown to influence the answers given in self-report instruments that measure behaviors ranging from television viewing to sexual partners. The current research extends this line of inquiry to 36 quantitative items extracted from a biographical inventory used in personnel selection. A total of 92…
Descriptors: Personnel Selection, Biographical Inventories, Testing, Self Disclosure (Individuals)

Echternacht, Gary – Educational and Psychological Measurement, 1975
Estimates for the variances of empirically determined scoring weights are given. It is also shown that test item writers should write distractors that discriminate on the criterion variable when this type of scoring is used. (Author)
Descriptors: Scoring, Statistical Analysis, Test Construction, Test Reliability

Henning, Grant – Language Testing, 1992
This simulation study considered the effects on statistical measures of test dimensionality that result from systematic sampling variation in both a single- and a double-trait assessment model. Results suggest that there are distinct psychological and psychometric states of test dimensionality, and that psychometric unidimensionality may be…
Descriptors: Construct Validity, Language Tests, Psycholinguistics, Psychometrics
Pinsky, Paul D. – 1970
Developing a student testing mathematical model for instructional management purposes necessitates clear structuring of the curriculum materials involved, whether designated in the domain of content or the dimension of concepts or skills. Such structuring of a course written in performance objectives is presented and noted to be helpful in making…
Descriptors: Administration, Behavioral Objectives, Conferences, Instructional Improvement
McKinley, Mark B.; Lorion, James E. – 1975
The purpose of this study was to determine whether answer sheet design, particularly a self-scoring answer sheet, was a differential variable of test anxiety. Data for the study were gathered from pre- and post-test anxiety measures administered in conjunction with an in-class psychology exam. Students in the control group used conventional IBM…
Descriptors: Answer Sheets, Anxiety, Feedback, Higher Education
Woodson, M. I. Charles E.
It has been argued that item variance and test variance are not necessary characteristics for criterion-referenced tests, although they are necessary for norm-referenced tests. This position is in error because it considers sample statistics as the criteria for evaluating items and tests. Within a particular sample, an item or test may have no…
Descriptors: Criterion Referenced Tests, Evaluation Criteria, Item Analysis, Item Sampling

Williams, Janet L. – RSR: Reference Services Review, 2000
Discusses the basic concepts of testing and item development and the application of alternative assessments to information literacy content for library instruction. Topics include reliability; validity; statistical analysis; selected response, including checklists, rank order, or simple match; constructed response; essays; and complex assessments.…
Descriptors: Essays, Evaluation Methods, Information Literacy, Library Instruction

Levin, Joel R. – Journal of Educational Measurement, 1975
A set procedure developed in this study is useful in determining sample size, based on specification of linear contrasts involving certain formula treatments. (Author/DEP)
Descriptors: Analysis of Variance, Comparative Analysis, Mathematical Models, Measurement Techniques
Pyrczak, Fred, Jr. – 1972
The basic objective of the study was to determine the validity of four new indices of item quality. Three of these were based on analyses of differential, empirical weights for item choices, and the fourth was designed to measure the relative attractiveness of distracters. A secondary objective was to ascertain the validity of the conventional…
Descriptors: College Students, Evaluation, Item Analysis, Measurement Techniques