Chen, Haiwen; Holland, Paul – Educational Testing Service, 2009
In this paper, we develop a new chained equipercentile equating procedure for the nonequivalent groups with anchor test (NEAT) design under the assumptions of the classical test theory model. This new equating is named chained true score equipercentile equating. We also apply the kernel equating framework to this equating design, resulting in a…
Descriptors: True Scores, Equated Scores, Test Theory, Methods
Gomez, Laura E.; Arias, Benito; Verdugo, Miguel Angel; Navas, Patricia – Journal of Intellectual & Developmental Disability, 2012
Background: Most instruments that assess quality of life have been validated by means of the classical test theory (CTT). However, CTT limitations have resulted in the development of alternative models, such as the Rasch rating scale model (RSM). The main goal of this paper is testing and improving the psychometric properties of the INTEGRAL…
Descriptors: Evidence, Models, Mental Retardation, Quality of Life
Winchell, Brooke – ProQuest LLC, 2011
The purpose of the study was to (a) examine the psychometric properties of The Assessment, Evaluation, and Programming System for Infants and Children (AEPS Test); (b) provide a process for establishing psychometric properties for other Curriculum Based Assessments (CBAs); and (c) identify and guide evaluation and subsequent revisions of the AEPS…
Descriptors: Curriculum Based Assessment, Psychometrics, Item Response Theory, Test Theory
Wallace, Colin S.; Prather, Edward E.; Duncan, Douglas K. – Astronomy Education Review, 2011
This is the first in a series of five articles describing a national study of general education astronomy students' conceptual and reasoning difficulties with cosmology. In this paper, we describe the process by which we designed four new surveys to assess general education astronomy students' conceptual cosmology knowledge. These surveys focused…
Descriptors: General Education, Astronomy, Surveys, Evolution
Almehrizi, Rashid S. – Applied Psychological Measurement, 2013
The majority of large-scale assessments develop various score scales that are either linear or nonlinear transformations of raw scores for better interpretations and uses of assessment results. The current formula for coefficient alpha (α; the commonly used reliability coefficient) only provides internal consistency reliability estimates of raw…
Descriptors: Raw Scores, Scaling, Reliability, Computation
Bandalos, Deborah L.; Kopp, Jason P. – Educational Measurement: Issues and Practice, 2012
In this article, we discuss the importance of measurement literacy and some issues encountered in teaching introductory measurement courses. We present results from a survey of introductory measurement instructors, including information about the topics included in such courses and the amount of time spent on each. Topics that were included by the…
Descriptors: Class Activities, Motivation Techniques, Item Analysis, Test Theory
Fang, Jiqian; Power, Mick; Lin, Yueqing; Zhang, Jinxin; Hao, Yuantao; Chatterji, Somnath – Gerontologist, 2012
Purpose of the study: To explore short-form versions, with acceptable psychometric properties, of the World Health Organization Quality of Life instrument for older adults (WHOQOL-OLD), which was developed by the WHOQOL research group and initially contained 24 items. Design and Methods: We randomly sampled two-thirds of respondents from the data of WHOQOL-OLD field…
Descriptors: Quality of Life, Test Reliability, Correlation, Psychometrics
Lyren, Per-Erik – Practical Assessment, Research & Evaluation, 2009
The added value of reporting subscores on a college admission test (SweSAT) was examined in this study. Using a CTT-derived objective method for determining the value of reporting subscores, it was concluded that there is added value in reporting section scores (Verbal/Quantitative) as well as subtest scores. These results differ from a study of…
Descriptors: College Entrance Examinations, Scores, Test Theory, Foreign Countries
Mahon, Catherine; Lyddy, Fiona; Barnes-Holmes, Dermot – Journal of Applied Behavior Analysis, 2010
The purpose of the current study was to develop and test a computerized matching-to-sample (MTS) protocol to facilitate recombinative generalization of subword units (onsets and rimes) and recognition of novel onset-rime and onset-rime-rime words. In addition, we sought to isolate the key training components necessary for recombinative…
Descriptors: Rhyme, Generalization, Reading Instruction, Behavior
Shahat, Mohamed A.; Ohle, Annika; Treagust, David F.; Fischer, Hans E. – International Journal of Science and Mathematics Education, 2013
Educators and policymakers envision the future of education in Egypt as enabling learners to acquire scientific inquiry and problem-solving skills. In this article, we describe the validation of a model for problem solving and the design of instruments for evaluating new teaching methods in Egyptian science classes. The instruments were based on…
Descriptors: Foreign Countries, Questionnaires, Problem Solving, Science Instruction
Kiddle, Thom; Kormos, Judit – Language Assessment Quarterly, 2011
This article reports on a study conducted with 42 participants from a Chilean university, which aimed to determine the effect of mode of response on test performance and test-taker perception of test features by comparing a semidirect online version and a direct face-to-face version of a speaking test. Candidate performances on both test versions…
Descriptors: Student Attitudes, Test Theory, Foreign Countries, Evaluation Methods
Shiu, William – ProQuest LLC, 2012
This study examined the Flynn Effect (FE; i.e., the rise in IQ scores over time) in Estonia from Scale B of the National Intelligence Test using both classical test theory (CTT) and item response theory (IRT) methods. Secondary data from two cohorts (1934, n = 890 and 2006, n = 913) of students were analyzed, using both classical test theory (CTT)…
Descriptors: Foreign Countries, Intelligence Tests, Intelligence Quotient, Change
Jacob, Robin Tepper; Jacob, Brian – Journal of Research on Educational Effectiveness, 2012
Teacher and principal surveys are among the most common data collection techniques employed in education research. Yet there is remarkably little research on survey methods in education, or about the most cost-effective way to raise response rates among teachers and principals. In an effort to explore various methods for increasing survey response…
Descriptors: Principals, Data Collection, Test Theory, Response Rates (Questionnaires)
Puhan, Gautam; Sinharay, Sandip; Haberman, Shelby; Larkin, Kevin – Applied Measurement in Education, 2010
Will subscores provide information beyond what is provided by the total score? Is there a method that can estimate more trustworthy subscores than observed subscores? To answer the first question, this study evaluated whether the true subscore was more accurately predicted by the observed subscore or total score. To answer the second…
Descriptors: Licensing Examinations (Professions), Scores, Computation, Methods
Daly, Anthony L.; Baird, Jo-Anne; Chamberlain, Suzanne; Meadows, Michelle – Curriculum Journal, 2012
This paper describes an exploration into a reform of the A-level qualification in England in 2008; namely, the introduction of the "stretch and challenge" policy. This policy was initiated by the exams regulator and determined that exam papers should be redesigned to encourage the application of higher order thinking skills, both in the…
Descriptors: Test Preparation, Student Evaluation, Student Attitudes, Educational Change