Sheng, Yanyan – Measurement: Interdisciplinary Research and Perspectives, 2019
The classical approach to test theory has been the foundation for educational and psychological measurement for over 90 years. This approach is concerned with measurement error and hence test reliability, which in part relies on individual test items. The CTT package, developed in light of this, provides functions for test- and item-level analyses of…
Descriptors: Item Response Theory, Test Reliability, Item Analysis, Error of Measurement
Bichi, Ado Abdu; Talib, Rohaya – International Journal of Evaluation and Research in Education, 2018
Testing in an educational system performs a number of functions, and the results from a test can be used to make a number of decisions in education. It is therefore well accepted in the education literature that testing is an important element of education. To effectively utilize the tests in educational policies and quality assurance, its validity and…
Descriptors: Item Response Theory, Test Items, Test Construction, Decision Making
Johnson, Alyce O. – Journal of Psychoeducational Assessment, 2015
The "Parenting Stress Index, Fourth Edition" (PSI-4) is a 120-item measure used to explore parental stress levels considering a parent's relationship with one of his or her children between the ages of 1 month and 12 years. The main purpose of the test is to define these stress levels and from where they originate in order to identify…
Descriptors: Anxiety, Measures (Individuals), Parents, Child Rearing
Longford, Nicholas T. – Journal of Educational and Behavioral Statistics, 2014
A method for medical screening is adapted to differential item functioning (DIF). Its essential elements are explicit declarations of the level of DIF that is acceptable and of the loss function that quantifies the consequences of the two kinds of inappropriate classification of an item. Instead of a single level and a single function, sets of…
Descriptors: Test Items, Test Bias, Simulation, Hypothesis Testing
New Meridian Corporation, 2020
The purpose of this report is to describe the technical qualities of the 2018-2019 operational administration of the English language arts/literacy (ELA/L) and mathematics summative assessments in grades 3 through 8 and high school. The ELA/L assessments focus on reading and comprehending a range of sufficiently complex texts independently and…
Descriptors: Language Arts, Literacy Education, Mathematics Education, Summative Evaluation
New Meridian Corporation, 2020
The purpose of this report is to describe the technical qualities of the 2018-2019 operational administration of the English language arts/literacy (ELA/L) and mathematics assessments in grades 3 through 8 and high school. New Meridian, in coordination with multiple states and vendors, developed an alternate form of the summative assessment to…
Descriptors: Language Arts, Literacy Education, Mathematics Education, Summative Evaluation
Goldhaber, Dan; Chaplin, Duncan – Center for Education Data & Research, 2012
In a provocative and influential paper, Jesse Rothstein (2010) finds that standard value added models (VAMs) suggest implausible future teacher effects on past student achievement, a finding that obviously cannot be viewed as causal. This is the basis of a falsification test (the Rothstein falsification test) that appears to indicate bias in VAM…
Descriptors: School Effectiveness, Teacher Effectiveness, Achievement Gains, Statistical Bias
Beddow, Peter A. – International Journal of Disability, Development and Education, 2012
In the arena of educational testing, accessibility refers to the degree to which students are given the opportunity to participate in and engage a test. Accessibility theory is a model for examining the interactions between the test-taker and the test itself and defining how they may decrease some students' access to the test event, ultimately…
Descriptors: Test Results, Test Items, Educational Testing, Scores
Anderson, Trevor R.; Rogan, John M. – Biochemistry and Molecular Biology Education, 2010
Student assessment is central to the educational process and can be used for multiple purposes, including to promote student learning, to grade student performance, and to evaluate the educational quality of qualifications. It is, therefore, of utmost importance that assessment instruments are of a high quality. In this article, we present various…
Descriptors: Educational Assessment, Educational Quality, Student Evaluation, Educational Research
Salmani-Nodoushan, Mohammad Ali – Journal on Educational Psychology, 2009
A good test is one that has at least three qualities: reliability, or the precision with which a test measures what it is supposed to measure; validity, i.e., if the test really measures what it is supposed to measure; and practicality, or if the test, no matter how sound theoretically, is practicable in reality. These are the sine qua non for any…
Descriptors: Generalizability Theory, Testing, Language Tests, Item Response Theory
Salmani-Nodoushan, Mohammad Ali – Online Submission, 2009
A good test is one that has at least three qualities: reliability, or the precision with which a test measures what it is supposed to measure; validity, i.e., if the test really measures what it is supposed to measure; and practicality, or if the test, no matter how sound theoretically, is practicable in reality. These are the sine qua non for…
Descriptors: Generalizability Theory, Testing, Language Tests, Item Response Theory
Okonkwo, Charity Akuadi – Turkish Online Journal of Distance Education, 2010
This paper first presents an overview of the concepts of assessment and evaluation in an Open and Distance Learning (ODL) environment. The large numbers of students and numerous courses make assessment and evaluation very difficult and an administrative nightmare at Distance Learning (DL) institutions. These challenges informed exploring issues relating…
Descriptors: Distance Education, Sustainability, Evaluation Methods, Educational Strategies
Setzer, J. Carl – GED Testing Service, 2009
The GED® English as a Second Language (GED ESL) Test was designed to serve as an adjunct to the GED test battery when an examinee takes either the Spanish- or French-language version of the tests. The GED ESL Test is a criterion-referenced, multiple-choice instrument that assesses the functional, English reading skills of adults whose first…
Descriptors: Language Tests, High School Equivalency Programs, Psychometrics, Reading Skills
Chong, Sylvia; Cheah, Horn Mun – Australian Journal of Teacher Education, 2009
The purpose of this paper is to introduce an integrated values, skills and knowledge (VSK) framework for initial teacher preparation programmes. The VSK framework articulated, in broad terms, the desired skills and knowledge components for beginning teachers, with the underlying core values permeating the programmes. The paper has two parts, the…
Descriptors: Student Teachers, Values, Teacher Education Programs, Knowledge Base for Teaching
Tracey, Terence J. G.; Sodano, Sandro M. – Career Development Quarterly, 2008
Interest development is not an easily studied process. There are at least 4 methods for examining the process of stability and change over time: relative stability, absolute stability, profile stability, and structural stability. A program of research that focuses on examining these 4 types of stability is summarized relative to the issues…
Descriptors: Vocational Interests, Childhood Interests, Attitude Change, Research Projects