Practices in Instrument Use and Development in "Chemistry Education Research and Practice" 2010-2021
Lazenby, Katherine; Tenney, Kristin; Marcroft, Tina A.; Komperda, Regis – Chemistry Education Research and Practice, 2023
Assessment instruments that generate quantitative data on attributes (cognitive, affective, behavioral, "etc.") of participants are commonly used in the chemistry education community to draw conclusions in research studies or inform practice. Recently, articles and editorials have stressed the importance of providing evidence for the…
Descriptors: Chemistry, Periodicals, Journal Articles, Science Education
Miciak, Jeremy; Taylor, W. Pat; Stuebing, Karla K.; Fletcher, Jack M. – Journal of Psychoeducational Assessment, 2018
We investigated the classification accuracy of learning disability (LD) identification methods premised on the identification of an intraindividual pattern of processing strengths and weaknesses (PSW) method using multiple indicators for all latent constructs. Known LD status was derived from latent scores; values at the observed level identified…
Descriptors: Accuracy, Learning Disabilities, Classification, Identification
Powers, Sonya; Li, Dongmei; Suh, Hongwook; Harris, Deborah J. – ACT, Inc., 2016
ACT reporting categories and ACT Readiness Ranges are new features added to the ACT score reports starting in fall 2016. For each reporting category, the number correct score, the maximum points possible, the percent correct, and the ACT Readiness Range, along with an indicator of whether the reporting category score falls within the Readiness…
Descriptors: Scores, Classification, College Entrance Examinations, Error of Measurement
Bayazidi, Aso; Saeb, Fateme – Advances in Language and Literary Studies, 2017
This study examined the equivalence and reliability of the two versions of the Vocabulary Levels Test in an Iranian context. This study was motivated by the fact that the Vocabulary Levels test is increasingly being used in Iran for both research and pedagogical purposes without having been checked for validity and reliability in this context. The…
Descriptors: Foreign Countries, Vocabulary, English (Second Language), College Second Language Programs
Ishigami, Yoko; Klein, Raymond M. – Journal of Cognition and Development, 2015
The current study examined the robustness, stability, reliability, and isolability of the attention network scores (alerting, orienting, and executive control) when young children experienced repeated administrations of the child version of the Attention Network Test (ANT; Rueda et al., 2004). Ten test sessions of the ANT were administered to 12…
Descriptors: Measurement, Attention, Scores, Executive Function
Rios, Joseph A.; Liu, Ou Lydia – American Journal of Distance Education, 2017
Online higher education institutions are presented with the concern of how to obtain valid results when administering student learning outcomes (SLO) assessments remotely. Traditionally, there has been a great reliance on unproctored Internet test administration (UIT) due to increased flexibility and reduced costs; however, a number of validity…
Descriptors: Online Courses, Testing, Test Wiseness, Academic Achievement
Öz, Hüseyin; Özturan, Tuba – Journal of Language and Linguistic Studies, 2018
This article reports the findings of a study that sought to investigate whether computer-based vs. paper-based test-delivery mode has an impact on the reliability and validity of an achievement test for a pedagogical content knowledge course in an English teacher education program. A total of 97 university students enrolled in the English as a…
Descriptors: Computer Assisted Testing, Testing, Test Format, Teaching Methods
Satake, Eike – Journal of Mathematics Education at Teachers College, 2015
This cross-cultural study investigated the relationship between attitudes toward statistics (ATS) and course achievement (CA) among Japanese college students. The sample consisted of 135 male and 134 female students from the first two-year liberal arts program of a four-year college in Tokyo, Japan. Attitudes about statistics were measured using…
Descriptors: Foreign Countries, College Students, Statistics, Validity
Baker, Bruce D.; Oluwole, Joseph O.; Green, Preston C., III – Education Policy Analysis Archives, 2013
In this article, we explain how overly prescriptive, rigid state statutory and regulatory policy frameworks regarding teacher evaluation, tenure and employment decisions outstrip the statistical reliability and validity of proposed measures of teaching effectiveness. We begin with a discussion of the emergence of highly prescriptive state…
Descriptors: Teacher Evaluation, Teacher Effectiveness, Teacher Employment, Tenure
Crossley, Scott; Clevinger, Amanda; Kim, YouJin – Language Assessment Quarterly, 2014
There has been a growing interest in the use of integrated tasks in the field of second language testing to enhance the authenticity of language tests. However, the role of text integration in test takers' performance has not been widely investigated. The purpose of the current study is to examine the effects of text-based relational (i.e.,…
Descriptors: Language Proficiency, Connected Discourse, Language Tests, English (Second Language)
Lu, Chia-Chen; Luh, Ding-Bang – Creativity Research Journal, 2012
Although previous studies have attempted to use different experiences of raters to rate product creativity by adopting the Consensus Assessment Method (CAT) approach, the validity of replacing CAT with another measurement tool has not been adequately tested. This study aimed to compare raters with different levels of experience (expert vs.…
Descriptors: Creativity, Interrater Reliability, Construct Validity, Comparative Analysis
Okada, Alexandra; Scott, Peter; Mendonça, Murilo – Open Praxis, 2015
The challenge of assessing formal and informal online learning at scale includes various issues. Many universities that are now promoting "Massive Online Open Courses" (MOOC), for instance, focus on relatively informal assessment of participant competence, which is not highly "quality assured". This paper reports best…
Descriptors: Videoconferencing, Internet, Online Courses, Large Group Instruction
May, Henry; Cole, Russell; Haimson, Josh; Perez-Johnson, Irma – Society for Research on Educational Effectiveness, 2010
The purpose of this study is to provide empirical benchmarks of the conditional reliabilities of state tests for samples of the student population defined by ability level. Given that many educational interventions are targeted for samples of low performing students, schools, or districts, the primary goal of this research is to determine how…
Descriptors: Intervention, Statistical Analysis, Academic Achievement, Test Reliability
Chidi, Christopher O.; Shadare, Oluseyi A. – Journal of International Education Research, 2011
This study investigated the influence of host community on industrial relations practices and policies using Agbara community and Power Holding Company of Nigeria PLC as a case. The study adopted both the qualitative and quantitative methods. A total of 120 samples were drawn from the population using the simple random sampling technique in which…
Descriptors: Testing, Social Sciences, Foreign Countries, Sampling
Mercier, Kevin John – ProQuest LLC, 2011
The purpose of this investigation was to develop an instrument that has scores that are valid and reliable for measuring students' attitudes toward fitness testing. A second purpose of the study was to determine the attitudes of secondary students toward fitness testing. A review of literature, an elicitation study, and a pilot study were…
Descriptors: Student Attitudes, Females, Testing, Reliability

