Showing 1 to 15 of 179 results
Peer reviewed
Kosh, Audra E.; Simpson, Mary Ann; Bickel, Lisa; Kellogg, Mark; Sanford-Moore, Ellie – Educational Measurement: Issues and Practice, 2019
Automatic item generation (AIG)--a means of leveraging technology to create large quantities of items--requires a minimum number of items to offset the sizable upfront investment (i.e., model development and technology deployment) in order to achieve cost savings. In this cost-benefit analysis, we estimated the cost of each step of AIG and manual…
Descriptors: Cost Effectiveness, Automation, Test Items, Mathematics Tests
Peer reviewed
Evans, Carla M.; Lyons, Susan – Educational Measurement: Issues and Practice, 2017
The purpose of this study was to test methods that strengthen the comparability claims about annual determinations of student proficiency in English language arts, math, and science (Grades 3-12) in the New Hampshire Performance Assessment of Competency Education (NH PACE) pilot project. First, we examined the literature in order to define…
Descriptors: Academic Achievement, Language Arts, Mathematics Achievement, Science Achievement
Peer reviewed
Buzick, Heather M.; Jones, Nathan D. – Educational Measurement: Issues and Practice, 2015
Much of the recent focus of educational policymakers has been on improving the measurement of teacher effectiveness. Linking student growth to teacher effects has been a large part of reform efforts. To date, neither researchers nor practitioners have arrived at a consensus on how to treat test scores from students with disabilities in…
Descriptors: Teacher Evaluation, Scores, Disabilities, Teacher Effectiveness
Peer reviewed
Buzick, Heather; Stone, Elizabeth – Educational Measurement: Issues and Practice, 2014
Read aloud is a testing accommodation that has been studied by many researchers, and its use on K-12 assessments continues to be debated because of its potential to change the measured construct or unfairly increase test scores. This study is a summary of quantitative research on the read aloud accommodation. Previous studies contributed…
Descriptors: Meta Analysis, Reading Aloud to Others, Educational Research, Statistical Analysis
Peer reviewed
Kingston, Neal; Nash, Brooke – Educational Measurement: Issues and Practice, 2011
An effect size of about 0.70 (or 0.40-0.70) is often claimed for the efficacy of formative assessment, but is not supported by the existing research base. More than 300 studies that appeared to address the efficacy of formative assessment in grades K-12 were reviewed. Many of the studies had severely flawed research designs yielding…
Descriptors: Elementary Secondary Education, Formative Evaluation, Program Effectiveness, Effect Size
Peer reviewed
Shepard, Lorrie A. – Educational Measurement: Issues and Practice, 2009
In many school districts, the pressure to raise test scores has created overnight celebrity status for formative assessment. Its powers to raise student achievement have been touted, however, without attending to the research on which these claims were based. Sociocultural learning theory provides theoretical grounding for understanding how…
Descriptors: Learning Theories, Validity, Student Evaluation, Evaluation Methods
Peer reviewed
Perie, Marianne; Marion, Scott; Gong, Brian – Educational Measurement: Issues and Practice, 2009
Local assessment systems are being marketed as formative, benchmark, predictive, and a host of other terms. Many so-called formative assessments are not at all similar to the types of assessments and strategies studied by Black and Wiliam (1998) but instead are interim assessments. In this article, we clarify the definition and uses of interim…
Descriptors: Student Evaluation, Evaluation Methods, Educational Assessment, Formative Evaluation
Peer reviewed
Frey, Andreas; Hartig, Johannes; Rupp, Andre A. – Educational Measurement: Issues and Practice, 2009
In most large-scale assessments of student achievement, several broad content domains are tested. Because more items are needed to cover the content domains than can be presented in the limited testing time to each individual student, multiple test forms or booklets are utilized to distribute the items to the students. The construction of an…
Descriptors: Measures (Individuals), Test Construction, Theory Practice Relationship, Design
Peer reviewed
Heritage, Margaret; Kim, Jinok; Vendlinski, Terry; Herman, Joan – Educational Measurement: Issues and Practice, 2009
Based on the results of a generalizability study of measures of teacher knowledge for teaching mathematics developed at the National Center for Research on Evaluation, Standards, and Student Testing at the University of California, Los Angeles, this article provides evidence that teachers are better at drawing reasonable inferences about student…
Descriptors: Formative Evaluation, Educational Testing, Inferences, Mathematics Instruction
Peer reviewed
Zwick, Rebecca; Sklar, Jeffrey C.; Wakefield, Graham; Hamilton, Cris; Norman, Alex; Folsom, Douglas – Educational Measurement: Issues and Practice, 2008
In the current No Child Left Behind era, K-12 teachers and principals are expected to have a sophisticated understanding of standardized test results, use them to improve instruction, and communicate them to others. The goal of our project, funded by the National Science Foundation, was to develop and evaluate three Web-based instructional modules…
Descriptors: Standardized Tests, Test Results, Web Based Instruction, Educational Testing
Peer reviewed
Elliott, Stephen N.; Compton, Elizabeth; Roach, Andrew T. – Educational Measurement: Issues and Practice, 2007
The relationships between ratings on the Idaho Alternate Assessment (IAA) for 116 students with significant disabilities and corresponding ratings for the same students on two norm-referenced teacher rating scales were examined to gain evidence about the validity of resulting IAA scores. To contextualize these findings, another group of 54…
Descriptors: Inferences, Disabilities, Rating Scales, Eligibility
Peer reviewed
Lohman, David F.; Nichols, Paul – Educational Measurement: Issues and Practice, 2006
In this commentary, we summarize some of the main themes of the NRC report and note ways in which the papers by Mislevy and Haertel, Gorin, and Abedi and Gandara address the Panel's recommendations. We then briefly review and offer reflections on each paper. We see much to applaud here and also in the broader effort to build bridges between the…
Descriptors: Psychometrics, Student Evaluation, Measurement, Teacher Education
Peer reviewed
La Marca, Paul M. – Educational Measurement: Issues and Practice, 2006
Although it is assumed that student cognition contributes to student performance on achievement tests, it may be that current testing models lack the degree of specification necessary to warrant such inferences. With test score interpretations as the referent, the authors in this special issue address the role of student cognition in learning and…
Descriptors: Federal Legislation, Measurement, Testing, Achievement Tests
Peer reviewed
Cizek, Gregory J. – Educational Measurement: Issues and Practice, 2003
Responds to critiques of an earlier article on the unintended consequences of high-stakes testing. Reiterates the need to look for positive as well as negative consequences of high-stakes testing and calls for balance between the role of large-scale assessment in identifying achievement deficits and the responsibility educational systems have to…
Descriptors: Achievement Tests, Educational Assessment, Elementary Secondary Education, High Stakes Tests
Peer reviewed
Popham, W. James – Educational Measurement: Issues and Practice, 2003
Suggests that members of the educational measurement community are at least in part responsible for the fact that many U.S. children are receiving a reduced quality education because of the increasing misuse of educational tests. Assessment personnel have failed to speak out against the use of tests to assess school quality. (SLD)
Descriptors: Educational Quality, Educational Testing, Elementary Secondary Education, Psychometrics