Managing Gigabytes

Compressing and Indexing Documents and Images
Author: Ian H. Witten, Alistair Moffat, Timothy C. Bell
Publisher: Morgan Kaufmann
ISBN: 9781558605701
Category: Business & Economics
Page: 519
View: 8075

In this fully updated second edition of the highly acclaimed Managing Gigabytes, authors Witten, Moffat, and Bell continue to provide unparalleled coverage of state-of-the-art techniques for compressing and indexing data. Whatever your field, if you work with large quantities of information, this book is essential reading: an authoritative theoretical resource and a practical guide to meeting the toughest storage and access challenges. It covers the latest developments in compression and indexing and their application on the Web and in digital libraries. It also details dozens of powerful techniques supported by mg, the authors' own system for compressing, storing, and retrieving text, images, and textual images. mg's source code is freely available on the Web.

* Up-to-date coverage of new text compression algorithms such as block sorting, approximate arithmetic coding, and fat Huffman coding
* New sections on content-based index compression and distributed querying, with two new data structures for fast indexing
* New coverage of image coding, including descriptions of de facto standards in use on the Web (GIF and PNG), information on CALIC, the new proposed JPEG Lossless standard, and JBIG2
* New information on the Internet and WWW, digital libraries, web search engines, and agent-based retrieval
* Accompanied by a public domain system called MG, which is a fully worked-out operational example of the advanced techniques developed and explained in the book
* New appendix on an existing digital library system that uses the MG software

Multimedia Servers

Applications, Environments, and Design
Author: Dinkar Sitaram, Asit Dan
Publisher: Morgan Kaufmann
ISBN: 9781558604308
Category: Computers
Page: 297
View: 3943

This book will undoubtedly satisfy the needs of application developers, server designers, integrators, and service providers, as it provides end-to-end, top-down coverage: from application-specific issues to low-level components. Inside, the authors offer specific design, development, and implementation approaches that take into account the complexity of the environments in which multimedia servers operate. You'll learn which techniques are best suited for different kinds of applications and different kinds of networks. You'll master the challenges associated with resource scheduling, collaborative computing, session set-up, and distributed storage. Most importantly, you'll discover how to put all of these solutions to work as part of a coherent strategy aimed at exploiting economies of scale and meeting quality of service requirements.

* Presents optimized design algorithms developed by the authors and other leading researchers.
* Deals comprehensively with the systems supporting the large-scale storage, retrieval, and distribution of audio and video data.
* Balances the coverage of current technologies with forward-looking discussions to help you devise a sustainable, evolvable solution.
* Covers key issues in video-on-demand and other multimedia systems: resource scheduling, local caching, interactivity, architectural strategies, and more.

How to Build a Digital Library


Author: Ian H. Witten, David Bainbridge, David M. Nichols
Publisher: Morgan Kaufmann
ISBN: 9780080890395
Category: Computers
Page: 656
View: 5211

How to Build a Digital Library reviews the knowledge and tools needed to construct and maintain a digital library, regardless of its size or purpose. It is a resource for individuals, agencies, and institutions wishing to put this powerful tool to work in their burgeoning information treasuries. The Second Edition reflects developments in the field as well as in the Greenstone Digital Library open source software. In Part I, the authors have added an entirely new chapter on user groups, user support, collaborative browsing, user contributions, and so on. There is also new material on content-based queries, map-based queries, and cross-media queries. Multimedia receives increased emphasis, with a "digitizing" section added to each major media type. A new chapter has also been added on "internationalization," which addresses Unicode standards, multi-language interfaces and collections, and issues with non-European languages (Chinese, Hindi, etc.). Part II, the software tools section, has been completely rewritten to reflect the new developments in Greenstone Digital Library Software, an internationally popular open source software tool with a comprehensive graphical facility for creating and maintaining digital libraries.

* Outlines the history of libraries, both traditional and digital
* Written for both technical and non-technical audiences, covering the entire spectrum of media, including text, images, audio, video, and related XML standards
* Web-enhanced with software documentation, color illustrations, full-text index, source code, and more

Text Compression


Author: Timothy C. Bell
Publisher: Englewood Cliffs, N.J. : Prentice Hall
ISBN: N.A
Category: Computers
Page: 318
View: 6631

Digital Watermarking and Steganography


Author: Ingemar Cox, Matthew Miller, Jeffrey Bloom, Jessica Fridrich, Ton Kalker
Publisher: Morgan Kaufmann
ISBN: 9780080555805
Category: Computers
Page: 624
View: 8665

Digital audio, video, images, and documents are flying through cyberspace to their respective owners. Unfortunately, along the way, individuals may choose to intervene and take this content for themselves. Digital watermarking and steganography technology greatly reduces the instances of this by limiting or eliminating the ability of third parties to decipher the content that they have taken. The many techniques of digital watermarking (embedding a code) and steganography (hiding information) continue to evolve as the applications that necessitate them do the same. The authors of this second edition provide an update on the framework for applying these techniques that they provided researchers and professionals in the well-received first edition. Steganography and steganalysis (the art of detecting hidden information) have been added to a robust treatment of digital watermarking, as many in each field research and deal with the other. New material includes watermarking with side information, QIM, and dirty-paper codes. The revision and inclusion of new material by these influential authors has created a must-own book for anyone in this profession.

* This new edition now contains essential information on steganalysis and steganography
* New concepts and new applications, including QIM, are introduced
* Digital watermark embedding is given a complete update with new processes and applications

Introduction to Data Compression


Author: Khalid Sayood
Publisher: Newnes
ISBN: 0124157963
Category: Computers
Page: 740
View: 7047

Each edition of Introduction to Data Compression has widely been considered the best introduction and reference text on the art and science of data compression, and the fourth edition continues in this tradition. Data compression techniques and technology are ever-evolving with new applications in image, speech, text, audio, and video. The fourth edition includes all the cutting edge updates the reader will need during the work day and in class. Khalid Sayood provides an extensive introduction to the theory underlying today's compression techniques with detailed instruction for their applications using several examples to explain the concepts. Encompassing the entire field of data compression, Introduction to Data Compression includes lossless and lossy compression, Huffman coding, arithmetic coding, dictionary techniques, context-based compression, and scalar and vector quantization. Khalid Sayood provides a working knowledge of data compression, giving the reader the tools to develop a complete and concise compression package upon completion of his book.

* New content added to include a more detailed description of the JPEG 2000 standard
* New content includes speech coding for internet applications
* Explains established and emerging standards in depth including JPEG 2000, JPEG-LS, MPEG-2, H.264, JBIG 2, ADPCM, LPC, CELP, MELP, and iLBC
* Source code provided via companion web site that gives readers the opportunity to build their own algorithms, choose and implement techniques in their own applications

Fundamentals of Multimedia


Author: Ze-Nian Li, Mark S. Drew, Jiangchuan Liu
Publisher: Springer Science & Business Media
ISBN: 331905290X
Category: Computers
Page: 727
View: 8487

This textbook introduces the “Fundamentals of Multimedia”, addressing real issues commonly faced in the workplace. The essential concepts are explained in a practical way to enable students to apply their existing skills to address problems in multimedia. Fully revised and updated, this new edition now includes coverage of such topics as 3D TV, social networks, high-efficiency video compression and conferencing, wireless and mobile networks, and their attendant technologies. Features: presents an overview of the key concepts in multimedia, including color science; reviews lossless and lossy compression methods for image, video and audio data; examines the demands placed by multimedia communications on wired and wireless networks; discusses the impact of social media and cloud computing on information sharing and on multimedia content search and retrieval; includes study exercises at the end of each chapter; provides supplementary resources for both students and instructors at an associated website.

Understanding Digital Libraries


Author: Michael Lesk
Publisher: Elsevier
ISBN: 9780080481517
Category: Language Arts & Disciplines
Page: 456
View: 3499

This fully revised and updated second edition of Understanding Digital Libraries focuses on the challenges faced by both librarians and computer scientists in a field that has been dramatically altered by the growth of the Web. At every turn, the goal is practical: to show you how things you might need to do are already being done, or how they can be done. The first part of the book is devoted to technology and examines issues such as varying media requirements, indexing and classification, networks and distribution, and presentation. The second part of the book is concerned with the human contexts in which digital libraries function. Here you’ll find specific and useful information on usability, preservation, scientific applications, and thorny legal and economic questions.

- Thoroughly updated and expanded from original edition to include recent research, case studies and new technologies
- For librarians and technologists alike, this book provides a thorough introduction to the interdisciplinary science of digital libraries
- Written by Michael Lesk, a legend in computer science and a leading figure in the digital library field.
- Provides insights into the integration of both the technical and non-technical aspects of digital libraries

Information Retrieval

Algorithms and Heuristics
Author: David A. Grossman, Ophir Frieder
Publisher: Springer Science & Business Media
ISBN: 9781402030048
Category: Computers
Page: 332
View: 1690

Interested in how an efficient search engine works? Want to know what algorithms are used to rank resulting documents in response to user requests? The authors answer these and other key information retrieval design and implementation questions. This book is not yet another high level text. Instead, algorithms are thoroughly described, making this book ideally suited for both computer science students and practitioners who work on search-related applications. As stated in the foreword, this book provides a current, broad, and detailed overview of the field and is the only one that does so. Examples are used throughout to illustrate the algorithms. The authors explain how a query is ranked against a document collection using either a single or a combination of retrieval strategies, and how an assortment of utilities are integrated into the query processing scheme to improve these rankings. Methods for building and compressing text indexes, querying and retrieving documents in multiple languages, and using parallel or distributed processing to expedite the search are likewise described. This edition is a major expansion of the one published in 1998. Besides updating the entire book with current techniques, it includes new sections on language models, cross-language information retrieval, peer-to-peer processing, XML search, mediators, and duplicate document detection.

Web Data Management


Author: Serge Abiteboul, Ioana Manolescu, Philippe Rigaux, Marie-Christine Rousset, Pierre Senellart
Publisher: Cambridge University Press
ISBN: 113950505X
Category: Computers
Page: N.A
View: 9435

The Internet and World Wide Web have revolutionized access to information. Users now store information across multiple platforms from personal computers to smartphones and websites. As a consequence, data management concepts, methods and techniques are increasingly focused on distribution concerns. Now that information largely resides in the network, so do the tools that process this information. This book explains the foundations of XML with a focus on data distribution. It covers the many facets of distributed data management on the Web, such as description logics, that are already emerging in today's data integration applications and herald tomorrow's semantic Web. It also introduces the machinery used to manipulate the unprecedented amount of data collected on the Web. Several 'Putting into Practice' chapters describe detailed practical applications of the technologies and techniques. The book will serve as an introduction to the new, global, information systems for Web professionals and master's level courses.

Data Mining

Practical Machine Learning Tools and Techniques
Author: Ian H. Witten, Eibe Frank, Mark A. Hall, Christopher J. Pal
Publisher: Morgan Kaufmann
ISBN: 0128043571
Category: Computers
Page: 654
View: 5256

Data Mining: Practical Machine Learning Tools and Techniques, Fourth Edition, offers a thorough grounding in machine learning concepts, along with practical advice on applying these tools and techniques in real-world data mining situations. This highly anticipated fourth edition of the most acclaimed work on data mining and machine learning teaches readers everything they need to know to get going, from preparing inputs, interpreting outputs, and evaluating results to the algorithmic methods at the heart of successful data mining approaches. Extensive updates reflect the technical changes and modernizations that have taken place in the field since the last edition, including substantial new chapters on probabilistic methods and on deep learning. Accompanying the book is a new version of the popular WEKA machine learning software from the University of Waikato. Authors Witten, Frank, Hall, and Pal include today's techniques coupled with the methods at the leading edge of contemporary research.

Please visit the book companion website at http://www.cs.waikato.ac.nz/ml/weka/book.html. It contains PowerPoint slides for Chapters 1-12, a very comprehensive teaching resource with many slides covering each chapter of the book; an online appendix on the Weka workbench, again a very comprehensive learning aid for the open source software that goes with the book; and the table of contents, highlighting the many new sections in the 4th edition, along with reviews of the 1st edition, errata, etc.

* Provides a thorough grounding in machine learning concepts, as well as practical advice on applying the tools and techniques to data mining projects
* Presents concrete tips and techniques for performance improvement that work by transforming the input or output in machine learning methods
* Includes a downloadable WEKA software toolkit, a comprehensive collection of machine learning algorithms for data mining tasks, in an easy-to-use interactive interface
* Includes open-access online courses that introduce practical applications of the material in the book

Digital Video and HD

Algorithms and Interfaces
Author: Charles A. Poynton
Publisher: Elsevier
ISBN: 0123919266
Category: Computers
Page: 707
View: 2448

Acknowledgments -- List of Figures -- List of Tables -- Preface -- Part 1 -- Introduction -- Chapter 1 -- Raster Images -- Chapter 2 -- Quantization -- Chapter 3 -- Brightness Contrast Controls -- Chapter 4 -- Raster Images in Computing -- Chapter 5 -- Raster Scanning -- Chapter 6 -- Image Structure -- Chapter 7 -- Resolution -- Chapter 8 -- Constant Luminance -- Chapter 9 -- Rendering Intent -- Chapter 10 -- Introduction to Luma Chroma -- Chapter 11 -- Introduction to Component SDTV -- Chapter 12 -- Introduction to Composite NTSC PAL -- Chapter 13 -- Introduction to HDTV -- Chapter 14 -- Introduction to Compression -- Chapter 15 -- Digital Video Interfaces -- Part 2 -- Principles -- Chapter 16 -- Filtering and Sampling -- Chapter 17 -- Resampling, Interpolation, and decimation -- Chapter 18 -- Image Digitization and Reconstruction -- Chapter 19 -- Perception and Visual Acuity -- Chapter 20 -- Luminance and Lightness -- Chapter 21 -- The CIE System of Colorimetry -- Chapter 22 -- Color ...

Information Retrieval

Implementing and Evaluating Search Engines
Author: Stefan Büttcher, Charles L. A. Clarke, Gordon V. Cormack
Publisher: MIT Press
ISBN: 0262528878
Category: Computers
Page: 632
View: 9184

An introduction to information retrieval, the foundation for modern search engines, that emphasizes implementation and experimentation.

Keeping Found Things Found: The Study and Practice of Personal Information Management


Author: William Jones
Publisher: Morgan Kaufmann
ISBN: 9780080554150
Category: Computers
Page: 448
View: 1798

Keeping Found Things Found: The Study and Practice of Personal Information Management is the first comprehensive book on the new 'favorite child' of R&D at Microsoft and elsewhere, personal information management (PIM). It provides a comprehensive overview of PIM as both a study and a practice of the activities people do, and need to be doing, so that information can work for them in their daily lives. This book is designed for R&D professionals in HCI, data mining and data management, information retrieval, and related areas, plus developers of tools and software that include PIM solutions.

* Focuses exclusively on one of the most interesting and challenging problems in today's world
* Explores what good and better PIM looks like, and how to measure improvements
* Presents key questions to consider when evaluating any new PIM informational tools or systems

Readings in Information Retrieval


Author: Karen Sparck Jones, Peter Willett
Publisher: Morgan Kaufmann
ISBN: 9781558604544
Category: Computers
Page: 589
View: 1988

This compilation of original papers on information retrieval presents an overview, covering both general theory and specific methods, of the development and current status of information retrieval systems. Each chapter contains several papers carefully chosen to represent substantive research work that has been carried out in that area; each is preceded by an introductory overview and followed by supporting references for further reading.

Computer Organization and Design RISC-V Edition

The Hardware Software Interface
Author: David A. Patterson, John L. Hennessy
Publisher: Morgan Kaufmann
ISBN: 0128122765
Category: Computers
Page: 696
View: 4669

The new RISC-V Edition of Computer Organization and Design features the RISC-V open source instruction set architecture, the first open source architecture designed to be used in modern computing environments such as cloud computing, mobile devices, and other embedded systems. With the post-PC era now upon us, Computer Organization and Design moves forward to explore this generational change with examples, exercises, and material highlighting the emergence of mobile computing and the Cloud. Updated content featuring tablet computers, Cloud infrastructure, and the x86 (cloud computing) and ARM (mobile computing devices) architectures is included. An online companion Web site provides advanced content for further study, appendices, glossary, references, and recommended reading.

* Features RISC-V, the first such architecture designed to be used in modern computing environments, such as cloud computing, mobile devices, and other embedded systems
* Includes relevant examples, exercises, and material highlighting the emergence of mobile computing and the cloud

Embedded Computing

A VLIW Approach to Architecture, Compilers and Tools
Author: Joseph A. Fisher, Paolo Faraboschi, Clifford Young
Publisher: Elsevier
ISBN: 1558607668
Category: Computers
Page: 671
View: 3451

The fact that there are more embedded computers than general-purpose computers and that we are impacted by hundreds of them every day is no longer news. What is news is that their increasing performance requirements, complexity and capabilities demand a new approach to their design. Fisher, Faraboschi, and Young describe a new age of embedded computing design, in which the processor is central, making the approach radically distinct from contemporary practices of embedded systems design. They demonstrate why it is essential to take a computing-centric and system-design approach to the traditional elements of nonprogrammable components, peripherals, interconnects and buses. These elements must be unified in a system design with high-performance processor architectures, microarchitectures and compilers, and with the compilation tools, debuggers and simulators needed for application development. In this landmark text, the authors apply their expertise in highly interdisciplinary hardware/software development and VLIW processors to illustrate this change in embedded computing. VLIW architectures have long been a popular choice in embedded systems design, and while VLIW is a running theme throughout the book, embedded computing is the core topic. Embedded Computing examines both in a book filled with fact and opinion based on the authors' many years of R&D experience.

· Complemented by a unique, professional-quality embedded tool-chain on the authors' website, http://www.vliw.org/book
· Combines technical depth with real-world experience
· Comprehensively explains the differences between general purpose computing systems and embedded systems at the hardware, software, tools and operating system levels.
· Uses concrete examples to explain and motivate the trade-offs.

Beautiful Data

The Stories Behind Elegant Data Solutions
Author: Toby Segaran, Jeff Hammerbacher
Publisher: O'Reilly Media, Inc.
ISBN: 144937929X
Category: Computers
Page: 386
View: 6160

In this insightful book, you'll learn from the best data practitioners in the field just how wide-ranging, and beautiful, working with data can be. Join 39 contributors as they explain how they developed simple and elegant solutions on projects ranging from the Mars lander to a Radiohead video. With Beautiful Data, you will:

* Explore the opportunities and challenges involved in working with the vast number of datasets made available by the Web
* Learn how to visualize trends in urban crime, using maps and data mashups
* Discover the challenges of designing a data processing system that works within the constraints of space travel
* Learn how crowdsourcing and transparency have combined to advance the state of drug research
* Understand how new data can automatically trigger alerts when it matches or overlaps pre-existing data
* Learn about the massive infrastructure required to create, capture, and process DNA data

That's only a small sample of what you'll find in Beautiful Data. For anyone who handles data, this is a truly fascinating book. Contributors include:

* Nathan Yau
* Jonathan Follett and Matt Holm
* J.M. Hughes
* Raghu Ramakrishnan, Brian Cooper, and Utkarsh Srivastava
* Jeff Hammerbacher
* Jason Dykes and Jo Wood
* Jeff Jonas and Lisa Sokol
* Jud Valeski
* Alon Halevy and Jayant Madhavan
* Aaron Koblin with Valdean Klump
* Michal Migurski
* Jeff Heer
* Coco Krumme
* Peter Norvig
* Matt Wood and Ben Blackburne
* Jean-Claude Bradley, Rajarshi Guha, Andrew Lang, Pierre Lindenbaum, Cameron Neylon, Antony Williams, and Egon Willighagen
* Lukas Biewald and Brendan O'Connor
* Hadley Wickham, Deborah Swayne, and David Poole
* Andrew Gelman, Jonathan P. Kastellec, and Yair Ghitza
* Toby Segaran