Managing Gigabytes

Compressing and Indexing Documents and Images
Author: Ian H. Witten,Alistair Moffat,Timothy C. Bell
Publisher: Morgan Kaufmann
ISBN: 9781558605701
Category: Business & Economics
Page: 519
View: 1892

Continue Reading →

In this fully updated second edition of the highly acclaimed Managing Gigabytes, authors Witten, Moffat, and Bell continue to provide unparalleled coverage of state-of-the-art techniques for compressing and indexing data. Whatever your field, if you work with large quantities of information, this book is essential reading--an authoritative theoretical resource and a practical guide to meeting the toughest storage and access challenges. It covers the latest developments in compression and indexing and their application on the Web and in digital libraries. It also details dozens of powerful techniques supported by mg, the authors' own system for compressing, storing, and retrieving text, images, and textual images. mg's source code is freely available on the Web. * Up-to-date coverage of new text compression algorithms such as block sorting, approximate arithmetic coding, and fat Huffman coding * New sections on content-based index compression and distributed querying, with 2 new data structures for fast indexing * New coverage of image coding, including descriptions of de facto standards in use on the Web (GIF and PNG), information on CALIC, the new proposed JPEG Lossless standard, and JBIG2 * New information on the Internet and WWW, digital libraries, web search engines, and agent-based retrieval * Accompanied by a public domain system called MG which is a fully worked-out operational example of the advanced techniques developed and explained in the book * New appendix on an existing digital library system that uses the MG software

Multimedia Servers

Applications, Environments, and Design
Author: Dinkar Sitaram,Asit Dan
Publisher: Morgan Kaufmann
ISBN: 9781558604308
Category: Computers
Page: 297
View: 3467

Continue Reading →

This book will undoubtedly satisfy the needs of application developers, server designers, integrators, and service providers, as it provides end-to-end, top-down coverage: from application-specific issues to low-level components. Inside, the authors offer specific design, development, and implementation approaches that take into account the complexity of the environments in which multimedia servers operate. You'll learn which techniques are best suited for different kinds of applications and different kinds of networks. You'll master the challenges associated with resource scheduling, collaborative computing, session set-up, and distributed storage. Most importantly, you'll discover how to put all of these solutions to work as part of a coherent strategy aimed at exploiting economies of scale and meeting quality of service requirements. * Presents optimized design algorithms developed by the authors and other leading researchers. * Deals comprehensively with the systems supporting the large-scale storage, retrieval, and distribution of audio and video data. * Balances the coverage of current technologies with forward-looking discussions to help you devise a sustainable, evolvable solution. * Covers key issues in video-on-demand and other multimedia systems: resource scheduling, local caching, interactivity, architectural strategies, and more.

How to Build a Digital Library


Author: Ian H. Witten,David Bainbridge,David M. Nichols
Publisher: Morgan Kaufmann
ISBN: 9780080890395
Category: Computers
Page: 656
View: 8624

Continue Reading →

How to Build a Digital Library reviews knowledge and tools to construct and maintain a digital library, regardless of the size or purpose. A resource for individuals, agencies, and institutions wishing to put this powerful tool to work in their burgeoning information treasuries. The Second Edition reflects developments in the field as well as in the Greenstone Digital Library open source software. In Part I, the authors have added an entire new chapter on user groups, user support, collaborative browsing, user contributions, and so on. There is also new material on content-based queries, map-based queries, cross-media queries. There is an increased emphasis placed on multimedia by adding a "digitizing" section to each major media type. A new chapter has also been added on "internationalization," which will address Unicode standards, multi-language interfaces and collections, and issues with non-European languages (Chinese, Hindi, etc.). Part II, the software tools section, has been completely rewritten to reflect the new developments in Greenstone Digital Library Software, an internationally popular open source software tool with a comprehensive graphical facility for creating and maintaining digital libraries. Outlines the history of libraries on both traditional and digital Written for both technical and non-technical audiences and covers the entire spectrum of media, including text, images, audio, video, and related XML standards Web-enhanced with software documentation, color illustrations, full-text index, source code, and more

Digital Watermarking and Steganography


Author: Ingemar Cox,Matthew Miller,Jeffrey Bloom,Jessica Fridrich,Ton Kalker
Publisher: Morgan Kaufmann
ISBN: 9780080555805
Category: Computers
Page: 624
View: 3448

Continue Reading →

Digital audio, video, images, and documents are flying through cyberspace to their respective owners. Unfortunately, along the way, individuals may choose to intervene and take this content for themselves. Digital watermarking and steganography technology greatly reduces the instances of this by limiting or eliminating the ability of third parties to decipher the content that he has taken. The many techiniques of digital watermarking (embedding a code) and steganography (hiding information) continue to evolve as applications that necessitate them do the same. The authors of this second edition provide an update on the framework for applying these techniques that they provided researchers and professionals in the first well-received edition. Steganography and steganalysis (the art of detecting hidden information) have been added to a robust treatment of digital watermarking, as many in each field research and deal with the other. New material includes watermarking with side information, QIM, and dirty-paper codes. The revision and inclusion of new material by these influential authors has created a must-own book for anyone in this profession. This new edition now contains essential information on steganalysis and steganography New concepts and new applications including QIM introduced Digital watermark embedding is given a complete update with new processes and applications

Mining the Web

Discovering Knowledge from Hypertext Data
Author: Soumen Chakrabarti
Publisher: Morgan Kaufmann
ISBN: 9781558607545
Category: Computers
Page: 345
View: 4446

Continue Reading →

The definitive book on mining the Web from the preeminent authority.

How to Build a Digital Library


Author: Ian H. Witten,David Bainbridge,David M. Nichols
Publisher: Morgan Kaufmann
ISBN: 9780080890395
Category: Computers
Page: 656
View: 2418

Continue Reading →

How to Build a Digital Library reviews knowledge and tools to construct and maintain a digital library, regardless of the size or purpose. A resource for individuals, agencies, and institutions wishing to put this powerful tool to work in their burgeoning information treasuries. The Second Edition reflects developments in the field as well as in the Greenstone Digital Library open source software. In Part I, the authors have added an entire new chapter on user groups, user support, collaborative browsing, user contributions, and so on. There is also new material on content-based queries, map-based queries, cross-media queries. There is an increased emphasis placed on multimedia by adding a "digitizing" section to each major media type. A new chapter has also been added on "internationalization," which will address Unicode standards, multi-language interfaces and collections, and issues with non-European languages (Chinese, Hindi, etc.). Part II, the software tools section, has been completely rewritten to reflect the new developments in Greenstone Digital Library Software, an internationally popular open source software tool with a comprehensive graphical facility for creating and maintaining digital libraries. Outlines the history of libraries on both traditional and digital Written for both technical and non-technical audiences and covers the entire spectrum of media, including text, images, audio, video, and related XML standards Web-enhanced with software documentation, color illustrations, full-text index, source code, and more

Introduction to Data Compression


Author: Khalid Sayood
Publisher: Newnes
ISBN: 0124157963
Category: Computers
Page: 740
View: 5578

Continue Reading →

Each edition of Introduction to Data Compression has widely been considered the best introduction and reference text on the art and science of data compression, and the fourth edition continues in this tradition. Data compression techniques and technology are ever-evolving with new applications in image, speech, text, audio, and video. The fourth edition includes all the cutting edge updates the reader will need during the work day and in class. Khalid Sayood provides an extensive introduction to the theory underlying today's compression techniques with detailed instruction for their applications using several examples to explain the concepts. Encompassing the entire field of data compression, Introduction to Data Compression includes lossless and lossy compression, Huffman coding, arithmetic coding, dictionary techniques, context based compression, scalar and vector quantization. Khalid Sayood provides a working knowledge of data compression, giving the reader the tools to develop a complete and concise compression package upon completion of his book. New content added to include a more detailed description of the JPEG 2000 standard New content includes speech coding for internet applications Explains established and emerging standards in depth including JPEG 2000, JPEG-LS, MPEG-2, H.264, JBIG 2, ADPCM, LPC, CELP, MELP, and iLBC Source code provided via companion web site that gives readers the opportunity to build their own algorithms, choose and implement techniques in their own applications

Understanding Digital Libraries


Author: Michael Lesk
Publisher: Elsevier
ISBN: 9780080481517
Category: Language Arts & Disciplines
Page: 456
View: 9148

Continue Reading →

This fully revised and updated second edition of Understanding Digital Libraries focuses on the challenges faced by both librarians and computer scientists in a field that has been dramatically altered by the growth of the Web. At every turn, the goal is practical: to show you how things you might need to do are already being done, or how they can be done. The first part of the book is devoted to technology and examines issues such as varying media requirements, indexing and classification, networks and distribution, and presentation. The second part of the book is concerned with the human contexts in which digital libraries function. Here you’ll find specific and useful information on usability, preservation, scientific applications, and thorny legal and economic questions. - Thoroughly updated and expanded from original edition to include recent research, case studies and new technologies - For librarians and technologists alike, this book provides a thorough introduction to the interdisciplinary science of digital libraries - Written by Michael Lesk, a legend in computer science and a leading figure in the digital library field. - Provides insights into the integration of both the technical and non-technical aspects of digital libraries

Bioinformatics

Managing Scientific Data
Author: Zoé Lacroix
Publisher: Academic Press
ISBN: 155860829X
Category: Science
Page: 441
View: 1830

Continue Reading →

The heart of the book lies in the collaboration efforts of eight distinct bioinformatics teams that describe their own unique approaches to data integration and interoperability. Each system receives its own chapter where the lead contributors provide precious insight into the specific problems being addressed by the system, why the particular architecture was chosen, and details on the system's strengths and weaknesses. In closing, the editors provide important criteria for evaluating these systems that bioinformatics professionals will find valuable. * Provides a clear overview of the state-of-the-art in data integration and interoperability in genomics, highlighting a variety of systems and giving insight into the strengths and weaknesses of their different approaches.-

Digital Watermarking


Author: Ingemar Cox,Matthew Miller,Jeffrey Bloom,Mathew Miller
Publisher: Elsevier
ISBN: 9780080504599
Category: Computers
Page: 542
View: 7482

Continue Reading →

Digital watermarking is a key ingredient to copyright protection. It provides a solution to illegal copying of digital material and has many other useful applications such as broadcast monitoring and the recording of electronic transactions. Now, for the first time, there is a book that focuses exclusively on this exciting technology. Digital Watermarking covers the crucial research findings in the field: it explains the principles underlying digital watermarking technologies, describes the requirements that have given rise to them, and discusses the diverse ends to which these technologies are being applied. As a result, additional groundwork is laid for future developments in this field, helping the reader understand and anticipate new approaches and applications. * Emphasizes the underlying watermarking principles that are relevant for all media: images, video, and audio. * Discusses a wide variety of applications, theoretical principles, detection and embedding concepts and the key properties of digital watermarks--robustness, fidelity, data payload, and security * Examines copyright protection and many other applications, including broadcast monitoring, transaction tracking, authentication, copy control, and device control. * Presents a series of detailed examples called "Investigations" that illustrate key watermarking concepts and practices. * Includes an appendix in the book and on the web containing the source code for the examples. * Includes a comprehensive glossary of watermarking terminology

Modern Information Retrieval

The Concepts and Technology Behind Search
Author: Ricardo Baeza-Yates,Berthier Ribeiro-Neto
Publisher: Addison-Wesley Professional
ISBN: 9780321416919
Category: Computers
Page: 913
View: 1039

Continue Reading →

This is a rigorous and complete textbook for a first course on information retrieval from the computer science perspective. It provides an up-to-date student oriented treatment of information retrieval including extensive coverage of new topics such as web retrieval, web crawling, open source search engines and user interfaces. From parsing to indexing, clustering to classification, retrieval to ranking, and user feedback to retrieval evaluation, all of the most important concepts are carefully introduced and exemplified. The contents and structure of the book have been carefully designed by the two main authors, with individual contributions coming from leading international authorities in the field, including Yoelle Maarek, Senior Director of Yahoo! Research Israel; Dulce Poncele´on IBM Research; and Malcolm Slaney, Yahoo Research USA. This completely reorganized, revised and enlarged second edition of Modern Information Retrieval contains many new chapters and double the number of pages and bibliographic references of the first edition, and a companion website www.mir2ed.org with teaching material. It will prove invaluable to students, professors, researchers, practitioners, and scholars of this fascinating field of information retrieval.

Fundamentals of Multimedia


Author: Ze-Nian Li,Mark S. Drew,Jiangchuan Liu
Publisher: Springer Science & Business Media
ISBN: 331905290X
Category: Computers
Page: 727
View: 2159

Continue Reading →

This textbook introduces the “Fundamentals of Multimedia”, addressing real issues commonly faced in the workplace. The essential concepts are explained in a practical way to enable students to apply their existing skills to address problems in multimedia. Fully revised and updated, this new edition now includes coverage of such topics as 3D TV, social networks, high-efficiency video compression and conferencing, wireless and mobile networks, and their attendant technologies. Features: presents an overview of the key concepts in multimedia, including color science; reviews lossless and lossy compression methods for image, video and audio data; examines the demands placed by multimedia communications on wired and wireless networks; discusses the impact of social media and cloud computing on information sharing and on multimedia content search and retrieval; includes study exercises at the end of each chapter; provides supplementary resources for both students and instructors at an associated website.

Comparative Evaluation of XML Information Retrieval Systems

5th International Workshop of the Initiative for the Evaluation of XML Retrieval, INEX 2006 Dagstuhl Castle, Germany, December 17-20, 2006 Revised and Selected Papers
Author: Norbert Fuhr,Mounia Lalmas,Andrew Trotman
Publisher: Springer Science & Business Media
ISBN: 3540738878
Category: Computers
Page: 554
View: 7675

Continue Reading →

This book constitutes the thoroughly refereed post-proceedings of the 5th International Workshop of the Initiative for the Evaluation of XML Retrieval, INEX 2006, held at Dagstuhl Castle, Germany, in December 2006. The 49 revised full papers presented were carefully selected for presentation at the workshop and went through a subsequent round of careful reviewing and revision. The papers are organized in topical sections on methodology, and 7 additional tracks on ad-hoc, natural language processing, heterogeneous collection, multimedia, interactive, use case, as well as document mining.

Digital Video and HD

Algorithms and Interfaces
Author: Charles A. Poynton
Publisher: Elsevier
ISBN: 0123919266
Category: Computers
Page: 707
View: 3304

Continue Reading →

Acknowledgments -- List of Figures -- List of Tables -- Preface -- Part 1 -- Introduction -- Chapter 1 -- Raster Images -- Chapter 2 -- Quantization -- Chapter 3 -- Brightness Contrast Controls -- Chapter 4 -- Raster Images in Computing -- Chapter 5 -- Raster Scanning -- Chapter 6 -- Image Structure -- Chapter 7 -- Resolution -- Chapter 8 -- Constant Luminance -- Chapter 9 -- Rendering Intent -- Chapter 10 -- Introduction to Luma Chroma -- Chapter 11 -- Introduction to Component SDTV -- Chapter 12 -- Introduction to Composite NTSC PAL -- Chapter 13 -- Introduction to HDTV -- Chapter 14 -- Introduction to Compression -- Chapter 15 -- Digital Video Interfaces -- Part 2 -- Principles -- Chapter 16 -- Filtering and Sampling -- Chapter 17 -- Resampling, Interpolation, and decimation -- Chapter 18 -- Image Digitization and Reconstruction -- Chapter 19 -- Perception and Visual Acuity -- Chapter 20 -- Luminance and Lightness -- Chapter 21 -- The CIE System of Colorimetry -- Chapter 22 -- Color ...

Web Data Management


Author: Serge Abiteboul,Ioana Manolescu,Philippe Rigaux,Marie-Christine Rousset,Pierre Senellart
Publisher: Cambridge University Press
ISBN: 113950505X
Category: Computers
Page: N.A
View: 6138

Continue Reading →

The Internet and World Wide Web have revolutionized access to information. Users now store information across multiple platforms from personal computers to smartphones and websites. As a consequence, data management concepts, methods and techniques are increasingly focused on distribution concerns. Now that information largely resides in the network, so do the tools that process this information. This book explains the foundations of XML with a focus on data distribution. It covers the many facets of distributed data management on the Web, such as description logics, that are already emerging in today's data integration applications and herald tomorrow's semantic Web. It also introduces the machinery used to manipulate the unprecedented amount of data collected on the Web. Several 'Putting into Practice' chapters describe detailed practical applications of the technologies and techniques. The book will serve as an introduction to the new, global, information systems for Web professionals and master's level courses.

Information Retrieval

Algorithms and Heuristics
Author: David A. Grossman,Ophir Frieder
Publisher: Springer Science & Business Media
ISBN: 9781402030048
Category: Computers
Page: 332
View: 2397

Continue Reading →

Interested in how an efficient search engine works? Want to know what algorithms are used to rank resulting documents in response to user requests? The authors answer these and other key information retrieval design and implementation questions. This book is not yet another high level text. Instead, algorithms are thoroughly described, making this book ideally suited for both computer science students and practitioners who work on search-related applications. As stated in the foreword, this book provides a current, broad, and detailed overview of the field and is the only one that does so. Examples are used throughout to illustrate the algorithms. The authors explain how a query is ranked against a document collection using either a single or a combination of retrieval strategies, and how an assortment of utilities are integrated into the query processing scheme to improve these rankings. Methods for building and compressing text indexes, querying and retrieving documents in multiple languages, and using parallel or distributed processing to expedite the search are likewise described. This edition is a major expansion of the one published in 1998. Besides updating the entire book with current techniques, it includes new sections on language models, cross-language information retrieval, peer-to-peer processing, XML search, mediators, and duplicate document detection.

Data Mining

Practical Machine Learning Tools and Techniques
Author: Ian H. Witten,Eibe Frank,Mark A. Hall,Christopher J. Pal
Publisher: Morgan Kaufmann
ISBN: 0128043571
Category: Computers
Page: 654
View: 6165

Continue Reading →

Data Mining: Practical Machine Learning Tools and Techniques, Fourth Edition, offers a thorough grounding in machine learning concepts, along with practical advice on applying these tools and techniques in real-world data mining situations. This highly anticipated fourth edition of the most acclaimed work on data mining and machine learning teaches readers everything they need to know to get going, from preparing inputs, interpreting outputs, evaluating results, to the algorithmic methods at the heart of successful data mining approaches. Extensive updates reflect the technical changes and modernizations that have taken place in the field since the last edition, including substantial new chapters on probabilistic methods and on deep learning. Accompanying the book is a new version of the popular WEKA machine learning software from the University of Waikato. Authors Witten, Frank, Hall, and Pal include today's techniques coupled with the methods at the leading edge of contemporary research. Please visit the book companion website at http://www.cs.waikato.ac.nz/ml/weka/book.html It contains Powerpoint slides for Chapters 1-12. This is a very comprehensive teaching resource, with many PPT slides covering each chapter of the book Online Appendix on the Weka workbench; again a very comprehensive learning aid for the open source software that goes with the book Table of contents, highlighting the many new sections in the 4th edition, along with reviews of the 1st edition, errata, etc. Provides a thorough grounding in machine learning concepts, as well as practical advice on applying the tools and techniques to data mining projects Presents concrete tips and techniques for performance improvement that work by transforming the input or output in machine learning methods Includes a downloadable WEKA software toolkit, a comprehensive collection of machine learning algorithms for data mining tasks-in an easy-to-use interactive interface Includes open-access online courses that introduce practical applications of the material in the book

Visual Interfaces to Digital Libraries


Author: Chaomei Chen
Publisher: Springer
ISBN: 3540362223
Category: Computers
Page: 232
View: 639

Continue Reading →

Visual Interfaces to Digital Libraries exploit the power of human vision and spatial cognition to help individuals mentally organize and electronically access and manage large and complex information spaces. They draw on progress in the field of information visualization and seek to shift the users' mental load from slow reading to faster perceptual processes such as visual pattern recognition. Based on two workshops, the book presents an introductory overview as well as a closing listing of the top ten problems in the area by the volume editors. Also included are 16 thoroughly reviewed and revised full papers organized in topical sections on visual interfaces to documents, document parts, document variants, and document usage data; visual interfaces to image and video documents; visualization of knowledge domains; cartographic interfaces to digital libraries; and a general framework.

Internet of Things

Principles and Paradigms
Author: Rajkumar Buyya,Amir Vahid Dastjerdi
Publisher: Elsevier
ISBN: 0128093471
Category: Computers
Page: 378
View: 1145

Continue Reading →

Internet of Things: Principles and Paradigms captures the state-of-the-art research in Internet of Things, its applications, architectures, and technologies. The book identifies potential future directions and technologies that facilitate insight into numerous scientific, business, and consumer applications. The Internet of Things (IoT) paradigm promises to make any electronic devices part of the Internet environment. This new paradigm opens the doors to new innovations and interactions between people and things that will enhance the quality of life and utilization of scarce resources. To help realize the full potential of IoT, the book addresses its numerous challenges and develops the conceptual and technological solutions for tackling them. These challenges include the development of scalable architecture, moving from closed systems to open systems, designing interaction protocols, autonomic management, and the privacy and ethical issues around data sensing, storage, and processing. Addresses the main concepts and features of the IoT paradigm Describes different architectures for managing IoT platforms Provides insight on trust, security, and privacy in IoT environments Describes data management techniques applied to the IoT environment Examines the key enablers and solutions to enable practical IoT systems Looks at the key developments that support next generation IoT platforms Includes input from expert contributors from both academia and industry on building and deploying IoT platforms and applications

Web Engineering and Peer-to-Peer Computing

NETWORKING 2002 Workshops, Pisa, Italy, May 19-24, 2002, Revised Papers
Author: Enrico Gregori,Ludmila Cherkasova,Gianpaolo Cugola,Fabio Panzieri,Gian P. Picco
Publisher: Springer
ISBN: 3540457453
Category: Computers
Page: 360
View: 3015

Continue Reading →

This book constitutes the refereed proceedings of the two thematic workshops held jointly with Networking 2002: WEB Engineering and Peer-to-Peer C- puting. Networking 2002 was organized by the Italian National Research Council (CNR) and was sponsored by the IFIP working groups WG 6.2 (Network and Intern- work Architectures), WG 6.3 (Performance of Communication Systems), and WG 6.8 (Wireless Communications). The program of the conference covered ?ve days and included the main conference (three days), two tutorial days, and one day of thematic workshops. TheInternationalWorkshoponWebEngineeringwasdedicatedtothedisc- sionoftheprincipalissuesthatemergeinthedesignandimplementationoflar- scale, complex, Web-based systems. Scalability issues pose a number of ch- lenging problems to solve for both applications and the underlying web/network infrastructure. On one hand, web services and internet applications must take into account network performance and transport protocol design, to achieve - ceptable performance and robustness. On the other hand, emerging network and Web technologies are determined by the requirements of these applications. Fifteen papers were presented that illustrated the current state of the art in this area. In addition to the authors of these papers, the Workshop on Web Engine- ing was attended by about thirty participants, who contributed to the workshop by stimulating fruitful discussions at the end of each presentation. Thus, this workshop provided a excellent opportunity for researchers, from both industry and academia, to gather, exchange ideas, and discuss recent results in the dev- opment of Web-based systems and emerging Internet applications.