Ian H. Witten - Books
384 kr
Ships within 7-10 business days
660 kr
Ships within 7-10 business days
740 kr
Ships within 7-10 business days
How to Build a Digital Library reviews the knowledge and tools needed to construct and maintain a digital library, regardless of its size or purpose. It is a resource for individuals, agencies, and institutions wishing to put this powerful tool to work in their burgeoning information treasuries.
The Second Edition reflects developments in the field as well as in the Greenstone Digital Library open source software. In Part I, the authors have added an entirely new chapter on user groups, user support, collaborative browsing, user contributions, and so on. There is also new material on content-based queries, map-based queries, and cross-media queries. Multimedia receives increased emphasis through a "digitizing" section added to each major media type. A new chapter on "internationalization" has also been added, addressing Unicode standards, multi-language interfaces and collections, and issues with non-European languages (Chinese, Hindi, etc.).
Part II, the software tools section, has been completely rewritten to reflect the new developments in Greenstone Digital Library Software, an internationally popular open source software tool with a comprehensive graphical facility for creating and maintaining digital libraries.
Outlines the history of libraries on both traditional and digital
Written for both technical and non-technical audiences and covers the entire spectrum of media, including text, images, audio, video, and related XML standards
Web-enhanced with software documentation, color illustrations, full-text index, source code, and more
797 kr
Ships within 5-8 business days
**2026 Textbook and Academic Authors Association (TAA) Textbook Excellence "Texty" Award Winner**
Data Mining: Practical Machine Learning Tools and Techniques, Fifth Edition, offers a thorough grounding in machine learning concepts, along with practical advice on applying these tools and techniques in real-world data mining situations. This highly anticipated new edition of the most acclaimed work on data mining and machine learning teaches readers everything they need to know to get going, from preparing inputs, interpreting outputs, evaluating results, to the algorithmic methods at the heart of successful data mining approaches.
Extensive updates reflect the technical changes and modernizations that have taken place in the field since the last edition, including more recent deep learning content on topics such as generative AI (GANs, VAEs, diffusion models), large language models (transformers, BERT and GPT models), and adversarial examples, as well as a comprehensive treatment of ethical and responsible artificial intelligence topics. Authors Ian H. Witten, Eibe Frank, Mark A. Hall, and Christopher J. Pal, along with new author James R. Foulds, include today’s techniques coupled with the methods at the leading edge of contemporary research.
Provides a thorough grounding in machine learning concepts, as well as practical advice on applying the tools and techniques to data mining projects
Presents concrete tips and techniques for performance improvement that work by transforming the input or output in machine learning methods
Features in-depth information on deep learning and probabilistic models
Covers performance improvement techniques, including input preprocessing and combining output from different methods
Provides an appendix introducing the WEKA machine learning workbench and links to algorithm implementations in the software
Includes all-new exercises for each chapter
523 kr
Ships within 7-10 business days
916 kr
Ships within 7-10 business days
Managing Gigabytes
Compressing and Indexing Documents and Images, Second Edition
1 012 kr
Ships within 7-10 business days
In this fully updated second edition of the highly acclaimed Managing Gigabytes, authors Witten, Moffat, and Bell continue to provide unparalleled coverage of state-of-the-art techniques for compressing and indexing data. Whatever your field, if you work with large quantities of information, this book is essential reading--an authoritative theoretical resource and a practical guide to meeting the toughest storage and access challenges. It covers the latest developments in compression and indexing and their application on the Web and in digital libraries. It also details dozens of powerful techniques supported by mg, the authors' own system for compressing, storing, and retrieving text, images, and textual images. mg's source code is freely available on the Web.
Up-to-date coverage of new text compression algorithms such as block sorting, approximate arithmetic coding, and fat Huffman coding
New sections on content-based index compression and distributed querying, with 2 new data structures for fast indexing
New coverage of image coding, including descriptions of de facto standards in use on the Web (GIF and PNG), information on CALIC, the new proposed JPEG Lossless standard, and JBIG2
New information on the Internet and WWW, digital libraries, web search engines, and agent-based retrieval
Accompanied by a public domain system called MG which is a fully worked-out operational example of the advanced techniques developed and explained in the book
New appendix on an existing digital library system that uses the MG software