High performance Spark : (Record no. 59178)

000 -LEADER
fixed length control field 03009cam a2200373Ii 4500
001 - CONTROL NUMBER
control field ocn933521387
003 - CONTROL NUMBER IDENTIFIER
control field OCoLC
005 - DATE AND TIME OF LATEST TRANSACTION
control field 20170912161715.0
008 - FIXED-LENGTH DATA ELEMENTS--GENERAL INFORMATION
fixed length control field 151225s2017 cauad 001 0 eng d
020 ## - INTERNATIONAL STANDARD BOOK NUMBER
International Standard Book Number 9781491943205
Qualifying information (paperback)
020 ## - INTERNATIONAL STANDARD BOOK NUMBER
International Standard Book Number 1491943203
Qualifying information (paperback)
035 ## - SYSTEM CONTROL NUMBER
System control number (OCoLC)933521387
Canceled/invalid control number (OCoLC)933438698
-- (OCoLC)959851418
040 ## - CATALOGING SOURCE
Original cataloging agency BTCTA
Language of cataloging eng
Description conventions rda
Transcribing agency BTCTA
Modifying agency CNR
049 ## - LOCAL HOLDINGS (OCLC)
Holding library CNRM
050 #4 - LIBRARY OF CONGRESS CALL NUMBER
Classification number QA76.9.D343
Item number K37 2017
090 ## - LOCALLY ASSIGNED LC-TYPE CALL NUMBER (OCLC); LOCAL CALL NUMBER (RLIN)
Classification number (OCLC) (R) ; Classification number, CALL (RLIN) (NR) QA76.9.D343
Local cutter number (OCLC) ; Book number/undivided call number, CALL (RLIN) K37 2017
100 1# - MAIN ENTRY--PERSONAL NAME
Personal name Karau, Holden,
Relator term author.
245 10 - TITLE STATEMENT
Title High performance Spark :
Remainder of title best practices for scaling and optimizing Apache Spark /
Statement of responsibility, etc. Hoolden Karau & Rachel Warren.
250 ## - EDITION STATEMENT
Edition statement First edition : June 2017
264 #1 - PRODUCTION, PUBLICATION, DISTRIBUTION, MANUFACTURE, AND COPYRIGHT NOTICE
Place of production, publication, distribution, manufacture Sebastopol, CA :
Name of producer, publisher, distributor, manufacturer O'Reilly Media, Inc.,
Date of production, publication, distribution, manufacture, or copyright notice 2017
264 #4 - PRODUCTION, PUBLICATION, DISTRIBUTION, MANUFACTURE, AND COPYRIGHT NOTICE
Date of production, publication, distribution, manufacture, or copyright notice 2017
300 ## - PHYSICAL DESCRIPTION
Extent xiv, 341 pages :
Other physical details black and white illustrations, graphs, charts ;
Dimensions 24 cm
336 ## - CONTENT TYPE
Content type term text
Content type code txt
Source rdacontent
337 ## - MEDIA TYPE
Media type term unmediated
Media type code n
Source rdamedia
338 ## - CARRIER TYPE
Carrier type term volume
Carrier type code nc
Source rdacarrier
500 ## - GENERAL NOTE
General note Includes index.
505 0# - FORMATTED CONTENTS NOTE
Formatted contents note Table of Contents : Preface -- 1. Introduction high performance Spark -- 2. How Spark works -- 3. Dataframes, datasets, and Spark SQL -- 4. Joins (SQL and Core) -- 5. Effective transformations -- 6. Working with Key/Value Data -- 7. Going beyond Scala -- 8. Testing and validation -- 9. Spark MLlib and ML -- 10. Spark components and packages -- A. Tuning, debugging and other things developers like to pretend don't exist -- Index.
520 ## - SUMMARY, ETC.
Summary, etc. "Apache Spark is amazing when everything clicks. But if you haven't seen the performance improvements you expected, or still don't feel confident enough to use Spark in production, this practical book is for you. Authors Holden Karau and Rachel Warren demonstrate performance optimizations to help your Spark queries run faster and handle larger data sizes, while using fewer resources. Ideal for software engineers, data engineers, developers, and system administrators working with large-scale data applications, this book describes techniques that can reduce data infrastructure costs and developer hours. Not only will you gain a more comprehensive understanding of Spark, you'll also learn how to make it sing. With this book, you'll explore : How Spark SQL's new interfaces improve performance over SQL's RDD data structure ; The choice between data joins in Core Spark and Spark SQL ; Techniques for getting the most out of standard RDD transformations ; How to work around performance issues in Spark's key/value pair paradigm ; Writing high-performance Spark code without Scala or the JVM ; How to test for functionality and performance when applying suggested improvements ; Using Spark MLlib and Spark ML machine learning libraries ; Spark's Streaming components and external community packages." -- back cover.
630 00 - SUBJECT ADDED ENTRY--UNIFORM TITLE
Uniform title Spark (Electronic resource : Apache Software Foundation)
650 #0 - SUBJECT ADDED ENTRY--TOPICAL TERM
Topical term or geographic name as entry element Big data.
650 #0 - SUBJECT ADDED ENTRY--TOPICAL TERM
Topical term or geographic name as entry element Data mining
General subdivision Computer programs.
700 1# - ADDED ENTRY--PERSONAL NAME
Personal name Warren, Rachel,
Relator term author.
942 ## - ADDED ENTRY ELEMENTS (KOHA)
Source of classification or shelving scheme
Koha item type BOOK
Holdings
Price effective from Permanent Location Date last seen Not for loan Date acquired Source of classification or shelving scheme Koha item type Lost status Withdrawn status Cost, normal purchase price Source of acquisition Shelving location Damaged status Barcode Current Location Full call number
2017-09-12NCAR Library2017-09-12 2017-09-12 BOOK  33.99purchasedMesa Lab 50583020006486NCAR LibraryQA76.9.D343 K37 2017

Any questions? Ask a Librarian.

Not finding what you are looking for? Request-It - InterLibrary Loan.

Languages: