Data Munging with Perl Paperback – Jan 1 2001
Customers Who Bought This Item Also Bought
No Kindle device required. Download one of the Free Kindle apps to start reading Kindle books on your smartphone, tablet, and computer.
Getting the download link through email is temporarily not available. Please check back later.
To get the free app, enter your mobile phone number.
" . . . well written, informative, thought provoking . . . will be as relevant five years from now as it is today. . . . buy [one]." -- Dr. Dobbs Journal
"A very good resource for programmers who want to learn more about data parsing, data filters, and data conversion..." -- ACM Computing Reviews
"I found the sample problems and the author's solutions to be very well done. I especially liked the design tips..." -- Pikes Peak Perl Mongers
"Well worth the price, and a good starting point for more advanced forays." -- Use.Perl.com
the chapters are concise, the coverage is comprehensive, and the examples are plentiful and relevant. -- Web Techniques Magazine
About the Author
Top Customer Reviews
The book is written for programmers or analysts who transform data as a regular part of their jobs. It assumes a beginning knowledge of Perl programming, as one might gain from reading Learning Perl. Part I introduces data munging as a recurring necessary evil and points out aspects of Perl that recommend it for this task. Part II surveys different types of unstructured and semi-structured data formats and suggests Perl-based strategies for working with them. PART III examines the limitations of simple data formats and discusses parsing strategies and specific techniques for working with HTML, XML and other hierarchical data structures. PART IV extracts some useful lessons from the previous chapters and suggests sources for additional study. The organization is logical and easy to follow.
Cross has written a well-designed book with helpful examples and insights. The accompanying book web site and author web site provide downloadable code and other resources. This book is of course most useful to those working in Perl. But many general concepts and strategies have transferred well to data munging tasks I have done in TextPipe.
One of Perl's mottos is: "There's more than one way to do it." A variety of ways are illustrated and explained in this book. Note that it is over ten years old and does not include the latest evolutions of the Perl language.
As I said, it may be good for data-processing beginners, but Perl experts will hardly find lot's of new information in it.
P.S. I trust him and therefore follow his advices in every script I start to think of ( especially the one about "UNIX filter model" ).
The author gives you enough information, and background to start working with the more advanced Perl functions like map, grep, pack, unpack, etc. It is possible to write Perl without ever having to use these modules, but David Cross shows you how they are more effective, more powerful. This book will expand your Perl vocabulary by leaps and bounds.
I know that some people would say that the book is too thin, and it is thinner than many computer books today, but the thickness of a book does not determine it's merit. Effective Perl Programming by Joseph Hall and Randal Schwartz is often cited as one of the best Perl books ever and it's thinner than this one.
If you are a junior to intermediate level programmer, and you want to improve your Perl skills, pick up this book. You won't be disappointed.
Starting with the source/filter/sink theory of data manipulation and demonstrating every tip and technique with clear and efficient examples, without severe digressions into mythological whimsy, this book would make an excellent second text on the Perl language, or a suitable first for someone who is good with programming languages.
Many of the techniques contained in it are of "trade secret" quality; they are the sort of write-the-number-of-gallons-of-paint-it-took-to-paint- the-room-on-the-back-of-the-light-switch-cover practices that until now had to be learned or happened upon by every programmer, alone, or by example, rather than in the context of a coherent theory.
The theoretical side, in which "munging" is defined and most software activity is described in terms of it, is clear enough that the book might be an interesting read for management, to answer the question "Just what is it about Perl that makes those who use it regularly so confoundedly fanatical?"
If you've ever been mystified by a Perl wizard who found it easier to export the records from the fancy GUI database into a comma delimited text file and then sort and display the data with mysterious little programs rather than use the GUI's native report generator, and want to find out why, or if you would like to become such a person yourself, or if you already are such a person but would like to get better at it, this book is for you.
Most recent customer reviews
This book isn't about arcane corners of Perl theory. It's about how to write Perl programs that perform the "simple" task of converting data from one format to... Read morePublished on July 2 2002 by Sean Burke
After reading this book I rewrote a pretty massive postscript pasrsing and munging system that I was having a lot of trouble with and felt like I did it the _right_ way. Read morePublished on July 24 2001 by Robert J. Monn
I was hoping this book would provide some valuable routines for processing data, but instead it has proved virtually useless in my day to day job as a UNIX data center... Read morePublished on June 11 2001
I was perusing the shelves at my local bookstore, when this title jumped out and grabbed me. Not only is it a unique and interesting title (Something uncommon for computer books),... Read morePublished on Feb. 12 2001 by Adam Sroka
Look for similar items by category
- Books > Computers & Technology > Computer Science
- Books > Computers & Technology > Databases > Relational Databases
- Books > Computers & Technology > Internet & Social Media
- Books > Computers & Technology > Networking & Cloud Computing > Networks, Protocols & APIs
- Books > Computers & Technology > Programming > Languages & Tools
- Books > Computers & Technology > Software > Databases