Ending Spam: Bayesian Content Filtering and the Art of Statistical Language Classification Paperback – Jul 1 2005
Customers Who Bought This Item Also Bought
No Kindle device required. Download one of the Free Kindle apps to start reading Kindle books on your smartphone, tablet, and computer.
Getting the download link through email is temporarily not available. Please check back later.
To get the free app, enter your mobile phone number.
About the Author
Jonathan Zdziarski is better known as the hacker "NerveGas" in the iPhone development community. His work in cracking the iPhone helped lead the effort to port the first open source applications, and his book, iPhone Open Application Development, taught developers how to write applications for the popular device long before Apple introduced its own SDK. Prior to the release of iPhone Forensics, Jonathan wrote and supported an iPhone forensics manual distributed exclusively to law enforcement. Jonathan frequently consults law enforcement agencies and assists forensic examiners in their investigations. He teaches an iPhone forensics workshop in his spare time to train forensic examiners and corporate security personnel.
Jonathan is also a full-time research scientist specializing in machine learning technology to combat online fraud and spam, an effort that led him to develop networking products capable of learning how to protect customers. He is founder of the DSPAM project, a high-profile, next-generation spam filter that was acquired in 2006 by Sensory Networks, Inc. He lectures widely on the topic of spam and is a foremost researcher in the fields of machine-learning and algorithmic theory.
Jonathan's website is zdziarski.com.
Most Helpful Customer Reviews on Amazon.com (beta)
That's not the case at all. Actually Markovian Discrimination is a technique I've used in other programming efforts, and the author explains it in simple and entertaining language. There's nothing here that any competent programmer can't grasp.
I'm a little hesitant to call this book entertaining, although it absolutely is. I only hesitate because that might give the impression that it's more fluff than substance, and that's not the case at all. There's a lot of substance here, both in theory and in practical advice. And although the subject is definitely spam, some of the techniques and methods discussed here apply to other programming challenges as well.
The first part of the book is especially enjoyable. It's a history of spam, and I learned things I hadn't known before about spam's early days. It then segues into analysis; in a sense you get desert before the meat and potatoes.
Overall, worth reading, even by non-programmers wanting to understand more about what current anti-spam efforts are all about.
The author includes an excellent section on spammer tricks and how they get past fileters as well as what to do about it. This section alone makes the book worth the price. Ending Spam is a highly recommended read for anyone in charge of controlling spam in a corporate environment as well as on their own system.
Also it is written on the cover that it is f.e focused towards developers, network admins etc. I would consider the target customer to be IT Managers, or other curious people who want to get an overview.
Thats what it does and it does it very well in my eyes.
The book provides simplified, abstract overviews of some available spam filters solutions.
The book is provided into 3 parts
- An Introduction part to spam filtering (Chapter 1-4)
- A part describing "Fundamentals of Statistical Filtering" (Chapter 5-9)
- an the third part describing "Advanced Concepts of Statistical Filtering" (Chapter 10-14)
Its a bit confusing that Chapter 4 has the same title than Part II. So perhaps Chapter 4 should have been part of "Part II" ?
The Chapters which I found most interesting were:
Chapter 4 "Fundamentals of Statistical Filtering"
Chapter 7 "The Low down dirty Tricks of spammers"
Chapter 9 "Scaling in Large Environments"
I am sure the author could have easily filled the book with Chapter 7 alone. The book is very entertaining and has a nice motivating writing style. You might at times find some rant about the spammers which I have chosen to ignore as it doesnt contain any valuable information or anything which I didnt know already. While I might agree to some of the authors views, I believe that the rant does unfortunately do exactly the opposite in my eyes and does give spammers credit to how they do their work.
I personally was actually looking for a companion book to "The Book of Postfix" to help me further explore new anti spam technology.
I was hoping to find overview charts, being able to compare different solutions,features, (dis)advantages. So in this sense, I was actually looking for workshop style instructions, tuning advice, troubleshooting advice etc.
The authors does explain f.e (Chapter 14) Collaborative Algorithms but he does not go into detail which products support the feature and how to perform the setup. He does provide some weblinks in his book from which the interested reader might further investigate the topic.
From reading the Chapter10 on "Testing Theory" its easier to conclude why the author doesnt go into more detail. If he would have done so, the book could have been easily 2-3 times the size.
I assume, this is partly due to the fact that the anti spam technology /products/market is still fairly young .
"Ending Spam" gives a very BASIC INTRODUCTION to the current available Anti spam technology and some chosen products. After you have read the book you have a first vague idea what type of solutions exist. You will actually need other books to intensify the "knowledge" you have gained here.
The fact that the book is written in simple terms makes it easily acessable for a wide market, however if you are a technichian you will perhaps find that the book just doesnt contain enough "meat" for you.
I would still recommend the book for Managers which need to know only the rough details, beginners, or a first time read for newcomers.
This book gives you a lot of background on the growth of spam and the various type of spam on how each new variant of spam effects different type of spam detection with obvious concentration on statistical analysis and not the tradition aka "dumb" forms of regex matching and etc.
The only gripe I have is that I feel there is too much time spent on the growth/history of spam. It would be better if the entire book is dedicated at the art/science of statisical analysis and fighting spam.
Note that this book does not target the lowest denominator and some the science flew over my head. Well, I did flunk almost all my higher ed physics and stat classes so take this with a spoon of salt.
Look for similar items by category
- Books > Computers & Technology > Hardware > Microprocessors & System Design > DSPs
- Books > Computers & Technology > History & Culture > Privacy
- Books > Computers & Technology > Internet & Social Media
- Books > Computers & Technology > Networking & Cloud Computing > Data in the Enterprise
- Books > Computers & Technology > Networking & Cloud Computing > Internet, Groupware, & Telecommunications
- Books > Computers & Technology > Networking & Cloud Computing > Intranets & Extranets
- Books > Computers & Technology > Networking & Cloud Computing > Network Administration > Email Administration
- Books > Computers & Technology > Networking & Cloud Computing > Network Security
- Books > Computers & Technology > Networking & Cloud Computing > Networks, Protocols & APIs > LAN
- Books > Computers & Technology > Security & Encryption > Privacy & Online Safety
- Books > Computers & Technology > Security & Encryption > Viruses
- Books > Computers & Technology > Software > E-mail
- Books > Textbooks > Computer Science & Information Systems > Networking