McCallum, Q. Ethan.

Bad data handbook / Q. Ethan McCallum. - Sebastopol, CA : O'Reilly, 2013. - xvi, 245 p. : ill.

Reprint. Originally published: 2012.

Includes bibliographical references and index.

Setting the pace : what is bad data? -- Is it just me, or does this data smell funny? -- Data intended for human consumption, not machine consumption -- Bad data lurking in plain text -- (Re)organizing the web's data -- Detecting liars and the confused in contradictory online reviews -- Will the bad data please stand up? -- Blood, sweat, and urine -- When data and reality don't match -- Subtle sources of bias and error -- Don't let the perfect be the enemy of the good : is bad data really bad? -- When databases attack : a guide for when to stick to files -- Crouching table, hidden network -- Myths of cloud computing -- The dark side of data science -- How to feed and care for your machine-learning experts -- Data traceability -- Social media : erasable ink? -- Data quality analysis demystified : knowing when your data is good enough.

9781449321888 1449321887


Database management--Handbooks, manuals, etc.
Electronic data processing--Handbooks, manuals, etc.
Data editing.
Databases--Quality control.

QA76.9.D3 / M337 2013