was successfully added to your cart.

Automated de-identification of medical documents & images

One kind of noisy data that healthcare data scientists deal with is scanned documents and images: from PDF attachments of lab results, referrals, or genetic testing to DICOM files with medical imaging.

These files are challenging to de-identify because personal health information (PHI) can appear anywhere in free text – so cannot be removed with rules or regular expressions – or “burned” into images so that it’s not even available as digital text, to begin with.

Developing language of collaboration for healthier neighborhoods

At Cityblock, we aim to deliver better healthcare by addressing the root causes of health, be it medical, behavioral or social, or...