brandonrobertz - Overview

Brandon Roberts brandonrobertz

Organizations

@html-extract @next-LI

Yes Hello

I'm Brandon Roberts. I'm an investigative journalist specializing in open source and bringing computational techniques to journalism projects. You can read more on my site: bxroberts.org

Pinned

  1. A Locality Sensitive Hashing (LSH) library with an emphasis on large, high-dimensional datasets.

    Python, 150 stars, 27 forks

  2. ProPublica's collaborative tip-gathering framework. Import and manage CSV, Google Sheets and Screendoor data with ease.

    Python, 100 stars, 18 forks

  3. An automated, programming-free web scraper for interactive sites

    HTML, 111 stars, 20 forks

  4. A proof of concept tool for using ChatGPT to transform messy text documents into structured JSON

    Python, 122 stars, 11 forks

  5. Use Hext in a browser or with node. Hext is a domain-specific language for extracting structured data from HTML documents.

    C++, 6 stars, 1 fork

  6. A proof of concept tool for using local LLMs to transform messy text documents into structured JSON

    Python, 25 stars, 1 fork