IPBlocklist

IPBlocklist aggregates IP and ASN threat intelligence into four release artifacts:

blocklist.bin: compact binary data for application lookups
blocklist.txt: scored, CIDR-minimized text blocklist for firewalls
asns.json: normalized ASN lists keyed by feed name
asn_prefixes.json: cached ASN → announced prefixes

The current dataset is built from 163 feeds and includes IPv4, IPv6, CIDR ranges, announced prefixes derived from ASN feeds, and proxy-type ranges from IP2X.

The feed set includes OXL risk-db-lists sources for hosting, crawlers, VPNs, scanners, proxies, Tor, ISP, education, dynamic, and top-reported reputation lists across ASN, network, and IP scopes.

Demo

A live lookup page is available at ipblocklist.tn3w.dev. It loads blocklist.bin, feeds.json, asns.json, and asn_prefixes.json client-side and supports IP and ASN queries with detailed results, feed metadata tooltips, score visualization, and announced prefix listings per ASN.

For a minimal, highly optimized API server see tn3w/ipblocklist-api:

curl https://ipblocklist-api.tn3w.dev/lookup/1.2.3.4

{
    "ip": "1.2.3.4",
    "max_score": 0.81,
    "top_category": "spam",
    "categories": ["malware", "spam"],
    "flags": ["is_spammer", "is_phishing"],
    "feeds": ["hphosts_psh", "hphosts_fsa"]
}
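As a rough illustration of consuming this response, the sketch below parses the JSON body shown above and derives a block decision. The function name `summarize_lookup` and the 0.5 threshold are assumptions made for this example; they are not part of the API.

```python
import json


def summarize_lookup(payload, threshold=0.5):
    """Parse a /lookup response body and derive a simple block decision.

    The threshold is a hypothetical policy value, not something the API defines.
    """
    result = json.loads(payload)
    score = float(result.get("max_score", 0.0))
    return {
        "ip": result["ip"],
        "blocked": score >= threshold,
        "reasons": sorted(result.get("flags", [])),
    }


# Sample body copied from the response shown above.
sample = (
    '{"ip": "1.2.3.4", "max_score": 0.81, "top_category": "spam", '
    '"categories": ["malware", "spam"], '
    '"flags": ["is_spammer", "is_phishing"], '
    '"feeds": ["hphosts_psh", "hphosts_fsa"]}'
)
print(summarize_lookup(sample))
```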

Downloads

wget https://github.com/tn3w/IPBlocklist/releases/latest/download/blocklist.bin
wget https://github.com/tn3w/IPBlocklist/releases/latest/download/blocklist.txt
wget https://github.com/tn3w/IPBlocklist/releases/latest/download/asns.json
wget https://github.com/tn3w/IPBlocklist/releases/latest/download/asn_prefixes.json

Visualizations

flowchart LR
    A[feeds.json] --> B[aggregator.py]
    B --> C[blocklist.bin]
    B --> D[asns.json]
    B --> AP[asn_prefixes.json]
    B --> E[scored ranges]
    E --> F[cidr_minimizer]
    F --> G[blocklist.txt]

flowchart TD
    A[IP and CIDR feeds] --> D[normalize and deduplicate]
    B[ASN feeds from remote sources] --> C[RIPEstat announced prefixes]
    S[Static ASN lists in feeds.json] --> C
    C --> D
    C --> AP[asn_prefixes.json]
    S --> J[asns.json]
    B --> J
    D --> E[blocklist.bin]
    D --> F[scoring and thresholding]
    F --> G[cidr_minimizer]
    G --> H[blocklist.txt]

Pipeline

aggregator.py downloads feeds, resolves ASNs to announced prefixes via RIPEstat (cached in asn_prefixes.json), merges overlapping ranges, and writes blocklist.bin, asns.json, asn_prefixes.json, and blocklist.txt.
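The range-merging step can be pictured with a minimal sketch like the one below. It is not taken from aggregator.py; `merge_ranges` is a hypothetical helper, ranges are assumed to be inclusive integer pairs, and merging directly adjacent ranges is also an assumption.

```python
def merge_ranges(ranges):
    """Merge overlapping or directly adjacent inclusive (start, end) ranges."""
    merged = []
    for start, end in sorted(ranges):
        if merged and start <= merged[-1][1] + 1:
            # Overlaps or touches the previous range: extend it.
            merged[-1][1] = max(merged[-1][1], end)
        else:
            merged.append([start, end])
    return [tuple(item) for item in merged]


print(merge_ranges([(50, 60), (10, 20), (15, 30), (31, 40)]))
```

Sorting first means each range only has to be compared with the most recently merged one, so the whole pass is a sort plus one linear scan.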

Feeds marked is_asn support remote (url + regex) or static (asns) input. Use base_score: 0.0 to include an ASN feed in blocklist.bin/asns.json without affecting blocklist.txt.

Artifacts

`blocklist.bin`

Self-describing binary format (v2) for fast lookups. No external JSON needed.

[4 bytes: magic "IPBL"]
[1 byte: version (2)]
[4 bytes: timestamp (unix, LE)]
[1 byte: flag count]
for each flag:
  [1 byte: name length]
  [N bytes: flag name (utf-8)]
[1 byte: category count]
for each category:
  [1 byte: name length]
  [N bytes: category name (utf-8)]
[2 bytes: feed count (LE)]
for each feed:
  [1 byte: feed name length]
  [N bytes: feed name (utf-8)]
  [1 byte: base_score (0-200, divide by 200.0)]
  [1 byte: confidence (0-200, divide by 200.0)]
  [4 bytes: flags bitmask (LE, bit i = flag at index i)]
  [1 byte: categories bitmask (bit i = category at index i)]
  [4 bytes: range count (LE)]
  for each range:
    [varint: start delta from previous start]
    [varint: range size (end - start)]

Flags and categories are stored as string tables followed by bitmasks per feed, keeping the format compact and fully self-contained.
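A minimal parsing sketch for the layout above follows. It makes two assumptions the layout does not state: varints are unsigned LEB128 (7 data bits per byte, high bit as continuation), and the first range in each feed is a delta from 0. The helper names are hypothetical; the implementations in examples/ remain the reference.

```python
import struct


def read_varint(buf, pos):
    """Decode an unsigned LEB128 varint (assumed encoding); return (value, new_pos)."""
    value = shift = 0
    while True:
        byte = buf[pos]
        pos += 1
        value |= (byte & 0x7F) << shift
        if not byte & 0x80:
            return value, pos
        shift += 7


def read_names(buf, pos):
    """Read a 1-byte count followed by length-prefixed UTF-8 names."""
    count = buf[pos]
    pos += 1
    names = []
    for _ in range(count):
        length = buf[pos]
        names.append(buf[pos + 1:pos + 1 + length].decode("utf-8"))
        pos += 1 + length
    return names, pos


def parse_blocklist(buf):
    """Parse a v2 blocklist.bin byte string into a dict."""
    if buf[:4] != b"IPBL":
        raise ValueError("bad magic")
    if buf[4] != 2:
        raise ValueError(f"unsupported version {buf[4]}")
    (timestamp,) = struct.unpack_from("<I", buf, 5)
    flags, pos = read_names(buf, 9)
    categories, pos = read_names(buf, pos)
    (feed_count,) = struct.unpack_from("<H", buf, pos)
    pos += 2
    feeds = []
    for _ in range(feed_count):
        name_len = buf[pos]
        name = buf[pos + 1:pos + 1 + name_len].decode("utf-8")
        pos += 1 + name_len
        base_score = buf[pos] / 200.0
        confidence = buf[pos + 1] / 200.0
        (flag_mask,) = struct.unpack_from("<I", buf, pos + 2)
        category_mask = buf[pos + 6]
        (range_count,) = struct.unpack_from("<I", buf, pos + 7)
        pos += 11
        ranges, start = [], 0  # assumption: the first delta is relative to 0
        for _ in range(range_count):
            delta, pos = read_varint(buf, pos)
            size, pos = read_varint(buf, pos)
            start += delta
            ranges.append((start, start + size))  # inclusive (start, end)
        feeds.append({
            "name": name,
            "base_score": base_score,
            "confidence": confidence,
            # Bit i of each mask selects entry i of the matching string table.
            "flags": [f for i, f in enumerate(flags) if flag_mask >> i & 1],
            "categories": [c for i, c in enumerate(categories) if category_mask >> i & 1],
            "ranges": ranges,
        })
    return {"timestamp": timestamp, "feeds": feeds}
```

Usage would look like `parse_blocklist(open("blocklist.bin", "rb").read())`. How IPv4 and IPv6 ranges are distinguished is not described in the layout, so this sketch returns raw integer ranges without interpreting them.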

See the examples/ directory for lookup implementations in many languages.
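For orientation, one common way to make such lookups fast is a binary search over each feed's sorted ranges. The sketch below assumes ranges have already been decoded into sorted, non-overlapping inclusive integer pairs per feed and handles IPv4 only; `feeds_for_ip` and the feed names are hypothetical, and the shipped examples may work differently.

```python
import bisect
import ipaddress


def feeds_for_ip(ip, feed_ranges):
    """Return names of feeds whose ranges contain the given IPv4 address.

    feed_ranges maps feed name -> sorted, non-overlapping list of
    inclusive (start, end) integer pairs (an assumed in-memory layout).
    """
    value = int(ipaddress.IPv4Address(ip))
    hits = []
    for name, ranges in feed_ranges.items():
        # Index of the last range whose start is <= value.
        index = bisect.bisect_right(ranges, (value, float("inf"))) - 1
        if index >= 0 and ranges[index][1] >= value:
            hits.append(name)
    return hits


def as_int(text):
    """Convert a dotted IPv4 string to its integer value."""
    return int(ipaddress.IPv4Address(text))


demo_ranges = {
    "demo_datacenter": [(as_int("8.8.8.0"), as_int("8.8.8.255"))],
    "demo_single": [(as_int("1.1.1.1"), as_int("1.1.1.1"))],
}
print(feeds_for_ip("8.8.8.8", demo_ranges))
```

Each feed costs one O(log n) search, so the total work grows with the number of feeds rather than the number of entries.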

`blocklist.txt`

Text blocklist generated from scored ranges after thresholding, CIDR promotion, and non-routable range removal.

Supported output forms:

Single IPv4: 1.2.3.4
IPv4 CIDR: 1.2.3.0/24
IPv4 range: 1.2.3.1-1.2.3.254
Single IPv6: 2001:db8::1
IPv6 CIDR: 2001:db8::/32
IPv6 range: 2001:db8::1-2001:db8::ff
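To show how one inclusive range could map onto these three forms, here is a small sketch using the standard `ipaddress` module. `format_entry` is a hypothetical helper and the selection rule (single address, then exact CIDR, then explicit range) is an assumption about the output; the actual cidr_minimizer logic may differ.

```python
import ipaddress


def format_entry(start, end):
    """Render an inclusive address range in one of the blocklist.txt forms."""
    first = ipaddress.ip_address(start)
    last = ipaddress.ip_address(end)
    if first == last:
        return str(first)  # single address
    # summarize_address_range yields the minimal CIDR cover of the range.
    networks = list(ipaddress.summarize_address_range(first, last))
    if len(networks) == 1:
        return str(networks[0])  # range is exactly one CIDR block
    return f"{first}-{last}"  # fall back to an explicit range


print(format_entry("1.2.3.0", "1.2.3.255"))
print(format_entry("1.2.3.1", "1.2.3.254"))
```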

`asns.json`

JSON object keyed by feed name.

{
    "datacenter_asns": ["16509", "15169"],
    "bgptools_c2_asns": ["14618"],
    "bgptools_tor_asns": ["60729", "53667"],
    "tor_static_asns": ["60729", "53667"]
}

`asn_prefixes.json`

JSON object keyed by ASN.

{
    "16509": ["192.0.2.0/24", "198.51.100.0/24"],
    "15169": ["203.0.113.0/24"]
}

Feed Model

Common fields:

name
description
base_score
confidence
flags
categories

flags are boolean indicators. Canonical values:

is_anycast
is_brute_force
is_c2_server
is_cdn
is_compromised
is_datacenter
is_isp
is_malware
is_mobile
is_phishing
is_crawler
is_proxy
is_scanner
is_spammer
is_tor
is_vpn

categories are scoring buckets. Supported values:

anonymizer
attacks
botnet
compromised
infrastructure
malware
spam

IP and CIDR feed fields:

url
regex

ASN feed fields:

is_asn
url and regex, or asns

Optional fields:

provider_name
asns
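The field rules above can be checked mechanically. The following sketch is a hypothetical validator, not part of the repository; it assumes that `name`, `base_score`, and `confidence` are required and that both scores lie in [0.0, 1.0], which is inferred from the 0-200 byte encoding in blocklist.bin.

```python
CANONICAL_FLAGS = {
    "is_anycast", "is_brute_force", "is_c2_server", "is_cdn",
    "is_compromised", "is_datacenter", "is_isp", "is_malware",
    "is_mobile", "is_phishing", "is_crawler", "is_proxy",
    "is_scanner", "is_spammer", "is_tor", "is_vpn",
}
SUPPORTED_CATEGORIES = {
    "anonymizer", "attacks", "botnet", "compromised",
    "infrastructure", "malware", "spam",
}


def validate_feed(feed):
    """Return a list of problems in one feed entry (empty list means valid)."""
    problems = []
    if not feed.get("name"):
        problems.append("missing name")
    for key in ("base_score", "confidence"):
        value = feed.get(key)
        # Assumed range, derived from the byte / 200.0 encoding.
        if not isinstance(value, (int, float)) or not 0.0 <= value <= 1.0:
            problems.append(f"{key} must be a number in [0.0, 1.0]")
    for flag in feed.get("flags", []):
        if flag not in CANONICAL_FLAGS:
            problems.append(f"unknown flag: {flag}")
    for category in feed.get("categories", []):
        if category not in SUPPORTED_CATEGORIES:
            problems.append(f"unknown category: {category}")
    has_remote = bool(feed.get("url")) and bool(feed.get("regex"))
    if feed.get("is_asn"):
        # ASN feeds take remote input (url + regex) or a static asns list.
        if not (has_remote or feed.get("asns")):
            problems.append("ASN feed needs url and regex, or asns")
    elif not has_remote:
        problems.append("IP/CIDR feed needs url and regex")
    return problems


# Hypothetical static ASN feed that stays out of blocklist.txt (base_score 0.0).
static_feed = {
    "name": "tor_static_asns",
    "description": "Example static ASN list",
    "base_score": 0.0,
    "confidence": 0.9,
    "flags": ["is_tor"],
    "categories": ["anonymizer"],
    "is_asn": True,
    "asns": ["60729", "53667"],
}
print(validate_feed(static_feed))
```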

Usage

Build the artifacts locally:

python aggregator.py

ASN prefix lookups are cached in asn_prefixes.json (ASN → announced prefixes). On the first run the cache is empty and every ASN is resolved via RIPEstat. Subsequent runs skip resolved ASNs and only fetch new ones, making incremental rebuilds significantly faster. Delete asn_prefixes.json to force a full refresh.
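The caching behaviour described above can be sketched as follows. This is an illustration rather than the code in aggregator.py: `resolve_with_cache` is a hypothetical name, and `resolver` stands in for whatever function queries RIPEstat for one ASN.

```python
import json
from pathlib import Path


def resolve_with_cache(asns, resolver, cache_path="asn_prefixes.json"):
    """Resolve ASNs to prefixes, fetching only those missing from the cache.

    resolver is a caller-supplied function (hypothetical) that takes an ASN
    string and returns its list of announced prefixes.
    Returns (cache, list of ASNs that were actually fetched).
    """
    path = Path(cache_path)
    # A missing cache file means a first run: everything is resolved.
    cache = json.loads(path.read_text()) if path.exists() else {}
    missing = [asn for asn in asns if asn not in cache]
    for asn in missing:
        cache[asn] = resolver(asn)
    if missing:
        # Persist only when something changed.
        path.write_text(json.dumps(cache, indent=4))
    return cache, missing
```

Passing the resolver in as a parameter keeps the network call separate from the cache logic, so the cache can be exercised with a stub. Deleting the cache file makes `missing` cover every ASN again, which matches the full-refresh behaviour described above.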

Query blocklist.bin for one or more IPs:

python lookup.py 8.8.8.8 1.1.1.1

Output includes feed metadata:

8.8.8.8: x4bnet_datacenter_ipv4 | score=0.11 | flags=is_datacenter | cats=infrastructure

Example Implementations

The examples/ directory contains complete single-file lookup implementations:

Language	File	IPv6
C	`lookup.c`	yes
C++	`lookup.cpp`	no
C#	`lookup.cs`	no
Crystal	`lookup.cr`	no
D	`lookup.d`	no
Dart	`lookup.dart`	no
Elixir	`lookup.exs`	no
Erlang	`lookup.erl`	yes
Go	`lookup.go`	yes
Haskell	`lookup.hs`	no
Java	`lookup.java`	no
JavaScript	`lookup.js`	no
Kotlin	`lookup.kt`	no
Lua	`lookup.lua`	no
Nim	`lookup.nim`	no
Perl	`lookup.pl`	no
PHP	`lookup.php`	no
Python	`lookup.py`	yes
Ruby	`lookup.rb`	yes
Rust	`lookup.rs`	yes
Scala	`lookup.scala`	no
Shell	`lookup.sh`	no
Swift	`lookup.swift`	no
TypeScript	`lookup.ts`	no
Zig	`lookup.zig`	no

A fully typed Python variant is in lookup_typed.py.

Load the text blocklist into ipset:

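# hash:net defaults to IPv4; IPv6 entries need a second set created with "family inet6".
ipset create blocklist hash:net
while IFS= read -r line; do
  # Skip blank lines and comments.
  [[ -z "$line" || "$line" =~ ^# ]] && continue
  # Entries the set rejects (e.g. IPv6 in an IPv4 set) are silently dropped.
  ipset add blocklist "$line" 2>/dev/null
done < blocklist.txt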

Read asns.json in Python:

import json


with open("asns.json") as file:
    asn_lists = json.load(file)

tor_asns = set(asn_lists["bgptools_tor_asns"])
print("60729" in tor_asns)

Read asn_prefixes.json in Python:

import json

with open("asn_prefixes.json") as file:
    asn_prefixes = json.load(file)

asn = "16509"
prefixes = asn_prefixes.get(asn, [])
print(f"ASN {asn} announces these prefixes: {prefixes}")

Check whether an IP is covered by blocklist.txt in Python:

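import ipaddress


def line_matches_ip(line, address):
    """Return True if one blocklist.txt entry covers the given address."""
    # Skip blank lines and comments.
    if not line or line.startswith("#"):
        return False

    # Range entry: "start-end", inclusive on both ends.
    if "-" in line:
        start_text, end_text = line.split("-", 1)
        start = ipaddress.ip_address(start_text)
        end = ipaddress.ip_address(end_text)
        # Compare only within the same IP version; the integer values of
        # IPv4 and IPv6 addresses overlap and would otherwise match falsely.
        if start.version != address.version:
            return False
        return start <= address <= end

    # CIDR entry; a network of the other IP version never contains the address.
    if "/" in line:
        return address in ipaddress.ip_network(line, strict=False)

    # Single-address entry.
    return address == ipaddress.ip_address(line)


def ip_in_blocklist_txt(ip_value, path="blocklist.txt"):
    """Linearly scan blocklist.txt for an entry covering ip_value."""
    address = ipaddress.ip_address(ip_value)

    with open(path) as file:
        for raw_line in file:
            if line_matches_ip(raw_line.strip(), address):
                return True

    return False


print(ip_in_blocklist_txt("8.8.8.8"))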

Performance

Total feeds: 163
Proxy type ranges: 4.1M
Total entries: about 9.1M
Typical lookup latency: under 1 ms
Binary size: about 12 MB
In-memory footprint: about 120 MB

Contributors

AI Disclosure

This project was developed with the assistance of AI tools, including GPT-5.4 and Claude Opus 4.6. These tools were used to help generate code, documentation, and other content. The human contributors provided guidance, review, and oversight throughout development to ensure the quality and accuracy of the final product. One example of AI-generated content is the examples/ directory, which contains lookup implementations in multiple programming languages.

Check feed count

python3 -c "import json; d=json.load(open('feeds.json')); print(len(d))"

License

LICENSE
