diff --git a/.gitignore b/.gitignore index 79e5387c..582de118 100644 --- a/.gitignore +++ b/.gitignore @@ -15,3 +15,5 @@ test *.csv # LLM Tools .claude +# Link checker reports +link-reports/ diff --git a/docs/bee/faq.md b/docs/bee/faq.md index fa999db4..a289bdfc 100644 --- a/docs/bee/faq.md +++ b/docs/bee/faq.md @@ -165,7 +165,7 @@ Therefore, the rule is, each node must have: ### How can I add Gnosis / Sepolia to Metamask? -You can easily add Sepolia or Gnosis to metamask using the [official guide from Metamask](https://support.metamask.io/networks-and-sidechains/managing-networks/how-to-add-a-custom-network-rpc/). +You can easily add Sepolia or Gnosis to metamask using the [official guide from Metamask](https://support.metamask.io/configure/networks/how-to-add-a-custom-network-rpc/). If you are using a different wallet which does not have an easy option for adding networks like Metamask does, then you may need to add the networks manually. You need to fill in four pieces of information to do so: diff --git a/docs/bee/installation/connectivity.md b/docs/bee/installation/connectivity.md index eba8bd1b..5b5afc6d 100644 --- a/docs/bee/installation/connectivity.md +++ b/docs/bee/installation/connectivity.md @@ -17,7 +17,7 @@ swarm, below you will find a detailed guide to navigating your way through your network and making it out into the wild so you can buzz around fellow bees and maximize your chances of earning xBZZ. If you still have problems, please join us in our [Discord -server](https://discord.gg/wdghaQsGq5) and we'll help you find the +server](https://discord.gg/kHRyMNpw7t) and we'll help you find the way! 
🐝 🐝 🐝 🐝 🐝 :::warning diff --git a/docs/bee/installation/hive.md b/docs/bee/installation/hive.md index 969cdaf4..cff46d8d 100644 --- a/docs/bee/installation/hive.md +++ b/docs/bee/installation/hive.md @@ -37,4 +37,4 @@ Configure your nodes as desired, but ensure that the values `api-addr`, `data-di ### Monitoring -See the [logging section](./../working-with-bee/logs-and-files.md) for more information on how to access your node's metrics. Share your community creations (such as [swarmMonitor](https://github.com/doristeo/SwarmMonitoring) - thanks doristeo!) in the [#node-operators](https://discord.gg/X3ph5yGRFU) channel of our Discord server so we can add you to our list of all things that are [awesome](https://github.com/ethersphere/awesome-swarm) and Swarm. 🧑 +See the [logging section](./../working-with-bee/logs-and-files.md) for more information on how to access your node's metrics. Share your community creations (such as [swarmMonitor](https://github.com/doristeo/SwarmMonitoring) - thanks doristeo!) in the [#node-operators](https://discord.gg/kHRyMNpw7t) channel of our Discord server so we can add you to our list of all things that are [awesome](https://github.com/ethersphere/awesome-swarm) and Swarm. 🧑 diff --git a/docs/bee/installation/package-manager.md b/docs/bee/installation/package-manager.md index 55be3581..9b9ac07c 100644 --- a/docs/bee/installation/package-manager.md +++ b/docs/bee/installation/package-manager.md @@ -106,7 +106,7 @@ Config: /etc/bee/bee.yaml Bee requires a Gnosis Chain RPC endpoint to function. By default this is expected to be found at ws://localhost:8546. -Please see https://docs.ethswarm.org/docs/installation/install for more details on how to configure your node. +Please see https://docs.ethswarm.org/docs/bee/installation/getting-started for more details on how to configure your node. After you finish configuration run 'sudo bee-get-addr' and fund your node with XDAI, and also XBZZ if so desired. 
@@ -122,7 +122,7 @@ When Bee is installed using a package manager, a `bee.yaml` file containing the While this package manager install guide uses the `bee.yaml` file for setting configuration options, there are [several other available methods for setting node options](./../working-with-bee/configuration.md). ::: -After installation, you can check that the file was successfully generated and contains the [default configuration](https://github.com/ethersphere/bee/blob/master/packaging) for your system: +After installation, you can check that the file was successfully generated and contains the [default configuration](https://github.com/ethersphere/bee/tree/master/packaging) for your system: + for m in re.finditer(r'<a[^>]*href=["\']([^"\']+)["\']', content, re.IGNORECASE): + links.append(('', m.group(1))) + for m in re.finditer(r'<img[^>]*src=["\']([^"\']+)["\']', content, re.IGNORECASE): + links.append(('', m.group(1))) + + # Bare URLs — plain http(s) URLs not inside a markdown link or HTML attribute. + # Collect all URL positions already captured above to avoid double-reporting. 
+ seen_spans = set() + for m in re.finditer(r'\[([^\]]*)\]\(([^)]+)\)', content): + seen_spans.add(m.start(2)) + for m in re.finditer(r'^\[([^\]]+)\]:\s*(\S+)', content, re.MULTILINE): + seen_spans.add(m.start(2)) + for m in re.finditer(r'(?:href|src)=["\']([^"\']+)["\']', content, re.IGNORECASE): + seen_spans.add(m.start(1)) + + for m in re.finditer(r'https?://[^\s\]>"\'\\<*`]+', content): + if m.start() not in seen_spans: + url = m.group(0).rstrip('.,;:!') + # Strip trailing unbalanced close-parens + while url.endswith(')') and url.count('(') < url.count(')'): + url = url[:-1] + links.append(('', url)) + + return links + + +# ───────────────────────────────────────────── +# Helpers β€” build-output link resolution +# ───────────────────────────────────────────── + +def _frontmatter_id(md_file): + """Return the 'id' value from YAML frontmatter, or None.""" + try: + text = md_file.read_text(encoding='utf-8', errors='replace') + if not text.startswith('---'): + return None + end = text.find('\n---', 3) + if end == -1: + return None + for line in text[3:end].splitlines(): + if line.startswith('id:'): + return line[3:].strip().strip('"\'') + except Exception: + pass + return None + + +def _build_docid_map(): + """ + Scan all HTML files in the build and return a dict {doc_id: html_path}. + + Docusaurus embeds the doc ID in the class as 'docs-doc-id-{id}', + e.g. class="... docs-doc-id-concepts/DISC/disc ...". + This is the ground truth for what page is at what path β€” no inference needed. 
+ """ + mapping = {} + if not BUILD_DIR.exists(): + return mapping + for html_file in BUILD_DIR.rglob('index.html'): + try: + # Only read the opening tag (first ~500 bytes) for performance + with html_file.open(encoding='utf-8', errors='replace') as fh: + head = fh.read(800) + m = re.search(r'docs-doc-id-([^\s"\']+)', head) + if m: + mapping[m.group(1)] = html_file + except Exception: + pass + return mapping + + +# Populated once at first call to md_path_to_build_html() +_DOCID_MAP = None + + +def md_path_to_build_html(md_file): + """Map a source .md/.mdx file to the HTML file Docusaurus built from it. + + Uses the build's own HTML files (via the embedded docs-doc-id class) as the + authoritative source β€” no path inference or slug computation. + + Falls back to a computed path when the build map lookup misses. + """ + global _DOCID_MAP + if _DOCID_MAP is None: + _DOCID_MAP = _build_docid_map() + + try: + rel = md_file.relative_to(DOCS_DIR) + except ValueError: + return None + + # Compute the full doc ID: parent/local_id + local_id = _frontmatter_id(md_file) or rel.with_suffix('').name + parent = str(rel.parent).replace('\\', '/') + doc_id = local_id if parent == '.' else f"{parent}/{local_id}" + + # Look up in the reverse map first (authoritative) + if doc_id in _DOCID_MAP: + return _DOCID_MAP[doc_id] + + # Fallback: compute expected path + parent_path = rel.parent + if local_id == 'index': + return BUILD_DIR / 'docs' / parent_path / 'index.html' + return BUILD_DIR / 'docs' / parent_path / local_id / 'index.html' + + +def resolve_internal_to_build_html(source_md, link_path): + """Resolve an internal (non-http) link path to the build HTML file it corresponds to. + + Checks the build/ directory only β€” no slug inference, no source-file guessing. + Returns (html_path_or_None, error_reason_or_None). + Caller is responsible for splitting off any '#anchor' before calling. 
+ """ + decoded = unquote(link_path) + + # ── Absolute path (/docs/… or /static/…) ── + if decoded.startswith('/'): + rel = decoded.lstrip('/') + candidates = [ + BUILD_DIR / rel, + BUILD_DIR / rel / 'index.html', + BUILD_DIR / (rel + '.html'), + ] + for c in candidates: + if c.exists() and c.is_file(): + return c, None + return None, f"Not found in build: /{rel}" + + # ── Relative path ── + target = (source_md.parent / decoded).resolve() + + # Non-markdown file (image, PDF, asset): check static/ and on-disk path + if target.suffix not in ('', '.md', '.mdx'): + if target.exists(): + return target, None + try: + static_candidate = STATIC_DIR / target.relative_to(PROJECT_DIR) + if static_candidate.exists(): + return static_candidate, None + except ValueError: + pass + return None, f"File not found: {target.name}" + + # Markdown / no extension: find source file β†’ map to build HTML + md_candidates = ( + [target] if target.suffix in ('.md', '.mdx') + else [target.with_suffix('.md'), target.with_suffix('.mdx'), + target / 'index.md', target / 'index.mdx'] + ) + for md_cand in md_candidates: + if md_cand.exists() and md_cand.is_file(): + build_html = md_path_to_build_html(md_cand) + if build_html is None: + return None, "Could not map source file to build path" + if build_html.exists(): + return build_html, None + return None, "Source file exists but its build HTML was not found β€” is the build current?" 
+ + return None, "Source file not found" + + +def resolve_site_url_locally(url): + """Check a full docs.ethswarm.org URL against the local build output.""" + parsed = urlparse(url) + rel = parsed.path.rstrip('/').lstrip('/') + candidates = [ + BUILD_DIR / rel, + BUILD_DIR / rel / 'index.html', + BUILD_DIR / (rel + '.html'), + ] + for c in candidates: + if c.exists() and c.is_file(): + return True, str(c) + return False, str(BUILD_DIR / rel) + + +# ───────────────────────────────────────────── +# External URL checker +# ───────────────────────────────────────────── + +EXT_STATUS_OK = 'ok' +EXT_STATUS_404 = '404' +EXT_STATUS_DOWN = 'down' +EXT_STATUS_REDIRECT = 'redirect' +EXT_STATUS_ERROR = 'error' +EXT_STATUS_INTERNAL = 'internal_404' # full site URL that resolves locally but build says 404 + + +class _NoFollowRedirectHandler(HTTPRedirectHandler): + """Prevent urllib from automatically following redirects.""" + def redirect_request(self, req, fp, code, msg, headers, newurl): + return None # returning None makes urllib raise HTTPError with the 3xx code + + +def _build_no_redirect_opener(): + return build_opener(_NoFollowRedirectHandler()) + + +def _fetch(url, headers, method='HEAD', follow_redirects=False): + """ + Make a single HTTP request. + + follow_redirects=False: do not follow redirects; 3xx responses return the + code and Location header so the caller can decide what to do. + follow_redirects=True: follow the full redirect chain (standard urlopen behaviour). + + Returns (status_code_or_None, final_url, location_header_or_None, error_str_or_None). 
+ """ + try: + req = Request(url, headers=headers, method=method) + if follow_redirects: + with urlopen(req, timeout=EXT_TIMEOUT) as resp: + return resp.status, resp.url, None, None + else: + opener = _build_no_redirect_opener() + with opener.open(req, timeout=EXT_TIMEOUT) as resp: + return resp.status, url, resp.headers.get('Location'), None + except HTTPError as e: + loc = e.headers.get('Location') if hasattr(e, 'headers') and e.headers else None + return e.code, url, loc, None + except (URLError, socket.timeout, socket.error, ConnectionRefusedError, + http.client.RemoteDisconnected, http.client.IncompleteRead) as e: + return None, url, None, str(e) + except Exception as e: + return None, url, None, f'{type(e).__name__}: {str(e)[:80]}' + + +def _classify_connection_error(result, err): + """Populate result with the right status for a network-level error string.""" + if 'ECONNREFUSED' in err or 'Connection refused' in err: + result['status'] = EXT_STATUS_DOWN + result['error_msg'] = 'ECONNREFUSED β€” server down' + elif ('Name or service not known' in err or 'getaddrinfo' in err + or 'nodename' in err.lower() or 'No address' in err): + result['status'] = EXT_STATUS_DOWN + result['error_msg'] = 'DNS resolution failed' + elif 'timed out' in err.lower() or 'timeout' in err.lower(): + result['status'] = EXT_STATUS_DOWN + result['error_msg'] = 'Connection timed out' + elif 'SSL' in err or 'ssl' in err: + result['status'] = EXT_STATUS_DOWN + result['error_msg'] = f'SSL error: {err[:80]}' + else: + result['status'] = EXT_STATUS_DOWN + result['error_msg'] = f'Connection error: {err[:80]}' + return result + + +def _check_destination(dest_url, headers): + """ + Verify that a redirect destination is actually reachable (200). + Follows the full redirect chain from dest_url. + Returns (status_code_or_None, final_url, error_str_or_None). 
+ """ + code, final, _, err = _fetch(dest_url, headers, method='HEAD', follow_redirects=True) + if err: + return None, dest_url, err + if code in (403, 405): + # Some servers reject HEAD β€” retry with GET + code, final, _, err = _fetch(dest_url, headers, method='GET', follow_redirects=True) + if err: + return None, dest_url, err + return code, final or dest_url, None + + +def check_external_url(url): + """ + Check a single external URL. + + Strategy: + 1. HEAD request WITHOUT following redirects so we can see whether + the URL itself redirects (and where). + 2. If 3xx: explicitly fetch the redirect destination and verify it + returns 200. Only report as EXT_STATUS_REDIRECT if the destination + is reachable. A redirect that leads to a 404/down is reported as + the appropriate broken status. + 3. If HEAD is rejected (403/405): retry with GET, same logic. + + Returns dict: {url, status, http_code, final_url, error_msg} + """ + result = { + 'url': url, + 'status': EXT_STATUS_ERROR, + 'http_code': None, + 'final_url': None, + 'error_msg': None, + } + + # Special case: links to our own live site β€” check against local build + parsed = urlparse(url) + if parsed.netloc == SITE_DOMAIN: + exists, tried = resolve_site_url_locally(url) + if exists: + result['status'] = EXT_STATUS_OK + else: + result['status'] = EXT_STATUS_INTERNAL + result['error_msg'] = f"Not in local build: {tried}" + return result + + headers = { + 'User-Agent': USER_AGENT, + 'Accept': 'text/html,application/xhtml+xml,*/*;q=0.8', + 'Accept-Language': 'en-US,en;q=0.9', + } + + # ── Step 1: initial request (no auto-redirect) ── + code, _, location, err = _fetch(url, headers, method='HEAD', follow_redirects=False) + + if err: + return _classify_connection_error(result, err) + + # HEAD rejected β†’ retry with GET (no auto-redirect) + if code in (403, 405): + code, _, location, err = _fetch(url, headers, method='GET', follow_redirects=False) + if err: + return _classify_connection_error(result, err) + if code 
in (403, 405): + result['status'] = EXT_STATUS_ERROR + result['http_code'] = code + result['error_msg'] = f"HTTP {code} (GET retry)" + result['final_url'] = url + return result + + result['http_code'] = code + + # ── Step 2: classify based on response code ── + if code is None: + result['status'] = EXT_STATUS_ERROR + return result + + if code == 200: + result['status'] = EXT_STATUS_OK + result['final_url'] = url + + elif code == 404: + result['status'] = EXT_STATUS_404 + result['error_msg'] = 'HTTP 404' + result['final_url'] = url + + elif code in (301, 302, 303, 307, 308): + # ── Redirect: verify the destination is actually reachable ── + dest = location or url + # Make dest absolute if it's a relative Location header + if dest and not dest.startswith('http'): + p = urlparse(url) + dest = f"{p.scheme}://{p.netloc}{dest}" + + dest_code, dest_final, dest_err = _check_destination(dest, headers) + + if dest_err: + result['status'] = EXT_STATUS_DOWN + result['error_msg'] = f"Redirect to {dest!r} failed: {dest_err[:80]}" + result['final_url'] = dest + elif dest_code is None: + result['status'] = EXT_STATUS_DOWN + result['error_msg'] = f"Redirect destination unreachable: {dest!r}" + result['final_url'] = dest + elif dest_code == 200: + if _urls_differ_meaningfully(url, dest_final): + result['status'] = EXT_STATUS_REDIRECT + result['final_url'] = dest_final + else: + result['status'] = EXT_STATUS_OK + result['final_url'] = dest_final + elif dest_code == 404: + result['status'] = EXT_STATUS_404 + result['error_msg'] = f"Redirect target returned 404 ({dest!r})" + result['final_url'] = dest + else: + result['status'] = EXT_STATUS_ERROR + result['error_msg'] = f"Redirect target returned HTTP {dest_code}" + result['final_url'] = dest + + else: + # Any other 2xx is fine; other codes treated as errors + if 200 <= code < 300: + result['status'] = EXT_STATUS_OK + result['final_url'] = url + else: + result['status'] = EXT_STATUS_ERROR + result['error_msg'] = f"HTTP {code}" + 
result['final_url'] = url + + return result + + +def _urls_differ_meaningfully(original, final): + """True if the URLs differ in a way that's worth reporting (not just httpβ†’https or trailing slash).""" + if not final or original == final: + return False + o = urlparse(original) + f = urlparse(final) + o_path = o.path.rstrip('/') + f_path = f.path.rstrip('/') + # Same host+path, only scheme or trailing-slash differs β†’ not meaningful + if o.netloc == f.netloc and o_path == f_path and o.query == f.query: + return False + # http β†’ https upgrade on same host/path β†’ not meaningful + if (o.netloc == f.netloc and o_path == f_path + and o.scheme == 'http' and f.scheme == 'https'): + return False + return True + + +def check_external_urls_threaded(url_to_sources, threads=EXT_THREADS): + """ + Check a dict of {url: [source_files]} concurrently. + Returns dict of {url: check_result_dict}. + """ + urls = list(url_to_sources.keys()) + results = {} + lock = threading.Lock() + q = queue.Queue() + + for url in urls: + q.put(url) + + total = len(urls) + done = [0] + + def worker(): + while True: + try: + url = q.get_nowait() + except queue.Empty: + break + time.sleep(EXT_DELAY) + res = check_external_url(url) + with lock: + results[url] = res + done[0] += 1 + n = done[0] + if n % 10 == 0 or n == total: + print(f" External: {n}/{total} checked...", end='\r', flush=True) + q.task_done() + + thread_list = [threading.Thread(target=worker, daemon=True) for _ in range(min(threads, len(urls)))] + for t in thread_list: + t.start() + for t in thread_list: + t.join() + + print() # newline after \r progress + return results + + +# ───────────────────────────────────────────── +# Markdown file checker +# ───────────────────────────────────────────── + +def check_markdown_files(check_external=True): + """ + Scan all .md/.mdx source files. + + Internal links are verified against the BUILD output: + - page existence: does the corresponding build HTML file exist? 
+ - anchor existence: is the anchor present as an id attribute in the rendered HTML? + No slug inference is performed at any point. + + Returns: + - broken_internal: list of broken internal link dicts + - external_url_to_sources: dict {url: [(source_file, link_text)]} + - stats + """ + broken_internal = [] + external_url_to_src = defaultdict(list) + files_checked = 0 + links_checked = 0 + html_id_cache = {} # str(html_path) β†’ frozenset of id strings + + if not BUILD_DIR.exists(): + print(" WARNING: build/ directory not found.") + print(" Run 'npm run build' first β€” internal links cannot be checked without it.") + + md_files = sorted(list(DOCS_DIR.rglob('*.md')) + list(DOCS_DIR.rglob('*.mdx'))) + + for md_file in md_files: + files_checked += 1 + try: + content = md_file.read_text(encoding='utf-8', errors='replace') + except Exception as e: + broken_internal.append({ + 'source': str(md_file), 'link_text': '', 'link_url': '', + 'resolved': '', 'reason': f'Could not read file: {e}', + }) + continue + + # Build HTML for this source file β€” used for anchor-only (#frag) links + source_build_html = md_path_to_build_html(md_file) + + links = extract_md_links(content) + + for link_text, url in links: + url = url.strip() + if not url or url == '#': + continue + if any(url.startswith(s) for s in IGNORE_SCHEMES): + continue + + parsed_url = urlparse(url) + if any(parsed_url.hostname and parsed_url.hostname.startswith(h) for h in IGNORE_HOSTS): + continue + + links_checked += 1 + + # ── External / self-site links ── + if any(url.startswith(s) for s in EXTERNAL_SCHEMES): + if check_external: + external_url_to_src[url].append((str(md_file), link_text)) + continue + + # ── Split anchor from path ── + anchor = None + link_path = url + if '#' in link_path: + link_path, anchor = link_path.split('#', 1) + + # ── Determine target build HTML ── + if not link_path: + # Anchor-only link β€” same page + target_html = source_build_html + else: + target_html, reason = 
resolve_internal_to_build_html(md_file, link_path) + if reason or target_html is None or not target_html.exists(): + broken_internal.append({ + 'source': str(md_file), + 'link_text': link_text, + 'link_url': url, + 'resolved': str(target_html) if target_html else link_path, + 'reason': reason or 'Build HTML not found', + }) + continue + + # ── Check anchor in rendered HTML ── + if anchor and target_html and target_html.exists(): + key = str(target_html) + if key not in html_id_cache: + html_id_cache[key] = get_html_ids(target_html) + if anchor not in html_id_cache[key]: + broken_internal.append({ + 'source': str(md_file), + 'link_text': link_text, + 'link_url': url, + 'resolved': f'{target_html}#{anchor}', + 'reason': f'Anchor "#{anchor}" not found in rendered HTML', + }) + + return broken_internal, dict(external_url_to_src), files_checked, links_checked, len(md_files) + + +# ───────────────────────────────────────────── +# HTML build checker +# ───────────────────────────────────────────── + +class LinkExtractor(HTMLParser): + def __init__(self): + super().__init__() + self.links = [] + self.ids = set() + + def handle_starttag(self, tag, attrs): + attrs_dict = dict(attrs) + if 'id' in attrs_dict: + self.ids.add(attrs_dict['id']) + if tag == 'a' and 'href' in attrs_dict: + self.links.append(('href', attrs_dict['href'])) + elif tag in ('img', 'script') and 'src' in attrs_dict: + self.links.append(('src', attrs_dict['src'])) + elif tag == 'link' and 'href' in attrs_dict: + self.links.append(('href', attrs_dict['href'])) + + +def get_html_ids(html_file): + try: + content = html_file.read_text(encoding='utf-8', errors='replace') + except Exception: + return set() + parser = LinkExtractor() + parser.feed(content) + return parser.ids + + +def resolve_html_link(source_html, href, build_root): + anchor = None + if '#' in href: + href, anchor = href.split('#', 1) + + href = unquote(href) + if not href: + return None, anchor, None + + if href.startswith('/'): + rel = 
href.lstrip('/') + target = build_root / rel + candidates = [target] + if target.suffix == '': + candidates.append(target / 'index.html') + else: + source_dir = source_html.parent + target = (source_dir / href).resolve() + candidates = [target] + if target.suffix == '': + candidates.append(target / 'index.html') + + for c in candidates: + if c.exists() and c.is_file(): + return c, anchor, None + return target, anchor, "File not found" + + +def check_html_files(): + broken = [] + files_checked = 0 + links_checked = 0 + id_cache = {} + + html_files = sorted(BUILD_DIR.rglob('*.html')) + + for html_file in html_files: + files_checked += 1 + try: + content = html_file.read_text(encoding='utf-8', errors='replace') + except Exception as e: + broken.append({'source': str(html_file), 'attr': 'href', 'link_url': '', + 'resolved': '', 'reason': f'Could not read: {e}'}) + continue + + parser = LinkExtractor() + parser.feed(content) + file_ids = parser.ids + + for attr, url in parser.links: + url = url.strip() + if not url or url == '#': + continue + if any(url.startswith(s) for s in EXTERNAL_SCHEMES + IGNORE_SCHEMES + ('data:',)): + continue + + links_checked += 1 + + if url.startswith('#'): + anchor = url[1:] + if anchor and anchor not in file_ids: + broken.append({'source': str(html_file), 'attr': attr, 'link_url': url, + 'resolved': f'{html_file}#{anchor}', + 'reason': f'Anchor "#{anchor}" not found in same page'}) + continue + + resolved, anchor, reason = resolve_html_link(html_file, url, BUILD_DIR) + + if reason: + broken.append({'source': str(html_file), 'attr': attr, 'link_url': url, + 'resolved': str(resolved) if resolved else url, 'reason': reason}) + continue + + if anchor and resolved and resolved.exists(): + key = str(resolved) + if key not in id_cache: + id_cache[key] = get_html_ids(resolved) + if anchor not in id_cache[key]: + broken.append({'source': str(html_file), 'attr': attr, 'link_url': url, + 'resolved': f'{resolved}#{anchor}', + 'reason': f'Anchor 
"#{anchor}" not found in target HTML'}) + + return broken, files_checked, links_checked, len(html_files) + + +# ───────────────────────────────────────────── +# Deduplication +# ───────────────────────────────────────────── + +def deduplicate_html_broken(broken): + groups = defaultdict(list) + for item in broken: + groups[(item['link_url'], item['reason'])].append(item) + result = [] + for (url, reason), items in sorted(groups.items()): + rep = dict(items[0]) + rep['count'] = len(items) + rep['example_sources'] = [it['source'] for it in items[:3]] + result.append(rep) + return result + + +# ───────────────────────────────────────────── +# Report +# ───────────────────────────────────────────── + +def make_short_path(path_str, base): + try: + return str(Path(path_str).relative_to(base)) + except ValueError: + try: + return str(Path(path_str).relative_to(PROJECT_DIR)) + except ValueError: + return path_str + + +def write_report( + md_broken, ext_results, ext_url_to_src, + md_files_checked, md_links_checked, md_total_files, + html_broken, html_files_checked, html_links_checked, html_total_files, + staged_replacements=None, +): + import datetime + today = datetime.date.today().isoformat() + + # Categorise external results + ext_404 = {u: r for u, r in ext_results.items() if r['status'] == EXT_STATUS_404} + ext_down = {u: r for u, r in ext_results.items() if r['status'] == EXT_STATUS_DOWN} + ext_redirect = {u: r for u, r in ext_results.items() if r['status'] == EXT_STATUS_REDIRECT} + ext_internal = {u: r for u, r in ext_results.items() if r['status'] == EXT_STATUS_INTERNAL} + ext_error = {u: r for u, r in ext_results.items() if r['status'] == EXT_STATUS_ERROR} + + _staged = staged_replacements or {} + + def _repl(url, res=None): + if url in _staged: + return _staged[url] + final = (res or {}).get('final_url') or '' + return final if final and final != url else '' + + deduped_html = deduplicate_html_broken(html_broken) + + lines = [] + lines.append("# Dead Links 
Report\n") + lines.append(f"Generated: {today}\n") + lines.append("") + + # ── Summary ── + lines.append("## Summary\n") + lines.append("| Category | Count |") + lines.append("|---|---|") + lines.append(f"| Source doc files checked | {md_files_checked} / {md_total_files} |") + lines.append(f"| Internal links checked (source) | {md_links_checked} |") + lines.append(f"| **Broken internal links (source)** | **{len(md_broken)}** |") + lines.append(f"| External URLs checked | {len(ext_results)} |") + lines.append(f"| **External 404s** | **{len(ext_404) + len(ext_internal)}** |") + lines.append(f"| **External down / refused** | **{len(ext_down)}** |") + lines.append(f"| **Stale redirects** | **{len(ext_redirect)}** |") + lines.append(f"| External errors (timeout/misc) | {len(ext_error)} |") + lines.append(f"| Build HTML files checked | {html_files_checked} / {html_total_files} |") + lines.append(f"| **Broken links in build output** | **{len(deduped_html)} patterns** |") + lines.append("") + + # ── Section 1: Internal broken links ── + lines.append("---\n") + lines.append("## Section 1: Broken Internal Links in Source Docs\n") + + if not md_broken: + lines.append("_No broken internal links._\n") + else: + by_file = defaultdict(list) + for item in md_broken: + by_file[item['source']].append(item) + for source in sorted(by_file): + short = make_short_path(source, DOCS_DIR) + lines.append(f"### `{short}`\n") + lines.append("| Link Text | URL | Resolved Path | Reason |") + lines.append("|---|---|---|---|") + for item in by_file[source]: + text = item['link_text'].replace('|', '\\|')[:60] + url = item['link_url'].replace('|', '\\|')[:80] + resolved = make_short_path(item['resolved'], DOCS_DIR).replace('|', '\\|')[:100] + reason = item['reason'].replace('|', '\\|') + lines.append(f"| {text} | `{url}` | `{resolved}` | {reason} |") + lines.append("") + + # ── Section 2: External 404s ── + lines.append("---\n") + lines.append("## Section 2: External 404s\n") + + all_404 = 
{**ext_404, **ext_internal} + if not all_404: + lines.append("_No external 404s found._\n") + else: + lines.append("| URL | Replacement URL | Notes | Instances (Link Text β€” File) |") + lines.append("|---|---|---|---|") + for url, res in sorted(all_404.items()): + instances = _fmt_instances(ext_url_to_src.get(url, [])) + code_str = f"HTTP {res['http_code']}" if res['http_code'] else (res['error_msg'] or '') + if res['status'] == EXT_STATUS_INTERNAL: + code_str = "Not found in local build" + repl = _repl(url, res) + lines.append(f"| `{url[:100]}` | {repl} | {code_str} | {instances} |") + lines.append("") + + # ── Section 3: Down / refused ── + lines.append("---\n") + lines.append("## Section 3: Down / Connection Refused\n") + + if not ext_down: + lines.append("_No unreachable external links._\n") + else: + lines.append("| URL | Replacement URL | Error | Instances (Link Text β€” File) |") + lines.append("|---|---|---|---|") + for url, res in sorted(ext_down.items()): + instances = _fmt_instances(ext_url_to_src.get(url, [])) + err = res.get('error_msg', '') or '' + repl = _repl(url, res) + lines.append(f"| `{url[:100]}` | {repl} | {err} | {instances} |") + lines.append("") + + # ── Section 4: Stale redirects ── + lines.append("---\n") + lines.append("## Section 4: Stale Redirects (Update to Final URL)\n") + + if not ext_redirect: + lines.append("_No stale redirects found._\n") + else: + lines.append("| Original URL | Redirects To | Instances (Link Text β€” File) |") + lines.append("|---|---|---|") + for url, res in sorted(ext_redirect.items()): + instances = _fmt_instances(ext_url_to_src.get(url, [])) + repl = _repl(url, res) + lines.append(f"| `{url[:80]}` | `{repl[:80]}` | {instances} |") + lines.append("") + + # ── Section 5: Errors / timeouts ── + if ext_error: + lines.append("---\n") + lines.append("## Section 5: External Check Errors (timeout / misc)\n") + lines.append("| URL | Error | Instances (Link Text β€” File) |") + lines.append("|---|---|---|") + for 
url, res in sorted(ext_error.items()): + instances = _fmt_instances(ext_url_to_src.get(url, [])) + err = res.get('error_msg', '') or '' + lines.append(f"| `{url[:100]}` | {err} | {instances} |") + lines.append("") + + # ── Section 6: Build HTML broken links ── + lines.append("---\n") + lines.append("## Section 6: Broken Links in Build Output\n") + lines.append("_Deduplicated by (url, reason) pattern._\n") + + if not deduped_html: + lines.append("_No broken links in build output._\n") + else: + lines.append("| Count | URL | Reason | Example Source |") + lines.append("|---|---|---|---|") + for item in sorted(deduped_html, key=lambda x: -x['count']): + url = item['link_url'].replace('|', '\\|')[:80] + reason = item['reason'].replace('|', '\\|') + example = make_short_path(item['example_sources'][0], BUILD_DIR).replace('|', '\\|')[:80] + lines.append(f"| {item['count']} | `{url}` | {reason} | `{example}` |") + lines.append("") + + REPORT_PATH.parent.mkdir(parents=True, exist_ok=True) + REPORT_PATH.write_text('\n'.join(lines), encoding='utf-8') + print(f"Report written to: {REPORT_PATH}") + + +# ───────────────────────────────────────────── +# Human-readable audit report +# ───────────────────────────────────────────── + +def _source_to_page_link(path_str): + """Return a markdown link like [/docs/foo/bar](https://docs.ethswarm.org/docs/foo/bar).""" + try: + rel = Path(path_str).relative_to(DOCS_DIR) + except ValueError: + return path_str + url_path = str(rel).replace('\\', '/').replace('.mdx', '').replace('.md', '') + display = f"/docs/{url_path}" + url = f"https://{SITE_DOMAIN}/docs/{url_path}" + return f"[{display}]({url})" + + +def _fmt_sources(sources_list, max_show=2): + """Format a list of (file, text) source tuples into page link(s).""" + if not sources_list: + return "Unknown" + seen = [] + for f, _ in sources_list: + lnk = _source_to_page_link(f) + if lnk not in seen: + seen.append(lnk) + if len(seen) > max_show: + return ", ".join(seen[:max_show]) + f" 
_(+{len(seen)-max_show} more)_" + return ", ".join(seen) + + +def _fmt_instances(sources_list, docs_dir=None): + """ + Format a list of (file_path, link_text) tuples as bullet points separated + by
tags (for inline rendering in markdown table cells). + + Each bullet: β€’ "link text" β€” `relative/file/path.md` + """ + if not sources_list: + return "_unknown_" + if docs_dir is None: + docs_dir = DOCS_DIR + bullets = [] + for f, text in sources_list: + short = make_short_path(f, docs_dir).replace('|', '\\|') + safe_text = (text or '').strip().replace('|', '\\|')[:80] + if safe_text: + bullets.append(f'β€’ "{safe_text}" β€” `{short}`') + else: + bullets.append(f'β€’ `{short}`') + return "
".join(bullets) + + +def write_human_report( + md_broken, ext_results, ext_url_to_src, + md_files_checked, md_links_checked, md_total_files, + html_broken, html_files_checked, html_links_checked, html_total_files, + staged_replacements=None, +): + import datetime + today = datetime.date.today().isoformat() + + # Categorise external results + ext_404 = {u: r for u, r in ext_results.items() if r['status'] == EXT_STATUS_404} + ext_down = {u: r for u, r in ext_results.items() if r['status'] == EXT_STATUS_DOWN} + ext_redirect = {u: r for u, r in ext_results.items() if r['status'] == EXT_STATUS_REDIRECT} + ext_internal = {u: r for u, r in ext_results.items() if r['status'] == EXT_STATUS_INTERNAL} + ext_error = {u: r for u, r in ext_results.items() if r['status'] == EXT_STATUS_ERROR} + + # Self-site 404s (docs.ethswarm.org old paths) vs truly external 404s + self_404 = {**ext_internal} # checked against local build, not found + real_404 = {**ext_404} # HTTP 404 from external server + + _staged = staged_replacements or {} + + def _repl(url, res=None): + if url in _staged: + return _staged[url] + final = (res or {}).get('final_url') or '' + return final if final and final != url else '' + + n_dead = len(md_broken) + len(self_404) + len(real_404) + n_down = len(ext_down) + n_redirects = len(ext_redirect) + n_errors = len(ext_error) + n_total = n_dead + n_down + n_redirects + + lines = [] + lines.append("## Context\n") + lines.append( + f"Dead link audit of {SITE_DOMAIN} found **{n_total}** broken, down, or stale links. 
" + f"Audit date: {today}.\n" + ) + + # ── Dead Links (404) ────────────────────────────────────────────────────── + lines.append("---\n") + lines.append("## Dead Links (404)\n") + + if not md_broken and not self_404 and not real_404: + lines.append("_No dead links found._\n") + else: + lines.append("| Dead Link | Replacement URL | Status | Instances (Link Text β€” File) |") + lines.append("|---|---|---|---|") + + # Broken internal links (wrong file path or missing anchor) + for item in md_broken: + url = item['link_url'].replace('|', '\\|') + reason = item['reason'].replace('|', '\\|') + instances = _fmt_instances([(item['source'], item.get('link_text', ''))]) + lines.append(f"| `{url}` | | **Broken** β€” {reason} | {instances} |") + + # Self-site 404s (old docs.ethswarm.org paths not in local build) + for url, _res in sorted(self_404.items()): + instances = _fmt_instances(ext_url_to_src.get(url, [])) + lines.append(f"| {url} | | **404** β€” not found in local build (old path?) | {instances} |") + + # External 404s + for url, res in sorted(real_404.items()): + instances = _fmt_instances(ext_url_to_src.get(url, [])) + repl = _repl(url, res) + lines.append(f"| {url} | {repl} | **404** | {instances} |") + + lines.append("") + + # ── Forbidden / Down ───────────────────────────────────────────────────── + lines.append("---\n") + lines.append("## Forbidden / Down\n") + + if not ext_down: + lines.append("_No unreachable links._\n") + else: + lines.append("| Dead Link | Replacement URL | Status | Instances (Link Text β€” File) |") + lines.append("|---|---|---|---|") + for url, res in sorted(ext_down.items()): + instances = _fmt_instances(ext_url_to_src.get(url, [])) + err = res.get('error_msg') or 'connection failed' + repl = _repl(url, res) + # Simplify error messages + if 'DNS' in err or 'getaddrinfo' in err.lower(): + status = "**DNS failure** β€” domain not found" + elif 'ECONNREFUSED' in err or 'Connection refused' in err: + status = "**ECONNREFUSED** β€” server 
down" + elif 'timed out' in err.lower() or 'timeout' in err.lower(): + status = "**Timeout** β€” server unresponsive" + elif 'SSL' in err or 'ssl' in err: + status = "**SSL error** β€” handshake failure" + else: + status = f"**Down** β€” {err[:80]}" + lines.append(f"| {url} | {repl} | {status} | {instances} |") + + lines.append("") + + # ── Stale Redirects ─────────────────────────────────────────────────────── + lines.append("---\n") + lines.append("## Stale Redirects (Should Update)\n") + + if not ext_redirect: + lines.append("_No stale redirects._\n") + else: + lines.append("| Old Link | Redirects To | Instances (Link Text β€” File) |") + lines.append("|---|---|---|") + for url, res in sorted(ext_redirect.items()): + instances = _fmt_instances(ext_url_to_src.get(url, [])) + repl = _repl(url, res) + lines.append(f"| {url} | {repl} | {instances} |") + + lines.append("") + + # ── Errors / Timeouts ──────────────────────────────────────────────────── + if ext_error: + lines.append("---\n") + lines.append("## Check Errors (timeout / blocked)\n") + lines.append("_These URLs could not be verified β€” check manually._\n") + lines.append("| URL | Error | Instances (Link Text β€” File) |") + lines.append("|---|---|---|") + for url, res in sorted(ext_error.items()): + instances = _fmt_instances(ext_url_to_src.get(url, [])) + err = res.get('error_msg') or '' + lines.append(f"| {url} | {err} | {instances} |") + lines.append("") + + # ── Summary ─────────────────────────────────────────────────────────────── + lines.append("---\n") + lines.append("## Summary\n") + lines.append(f"- **Broken internal links:** {len(md_broken)}") + lines.append(f"- **Hard 404s (external):** {len(real_404) + len(self_404)}") + lines.append(f"- **Forbidden / Down:** {n_down}") + lines.append(f"- **Stale redirects:** {n_redirects}") + if ext_error: + lines.append(f"- **Check errors (unverified):** {n_errors}") + lines.append(f"- **Total actionable:** {n_total}") + lines.append("") + + # ── Priority 
───────────────────────────────────────────────────────────── + lines.append("---\n") + lines.append("## Priority\n") + priority = [] + if md_broken: + priority.append(f"1. Fix {len(md_broken)} broken internal links (wrong paths / missing anchors)") + if self_404: + priority.append(f"{len(priority)+1}. Update {len(self_404)} old self-referential `{SITE_DOMAIN}` path(s) to current URLs") + if real_404: + priority.append(f"{len(priority)+1}. Remove or replace {len(real_404)} dead external link(s) (HTTP 404)") + if ext_down: + priority.append(f"{len(priority)+1}. Evaluate {len(ext_down)} down/refused server link(s) β€” remove or replace") + if ext_redirect: + priority.append(f"{len(priority)+1}. Update {len(ext_redirect)} stale redirect(s) to their final URL") + if ext_error: + priority.append(f"{len(priority)+1}. Manually verify {len(ext_error)} URL(s) that returned errors during check") + for item in priority: + lines.append(item) + lines.append("") + + HUMAN_REPORT_PATH.parent.mkdir(parents=True, exist_ok=True) + HUMAN_REPORT_PATH.write_text('\n'.join(lines), encoding='utf-8') + print(f"Human report written to: {HUMAN_REPORT_PATH}") + + +# ───────────────────────────────────────────── +# Staged-changes URL replacement map +# ───────────────────────────────────────────── + +def get_staged_url_replacements(): + """ + Parse `git diff --cached` to find URL replacements in staged changes. + Within each diff hunk, URLs on removed lines (-) are matched to URLs on + added lines (+) in order. Returns {old_url: new_url}. 
+ """ + url_re = re.compile(r'https?://[^\s\])"\'<>`\\]+') + try: + result = subprocess.run( + ['git', 'diff', '--cached', '--unified=0'], + cwd=str(PROJECT_DIR), + capture_output=True, text=True, + ) + if result.returncode != 0 or not result.stdout: + return {} + except Exception: + return {} + + replacements = {} + removed, added = [], [] + + def _flush(): + removed_set = set(removed) + added_set = set(added) + gone = [u for u in removed if u not in added_set] + new = [u for u in added if u not in removed_set] + if gone and new: + for old, new_url in zip(gone, new): + replacements[old] = new_url + + for line in result.stdout.splitlines(): + if line.startswith(('diff --git', 'index ', '--- ', '+++ ')): + continue + if line.startswith('@@'): + _flush() + removed, added = [], [] + elif line.startswith('-'): + removed.extend(url_re.findall(line[1:])) + elif line.startswith('+'): + added.extend(url_re.findall(line[1:])) + _flush() + return replacements + + +# ───────────────────────────────────────────── +# Build helper +# ───────────────────────────────────────────── + +def _build_is_outdated(): + """ + Return True if any source file in docs/, static/, or key config files + was modified more recently than the build directory itself. + """ + try: + build_mtime = BUILD_DIR.stat().st_mtime + except FileNotFoundError: + return True # no build at all + + watch_dirs = [DOCS_DIR, STATIC_DIR] + watch_files = [ + PROJECT_DIR / "docusaurus.config.mjs", + PROJECT_DIR / "sidebars.js", + ] + + for d in watch_dirs: + if d.exists(): + for f in d.rglob("*"): + if f.is_file() and f.stat().st_mtime > build_mtime: + return True + + for f in watch_files: + if f.exists() and f.stat().st_mtime > build_mtime: + return True + + return False + + +def trigger_build(): + """ + Ensure a current build exists before running local checks. + + Behaviour: + β€’ No build found β†’ build immediately, no prompt needed. + β€’ Build found, up to date β†’ ask permission to overwrite. 
+ β€’ Build found, outdated β†’ warn user, ask if they want to rebuild. + + Returns True if the build is ready to use, False on build failure or abort. + """ + if not BUILD_DIR.exists(): + print("\nNo existing build found β€” running: npm run build") + print("-" * 40) + result = subprocess.run([_NPM, 'run', 'build'], cwd=str(PROJECT_DIR)) + print("-" * 40) + if result.returncode != 0: + print("ERROR: Build failed (see output above).", file=sys.stderr) + return False + print("Build complete.\n") + return True + + # Build exists β€” check freshness + outdated = _build_is_outdated() + if outdated: + print(f"\nWARNING: The existing build at {BUILD_DIR} is outdated") + print(" (source files have changed since it was last built).") + prompt = "Rebuild now to get accurate results? This will overwrite it. [Y/n]: " + else: + print(f"\nAn existing build was found at: {BUILD_DIR} (appears up to date).") + prompt = "Rebuild now anyway? This will overwrite it. [y/N]: " + + try: + resp = input(prompt).strip().lower() + except (EOFError, KeyboardInterrupt): + print() + resp = '' + + # For outdated builds default is YES; for current builds default is NO + if outdated: + do_build = resp not in ('n', 'no') + else: + do_build = resp in ('y', 'yes') + + if not do_build: + if outdated: + print("Skipping rebuild β€” results may not reflect latest changes.\n") + else: + print("Skipping rebuild β€” using existing build.\n") + return True + + print("\nRunning: npm run build") + print("-" * 40) + result = subprocess.run([_NPM, 'run', 'build'], cwd=str(PROJECT_DIR)) + print("-" * 40) + if result.returncode != 0: + print("ERROR: Build failed (see output above).", file=sys.stderr) + return False + print("Build complete.\n") + return True + + +# ───────────────────────────────────────────── +# Main +# ───────────────────────────────────────────── + +def main(): + parser = argparse.ArgumentParser( + description='Bee-docs link checker', + formatter_class=argparse.RawDescriptionHelpFormatter, + 
epilog=( + "Modes:\n" + " local β€” build the site locally and check source docs + build output\n" + " live β€” fetch the live site at docs.ethswarm.org and check all links\n" + ), + ) + parser.add_argument( + '--mode', choices=['local', 'live'], default=None, + help='Check mode: "local" (build + source check) or "live" (live site crawl). ' + 'If omitted you will be prompted.', + ) + parser.add_argument('--no-external', action='store_true', + help='(local mode only) Skip external URL checking') + parser.add_argument('--threads', type=int, default=EXT_THREADS, + help=f'Concurrent HTTP threads (default: {EXT_THREADS})') + args = parser.parse_args() + + # ── Mode selection ── + mode = args.mode + if mode is None: + print("=== Bee-docs Link Checker ===\n") + print("Which site do you want to check?") + print(" 1) local β€” build locally and check source docs + build output") + print(" 2) live β€” fetch the live site at docs.ethswarm.org\n") + try: + choice = input("Enter 1 or 2 [default: 1]: ").strip() + except (EOFError, KeyboardInterrupt): + print() + choice = '1' + mode = 'live' if choice == '2' else 'local' + print() + + # ── Live mode: delegate to check_live_links.py ── + if mode == 'live': + live_script = Path(__file__).parent / 'check_live_links.py' + if not live_script.exists(): + print(f"ERROR: {live_script} not found.", file=sys.stderr) + sys.exit(1) + cmd = [sys.executable, str(live_script), '--threads', str(args.threads)] + print(f"Running live checker: {' '.join(cmd)}\n") + result = subprocess.run(cmd) + sys.exit(result.returncode) + + # ── Local mode ── + check_ext = not args.no_external + + print("=== Bee-docs Link Checker β€” Local Mode ===") + print(f"Docs dir : {DOCS_DIR}") + print(f"Build dir : {BUILD_DIR}") + print(f"External : {'enabled' if check_ext else 'disabled (--no-external)'}") + print() + + if not DOCS_DIR.exists(): + print(f"ERROR: Docs dir not found: {DOCS_DIR}") + sys.exit(1) + + # Always trigger a build for local mode + if not 
trigger_build(): + sys.exit(1) + + print("Scanning source docs (internal links)...") + md_broken, ext_url_to_src, md_files, md_links, md_total = check_markdown_files(check_ext) + print(f" Files: {md_files}/{md_total}, Links: {md_links}, Broken internal: {len(md_broken)}") + print(f" Unique external URLs collected: {len(ext_url_to_src)}") + + ext_results = {} + if check_ext and ext_url_to_src: + print(f"\nChecking {len(ext_url_to_src)} external URLs ({args.threads} threads)...") + ext_results = check_external_urls_threaded(ext_url_to_src, threads=args.threads) + ok = sum(1 for r in ext_results.values() if r['status'] == EXT_STATUS_OK) + redirects = sum(1 for r in ext_results.values() if r['status'] == EXT_STATUS_REDIRECT) + not_found = sum(1 for r in ext_results.values() if r['status'] in (EXT_STATUS_404, EXT_STATUS_INTERNAL)) + down = sum(1 for r in ext_results.values() if r['status'] == EXT_STATUS_DOWN) + errors = sum(1 for r in ext_results.values() if r['status'] == EXT_STATUS_ERROR) + print(f" OK: {ok} Redirect: {redirects} 404: {not_found} Down: {down} Error: {errors}") + + html_broken = [] + html_files = html_links = html_total = 0 + if BUILD_DIR.exists(): + print("\nChecking build output (HTML internal links)...") + html_broken, html_files, html_links, html_total = check_html_files() + print(f" Files: {html_files}/{html_total}, Links: {html_links}, Broken: {len(html_broken)}") + + staged = get_staged_url_replacements() + if staged: + print(f"\nFound {len(staged)} staged URL replacement(s) from git diff.") + + print("\nWriting report...") + write_report( + md_broken, ext_results, ext_url_to_src, + md_files, md_links, md_total, + html_broken, html_files, html_links, html_total, + staged_replacements=staged, + ) + write_human_report( + md_broken, ext_results, ext_url_to_src, + md_files, md_links, md_total, + html_broken, html_files, html_links, html_total, + staged_replacements=staged, + ) + + +if __name__ == '__main__': + main() diff --git 
a/scripts/check_live_links.py b/scripts/check_live_links.py new file mode 100644 index 00000000..b22334c5 --- /dev/null +++ b/scripts/check_live_links.py @@ -0,0 +1,689 @@ +#!/usr/bin/env python3 +""" +Live site link checker for docs.ethswarm.org. + +Fetches all pages listed in the sitemap, extracts every link, +then checks each link with explicit redirect handling (no auto-following). + +Usage: + python scripts/check_live_links.py [--threads N] [--max-pages N] + npm run check:links (then select live mode) + +Output: + link-reports/live_links_audit.md β€” human-readable report +""" + +import re +import sys +import time +import queue +import socket +import threading +import http.client +import subprocess +import xml.etree.ElementTree as ET +import argparse +import datetime +from html.parser import HTMLParser +from pathlib import Path +from urllib.parse import urlparse, urljoin, unquote +from urllib.request import Request, urlopen, HTTPRedirectHandler, build_opener +from urllib.error import URLError, HTTPError +from collections import defaultdict + +# ───────────────────────────────────────────── +# Configuration +# ───────────────────────────────────────────── + +SITE_BASE = "https://docs.ethswarm.org" +SITEMAP_URL = f"{SITE_BASE}/sitemap.xml" +PROJECT_DIR = Path(__file__).resolve().parent.parent +REPORT_PATH = PROJECT_DIR / "link-reports/live_links_audit.md" + +EXT_TIMEOUT = 15 # seconds per HTTP request +EXT_THREADS = 8 # concurrent URL checkers +EXT_DELAY = 0.05 # seconds between requests per thread + +USER_AGENT = ( + "Mozilla/5.0 (Windows NT 10.0; Win64; x64) " + "AppleWebKit/537.36 (KHTML, like Gecko) " + "Chrome/122.0 Safari/537.36 bee-docs-live-checker/1.0" +) + +IGNORE_SCHEMES = ("mailto:", "javascript:", "tel:", "ftp:", "data:", "#") +IGNORE_HOSTS = ("localhost", "127.0.0.1", "192.168.", "10.0.", "0.0.0.0") +# Hostnames that end with these suffixes are placeholder/example URLs in docs +IGNORE_HOST_SUFFIXES = (".example", ".local", ".invalid", ".test") + +# URL
substrings to silently ignore β€” systematic redirects that aren't actionable doc fixes. +# e.g. every page has an "Edit this page" link using the old GitHub repo name. +IGNORE_URL_PATTERNS = ( + "github.com/ethersphere/docs.github.io", # "Edit this page" links using old repo name +) + +# Hostnames to ignore because they are example/template values in documentation +IGNORE_EXAMPLE_HOSTS = ( + "yourname.eth.limo", + "yourname.bzz.link", + "bee-1", # example service hostname in docker/gateway examples +) + +EXT_STATUS_OK = 'ok' +EXT_STATUS_404 = '404' +EXT_STATUS_DOWN = 'down' +EXT_STATUS_REDIRECT = 'redirect' +EXT_STATUS_ERROR = 'error' + + +# ───────────────────────────────────────────── +# HTTP helpers (explicit redirect handling) +# ───────────────────────────────────────────── + +class _NoFollowRedirectHandler(HTTPRedirectHandler): + def redirect_request(self, req, fp, code, msg, headers, newurl): + return None + + +def _build_no_redirect_opener(): + return build_opener(_NoFollowRedirectHandler()) + + +def _fetch(url, method='HEAD', follow_redirects=False, timeout=EXT_TIMEOUT): + """ + Single HTTP request. + Returns (status_code, final_url, location_header, error_str). 
+ """ + headers = { + 'User-Agent': USER_AGENT, + 'Accept': 'text/html,application/xhtml+xml,*/*;q=0.8', + } + try: + req = Request(url, headers=headers, method=method) + if follow_redirects: + with urlopen(req, timeout=timeout) as resp: + return resp.status, resp.url, None, None + else: + opener = _build_no_redirect_opener() + with opener.open(req, timeout=timeout) as resp: + return resp.status, url, resp.headers.get('Location'), None + except HTTPError as e: + loc = e.headers.get('Location') if hasattr(e, 'headers') and e.headers else None + return e.code, url, loc, None + except (URLError, socket.timeout, socket.error, ConnectionRefusedError, + http.client.RemoteDisconnected, http.client.IncompleteRead) as e: + return None, url, None, str(e) + except Exception as e: + return None, url, None, f'{type(e).__name__}: {str(e)[:120]}' + + +def _classify_err(result, err): + if 'ECONNREFUSED' in err or 'Connection refused' in err: + result.update(status=EXT_STATUS_DOWN, error_msg='ECONNREFUSED β€” server down') + elif ('Name or service not known' in err or 'getaddrinfo' in err + or 'nodename' in err.lower() or 'No address' in err): + result.update(status=EXT_STATUS_DOWN, error_msg='DNS resolution failed') + elif 'timed out' in err.lower() or 'timeout' in err.lower(): + result.update(status=EXT_STATUS_DOWN, error_msg='Connection timed out') + elif 'SSL' in err or 'ssl' in err: + result.update(status=EXT_STATUS_DOWN, error_msg=f'SSL error: {err[:80]}') + else: + result.update(status=EXT_STATUS_DOWN, error_msg=f'Connection error: {err[:80]}') + return result + + +def _urls_differ(original, final): + if not final or original == final: + return False + o, f = urlparse(original), urlparse(final) + op, fp = o.path.rstrip('/'), f.path.rstrip('/') + if o.netloc == f.netloc and op == fp and o.query == f.query: + return False + if o.netloc == f.netloc and op == fp and o.scheme == 'http' and f.scheme == 'https': + return False + return True + + +def _check_dest(dest_url): + 
"""Follow redirect destination and verify it returns 200.""" + code, final, _, err = _fetch(dest_url, method='HEAD', follow_redirects=True) + if err: + return None, dest_url, err + if code in (403, 405): + code, final, _, err = _fetch(dest_url, method='GET', follow_redirects=True) + if err: + return None, dest_url, err + return code, final or dest_url, None + + +def check_url(url): + """ + Check a single URL with explicit redirect handling. + Returns dict: {url, status, http_code, final_url, error_msg} + """ + result = dict(url=url, status=EXT_STATUS_ERROR, + http_code=None, final_url=None, error_msg=None) + + # Step 1: HEAD without following redirects + code, _, location, err = _fetch(url, method='HEAD', follow_redirects=False) + if err: + return _classify_err(result, err) + + # HEAD rejected β†’ retry with GET + if code in (403, 405): + code, _, location, err = _fetch(url, method='GET', follow_redirects=False) + if err: + return _classify_err(result, err) + if code in (403, 405): + result.update(status=EXT_STATUS_ERROR, http_code=code, + error_msg=f'HTTP {code} (GET retry)', final_url=url) + return result + + result['http_code'] = code + + if code is None: + result['status'] = EXT_STATUS_ERROR + return result + if code == 200: + result.update(status=EXT_STATUS_OK, final_url=url) + elif code == 404: + result.update(status=EXT_STATUS_404, error_msg='HTTP 404', final_url=url) + elif code in (301, 302, 303, 307, 308): + dest = location or url + if dest and not dest.startswith('http'): + p = urlparse(url) + dest = f"{p.scheme}://{p.netloc}{dest}" + dest_code, dest_final, dest_err = _check_dest(dest) + if dest_err: + result.update(status=EXT_STATUS_DOWN, + error_msg=f"Redirect to {dest!r} failed: {dest_err[:80]}", + final_url=dest) + elif dest_code is None: + result.update(status=EXT_STATUS_DOWN, + error_msg=f"Redirect destination unreachable", + final_url=dest) + elif dest_code == 200: + if _urls_differ(url, dest_final): + result.update(status=EXT_STATUS_REDIRECT, 
final_url=dest_final) + else: + result.update(status=EXT_STATUS_OK, final_url=dest_final) + elif dest_code == 404: + result.update(status=EXT_STATUS_404, + error_msg=f"Redirect target returned 404", + final_url=dest) + else: + result.update(status=EXT_STATUS_ERROR, + error_msg=f"Redirect target returned HTTP {dest_code}", + final_url=dest) + elif 200 <= code < 300: + result.update(status=EXT_STATUS_OK, final_url=url) + else: + result.update(status=EXT_STATUS_ERROR, + error_msg=f'HTTP {code}', final_url=url) + + return result + + +# ───────────────────────────────────────────── +# Sitemap fetcher +# ───────────────────────────────────────────── + +def fetch_sitemap_urls(sitemap_url): + """Fetch sitemap.xml and return list of page URLs.""" + print(f"Fetching sitemap: {sitemap_url}") + try: + req = Request(sitemap_url, headers={'User-Agent': USER_AGENT}) + with urlopen(req, timeout=30) as resp: + xml_data = resp.read() + except Exception as e: + print(f"ERROR fetching sitemap: {e}", file=sys.stderr) + return [] + + urls = [] + try: + root = ET.fromstring(xml_data) + # Handle namespace + ns = '' + if root.tag.startswith('{'): + ns = root.tag.split('}')[0] + '}' + for loc in root.iter(f'{ns}loc'): + u = loc.text.strip() if loc.text else '' + if u: + urls.append(u) + except ET.ParseError as e: + print(f"ERROR parsing sitemap XML: {e}", file=sys.stderr) + + print(f" Found {len(urls)} URLs in sitemap") + return urls + + +# ───────────────────────────────────────────── +# HTML link extractor +# ───────────────────────────────────────────── + +class LinkExtractor(HTMLParser): + def __init__(self): + super().__init__() + self.links_with_text = [] # list of (href, link_text) from tags + self.text_chunks = [] # all visible text (including code blocks) for bare URL extraction + self._skip_depth = 0 # depth inside