Proposal: Peer Hint URI scheme

sebaseba · January 9, 2019, 4:50pm

From what I understand how IPFS works, adding files on IPFS works like this:

We put a file to IPFS on computer A, it computes [hash]
We enter ipfs://[hash] on computer B
Computer B starts the search on DHT where [hash] is located, this takes a while
When it does, it connects to the peer containing the [hash] and starts to download it (assuming it’s a single chunk file)

OK, but the thing is that when we first enter the IPFS address on the first other computer, i.e. computer B in this case, it takes 10-20 min for smaller files to several hours for bigger files to be quickly available to computer B, because of the slow DHT search. After that, it is quickly available to computer C, D, …

This I think is unacceptable for quickly sharing files with someone or dynamic applications.

Instead, a more rational way for this would be to optionally add to the URI a hint where the file is available, i.e. which peer is the originator. This would hasten the download while the file isn’t popular yet.

For example instead of ipfs://[hash]/ we could have ipfs://[hash]?peer=1.2.3.4/4001 or something similar. Which would mean that the peer 1.2.3.4:4001 has the [hash]. That way IPFS could do a DHT search + automatically add the 1.2.3.4 peer as having the [hash] and thereby speed up the whole process of initially puting files on IPFS.

What do you think?
Am I misunderstanding how IPFS works?
Is this a bad idea?

I bounced the idea on #ipfs@freenode and the ones that responded seemed to like it.

timmc · January 9, 2019, 6:29pm

I like this idea, and I think there might be some things to learn here from the magnet: URIs that bittorrent uses.

lidel · May 19, 2019, 5:53pm

16 posts were split to a new topic: Debugging slow content discovery

sebaseba · January 9, 2019, 11:41pm

Yes, magnet has the alternative download location and tracker url. This would be similar.

swedneck · January 19, 2019, 12:13am

I just want to note that this is exactly how matrix handles extra discoverability, which is kinda interesting.

github.com

matrix-org/matrix-doc/blob/master/proposals/1704-matrix.to-permalinks.md

# matrix.to permalink navigation

Currently Matrix uses matrix.to URIs to reference rooms and other entities in a
permanent manner. With just a room ID, users can't get into rooms if their server
is not already aware of the room. This makes permalinks to rooms or events difficult
as the user won't actually be able to join. A matrix.to link generated using a
room's alias is not a permanent link due to aliases being transferable.

In lieu of an improved way to reference entities permanently in Matrix, a new parameter
is to be added to matrix.to URIs to assist clients and servers receiving permanent links
in joining the room.

For reference, existing permalinks look like this:

```
https://matrix.to/#/!somewhere:example.org
https://matrix.to/#/!somewhere:example.org/$something:example.org
```

By adding a new parameter to the end, receivers can more easily join the room:

This file has been truncated. show original

lidel · May 19, 2019, 7:03pm

Interesting idea.

Analog to magnet links is obvious, but in IPFS there are two types of hints we could think about:

Just PeerID(s)

QmSoLnSGccFuZQJzRadHn95W2CrSFmZuTdDWP8HXaHca9z

speeds up content discovery by providing a hint for connecting to specified peers
short
requires additional lookup to discover peer’s multiaddrs

Peer Multiaddrs

/ip4/104.236.176.52/tcp/4001/ipfs/QmSoLnSGccFuZQJzRadHn95W2CrSFmZuTdDWP8HXaHca9z

speeds up content discovery by providing a hint to try to connect to specified peers using specific route/protocol
does not require DHT call if provided multiaddr works
includes PeerID, so in case specific multiaddr can’t be used, alternative ones can be retrieved from DHT
URI gets very very long

Universal Provider Hint: “provs=”

Given the fact multiaddr will always start with /, we could support both PeerIDs and multiaddrs under a single URI parameter:

ipfs://{cid}?provs={multiaddr1},{peerId2},{peerId3}`

Open Questions / Concerns

There are some unknowns, so if we introduce support for this type of hint to tools like IPFS Companion, we should make it an opt-in experiment (ipfs-companion/issues/722)

Privacy

While I see the appeal of this speeding up things like sharing apps, I am bit worried that this type of hint becomes a p2p version of ping attribute used for tracking. Imagine link-specific peers that are listed as provs but do not host data, just log IDs of peers connecting to it

Unintended DDoS

Link to popular content could cause DDoS of peers listed in URI hint.
Similar concerns were raised in this discussion about seed services.

jasonzhouu · May 28, 2019, 11:08am

@lidel @sebaseba
Recommend to add this solution to the newly created meta issue below.

jasonzhouu · July 18, 2019, 12:09am

IPFS team is considering to implement similar feature in IPFS gateway.
So after we ipfs add some data in node A, then we can fetch it in IPFS gateway, with the IP address of node A specified, so as to help the gateway find the data faster.

Topic		Replies	Views
Torrent like usage? Kubo go-ipfs	7	1252	December 14, 2021
How does ipfs get works? Help	3	317	November 23, 2020
Does IPFS have a unique hash for every possible file? go-ipfs , multihash , files	3	2479	September 27, 2018
About the availability and distribution of IPFS IPFS	3	480	July 1, 2019
Wrapping my head around IPFS basics for newbies ipns	9	1584	December 5, 2017