How many CIDs can be direct children of a CID?

How many files or directories can be placed as direct children of a CID?
Is it possible to have thousands, millions, or billions of CIDs as first-level children of another CID?

There is a limit unless you enable sharding: go-ipfs/experimental-features.md at master · ipfs/go-ipfs · GitHub

The limit is reached when the directory block grows larger than 4MB (ipfs block stat <cid> tells you how big a block is). Such blocks can still be created locally, but they will not move around the network: no single block bigger than 4MB can be transferred to other nodes.


And does this sharding not introduce other problems, like IPNS, which is a solution but an unusably slow one?
Wouldn't you recommend a hierarchical folder layout in which the user avoids giving a CID more than a certain number of children, instead of using sharding, which, since it is not active by default, I assume puts some sort of burden on the system?

The main thing is that with sharding enabled the CIDs of the generated folders are different. That is not really a burden on the system; it is just a different way of building the DAGs.

The reasons why it is not active by default are in the doc I linked, the main one being that it uses the new format even when not needed (I think).

If you can live without enabling sharding then do, but it is there for the times when having folders with lots of links becomes a hard requirement.

If we're talking about not just more than 4MB but potentially millions, billions, or trillions of children, is sharding still good, or is another strategy needed?
And is there an estimate of how many children add up to about 4MB?

If we're talking about not just more than 4MB but potentially millions, billions, or trillions of children, is sharding still good, or is another strategy needed?

The difference between sharding and not sharding is that sharding makes the needed hierarchical structure transparent. In both cases there is a hierarchical structure. The same kind of issues will appear when you list the items in a sharded folder containing a trillion entries as when you recursively list a trillion items spread across a manually created hierarchy of folders.

I don’t know other approaches, other than not having a trillion objects.

And is there an estimate of how many children add up to about 4MB?

A ballpark would be 4MB / 34 bytes ≈ 100k directories? Please double-check.
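The ballpark above is easy to reproduce. Note that the ~34 bytes per entry is an assumption (roughly the size of one CID); a real dag-pb link entry also stores the child's name and size, so entries are typically somewhat larger and the real number of links per block somewhat smaller:

```python
# Rough estimate of how many directory links fit inside a 4 MiB block,
# assuming ~34 bytes per link entry (an assumption: about the size of a
# CID alone; real dag-pb links also carry a name and a size field).
BLOCK_LIMIT = 4 * 1024 * 1024   # the 4 MiB block-size limit
BYTES_PER_LINK = 34             # assumed bytes per link entry

links = BLOCK_LIMIT // BYTES_PER_LINK
print(links)  # → 123361, i.e. on the order of the ~100k ballpark above
```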


Any estimate of how many folders before things get weird and unusable (using sharding)?

I have no idea; it depends directly on machine specs, type of storage, etc.


And does this 4MB limit, or any other limit, apply to the number of objects one can place in an IPFS node's root?

Yes. By default, when adding a file to IPFS, it gets chunked into 256KB pieces. If those pieces were larger than 4MB, they would not be able to move around the network. This applies to any raw object (i.e. something manually created with ipfs dag put etc.).
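The size-based chunking described here can be sketched in a few lines. This is only an illustration of the default fixed-size strategy (IPFS also offers content-defined chunkers such as rabin, which this sketch does not model):

```python
def chunk(data: bytes, size: int = 256 * 1024) -> list[bytes]:
    """Split a byte string into fixed-size pieces, like the default
    size-based chunker; only the last piece may be smaller."""
    return [data[i:i + size] for i in range(0, len(data), size)]

# A 600 KiB payload becomes two full 256 KiB chunks plus an 88 KiB tail.
pieces = chunk(b"x" * (600 * 1024))
print([len(p) for p in pieces])  # → [262144, 262144, 90112]
```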

100K × 256KB = 25.6GB
That means a file bigger than ~25.6 GB could hit this problem.
But my question is about the number of objects at the root of an IPFS node, not the number of chunks in a CID.
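As a quick sanity check of the arithmetic: the 25.6 GB figure uses decimal units (256 KB = 256,000 bytes), while the actual default chunk size is 256 KiB (262,144 bytes), which pushes the figure slightly higher:

```python
LINKS = 100_000                  # ballpark number of links per block
CHUNK_KB = 256 * 1000            # 256 KB (decimal), as used in the post
CHUNK_KIB = 256 * 1024           # 256 KiB, the actual default chunk size

print(LINKS * CHUNK_KB / 1e9)    # → 25.6  (GB, the figure quoted above)
print(LINKS * CHUNK_KIB / 1e9)   # → 26.2144  (GB with binary-sized chunks)
```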

Is this a good method to avoid the 4MB limit problem:

instead of :
/3463901
have this :
/3/4/6/3/9/0/1/=
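The manual layout above can be sketched as splitting an identifier into one directory level per character, so no single directory accumulates too many children (shard_path is a hypothetical helper, and the trailing "=" marker is taken from the example):

```python
def shard_path(name: str) -> str:
    """Turn an identifier into a nested path, one directory per character,
    ending in a leaf marker, so fan-out per directory stays small."""
    return "/" + "/".join(name) + "/="

print(shard_path("3463901"))  # → /3/4/6/3/9/0/1/=
```

With a 10-character alphabet this caps each directory at 10 subdirectories, trading directory width for depth.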

Note that you can make your chunks bigger too (up to 4MB).

That is essentially what directory sharding does in a way that is transparent to the user.


So how would you compare this "transparent to the user" sharding to the sharding IPFS does itself?
Especially from a performance point of view: is it the same, better, or worse?