We started to build JuiceFS since 2016, released it as a SaaS solution in 2017. ...

bogomipz · on Jan 13, 2021

Hi. I'm pretty interested and excited about this project. Under "Credits" the project states:

>"The design of JuiceFS was inspired by Google File System, HDFS and MooseFS, thanks to their great work."

Would you consider writing up a design doc for JuiceFS. I would be interested to know more about what specific implementation ideas you used for each of those if any, design choices and tradeoffs made, learnings etc. It would make a great blog post. Cheers.

daviesliu · on Jan 13, 2021

The whole idea was came from GFS: separate the metadata and data, load all the meta into memory, single meta server for simplicity, fixed-size chunk.

The POSIX and FUSE stuff was learned from MooseFS, but changed to use read-only chunk, and merge them together, and do compaction in background. Since most of object storage provide eventual consistency, the model work pretty well, also simplify the burden on cache eviction. In order to access object store in parallel, we divide the chunk into smaller blocks (4MB), which is also a good unit for caching.

The Hadoop SDK (not released yet) was learned from HDFS.

One key thing in the implementation is to use Redis transaction to guarantee atomicy on metadata operations, otherwise we will get into millions of random bugs.

bogomipz · on Jan 13, 2021

Thanks. Is the Hadoop SDK part of the official Hadoop project? I wasn't able to find any information about it.

daviesliu · on Jan 13, 2021

No, The Hadoop SDK is a connector to access JuiceFS, is built using JuiceFS client, will be open sourced later on, stay tuned.

crest · on Jan 13, 2021

What's the recommended Redis configuration for JuiceFS?

daviesliu · on Jan 13, 2021

Redisraft [1] is a good thing to expect.

[1] https://github.com/RedisLabs/redisraft

daviesliu · on Jan 13, 2021

Right now, the hosted Redis is the easiest one.

If you run it on your own, please pay attention on the persistency options and HA solution, there should be plenty of article on these.

yossigo · on Jan 13, 2021

Congratulations for this work! Do you have some documentation describing how the metadata is modelled in Redis?

daviesliu · on Jan 13, 2021

They are summaried here: https://github.com/juicedata/juicefs/blob/main/pkg/redis/red...

crest · on Jan 13, 2021

1. Does JuiceFS support TLS encrypted Redis?

2. Is there any access control and if there is who enforces it?

daviesliu · on Jan 13, 2021

> 1. Does JuiceFS support TLS encrypted Redis?

Yes, please use `rediss://host:port/`

> 2. Is there any access control and if there is who enforces it?

You may specify port 0 to disable the non-TLS port completely. To enable only TLS on the default Redis port, use:

-port 0 -tls-port 6379

pjmlp · on Jan 13, 2021

Congratulations on picking Go instead of being yet another C project.

Good luck with the project.