
[Feature]: 2024 plan #3064

Open
1 task done
leonrayang opened this issue Feb 1, 2024 · 5 comments

leonrayang commented Feb 1, 2024

Contact Details

No response

Is there an existing issue for this?

  • I have searched all the existing issues

Is your feature request related to a problem? Please describe.

2024 plan

Describe the solution you'd like.

| Feature | Type | Version | Status | Branch | Release Date |
| --- | --- | --- | --- | --- | --- |
| Automatic migration | Stability | Release-3.4.0 | QA Testing | develop-v3.4.0 | JUNE |
| Snapshot | Feature | Release-3.4.0 | QA Testing | develop-v3.4.0 | JUNE |
| Hybrid Cloud automatic data hierarchy | Cost optimization | Release-3.5.0 | QA Testing | develop-hybridcloudlifecycle | JULY |
| Distributed Cache | Feature | Release-3.6.0 | Self-Testing/Unit Test | flash_cache | AUG |
| Metanode persist with RocksDB | Cost optimization | Release-3.6.0 | Self-Testing/Unit Test | metanode_rocksdb_dev | AUG |
| RDMA | Performance | Release-3.6.0 | Self-Testing/Unit Test | cubefs-rdma | AUG |
| Kernel FileSystem Client and GPU Direct Storage | Performance | Release-3.7.0 | Self-Testing/Unit Test | cubefs-kernel-rdma | OCT |
| Call Chain | Feature | Release-3.7.0 | Self-Testing/Unit Test | blobstore-tracelog | OCT |

Architecture refactoring (high priority)

  1. Storage engine refactoring: rebuild the storage engine as an append-only file system, giving data reads and writes lower latency and higher throughput (see the sketch after this list).
  2. Hybrid cloud: the hybrid cloud project supports a unified namespace, allows multiple storage systems to be used together, and exposes S3 and HDFS interfaces externally. Lifecycle policies drive data flow between different media, storage types, and on/off the cloud, reducing cost and increasing efficiency. The first release will be available soon.
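
As a rough illustration of the append-only direction, here is a minimal Go sketch of an append-only segment store. The `AppendStore` type and its methods are hypothetical, not part of CubeFS's actual storage engine: writes only ever land at the tail, and readers address records by offset.

```go
package main

import (
	"encoding/binary"
	"fmt"
	"os"
)

// AppendStore is a hypothetical append-only segment: writes only ever go to
// the tail, and readers address records by the offset returned at write time.
type AppendStore struct {
	f    *os.File
	size int64
}

func Open(path string) (*AppendStore, error) {
	f, err := os.OpenFile(path, os.O_CREATE|os.O_RDWR|os.O_APPEND, 0o644)
	if err != nil {
		return nil, err
	}
	st, err := f.Stat()
	if err != nil {
		return nil, err
	}
	return &AppendStore{f: f, size: st.Size()}, nil
}

// Append writes a length-prefixed record at the tail and returns its offset.
func (s *AppendStore) Append(data []byte) (int64, error) {
	var hdr [4]byte
	binary.BigEndian.PutUint32(hdr[:], uint32(len(data)))
	offset := s.size
	if _, err := s.f.Write(append(hdr[:], data...)); err != nil {
		return 0, err
	}
	s.size += int64(len(hdr) + len(data))
	return offset, nil
}

// ReadAt reads back the record that starts at offset.
func (s *AppendStore) ReadAt(offset int64) ([]byte, error) {
	var hdr [4]byte
	if _, err := s.f.ReadAt(hdr[:], offset); err != nil {
		return nil, err
	}
	buf := make([]byte, binary.BigEndian.Uint32(hdr[:]))
	_, err := s.f.ReadAt(buf, offset+4)
	return buf, err
}

func main() {
	s, _ := Open("segment.dat")
	off, _ := s.Append([]byte("hello"))
	rec, _ := s.ReadAt(off)
	fmt.Println(off, string(rec))
}
```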

Improved stability and reliability

  1. Disk CRC enhancement: strengthen CRC checking in scenarios such as master-slave synchronization and random writes (a minimal sketch follows this list).
  2. Automatic disk migration: reduce metadata atomicity problems during migration and raise the level of operational automation.
  3. Strengthened monitoring and alerting for system modules to improve observability.
  4. The data node gains learner replicas and supports multi-active deployment within the same city.
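
For item 1, a minimal sketch of per-block CRC verification between two replicas, built on Go's standard `hash/crc32`. The helper names are illustrative only and this is not the actual datanode implementation.

```go
package main

import (
	"fmt"
	"hash/crc32"
)

// blockCRC computes a CRC32 (Castagnoli) checksum for one data block.
func blockCRC(block []byte) uint32 {
	return crc32.Checksum(block, crc32.MakeTable(crc32.Castagnoli))
}

// verifyReplica compares leader and follower block checksums and returns the
// indexes of blocks that disagree and would need repair or re-sync, e.g.
// after random overwrites on one replica.
func verifyReplica(leader, follower [][]byte) (bad []int) {
	for i := range leader {
		if i >= len(follower) || blockCRC(leader[i]) != blockCRC(follower[i]) {
			bad = append(bad, i)
		}
	}
	return bad
}

func main() {
	leader := [][]byte{[]byte("block-0"), []byte("block-1")}
	follower := [][]byte{[]byte("block-0"), []byte("BLOCK-1")} // simulated corruption
	fmt.Println("blocks needing repair:", verifyReplica(leader, follower))
}
```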

Performance improvements

  1. Full-link acceleration to better support scenarios such as storage-compute separation for databases and AI training acceleration.
  2. Client: provide a kernel-space client with GDS (GPU Direct Storage) and RDMA support to reduce I/O latency and CPU overhead.
  3. Server: rebuild the communication layer on RDMA to reduce overall read/write latency and improve throughput.
  4. Distributed cache: further optimize the multi-level distributed cache architecture to support cross-data-center and cross-cloud read/write acceleration for AI training workloads.
  5. Optimize the read and write paths of the existing TCP-based transport.
  6. Optimize the performance of the client-side local (level-one) cache; a read-through sketch follows this list.
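
For the cache-related items, a minimal read-through sketch of a level-one (local, in-memory) cache sitting in front of a slower tier. The `Tier` interface and all type names here are hypothetical and do not reflect CubeFS's client or flash cache APIs.

```go
package main

import (
	"fmt"
	"sync"
)

// Tier is a hypothetical read interface; the lower tier could stand in for a
// distributed cache layer or the backing volume.
type Tier interface {
	Get(key string) ([]byte, bool)
}

// LocalCache is a level-one, in-memory cache placed in front of a slower tier.
type LocalCache struct {
	mu    sync.Mutex
	data  map[string][]byte
	lower Tier
}

func NewLocalCache(lower Tier) *LocalCache {
	return &LocalCache{data: make(map[string][]byte), lower: lower}
}

// Get serves hits from memory and reads through to the lower tier on a miss,
// populating the local copy so repeated reads (e.g. training epochs) are fast.
func (c *LocalCache) Get(key string) ([]byte, bool) {
	c.mu.Lock()
	if v, ok := c.data[key]; ok {
		c.mu.Unlock()
		return v, true
	}
	c.mu.Unlock()
	v, ok := c.lower.Get(key)
	if ok {
		c.mu.Lock()
		c.data[key] = v
		c.mu.Unlock()
	}
	return v, ok
}

// remoteTier simulates a cross-data-center or cross-cloud tier for the example.
type remoteTier struct{ data map[string][]byte }

func (r remoteTier) Get(key string) ([]byte, bool) { v, ok := r.data[key]; return v, ok }

func main() {
	remote := remoteTier{data: map[string][]byte{"sample": []byte("training shard")}}
	cache := NewLocalCache(remote)
	v, _ := cache.Get("sample") // first read: local miss, fetched from the remote tier
	v, _ = cache.Get("sample")  // second read: served from the level-one cache
	fmt.Println(string(v))
}
```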

Features

  1. Metadata storage backed by RocksDB, with the full in-memory metadata cache replaced by on-demand caching to reduce memory overhead (see the sketch after this list).
  2. The erasure coding subsystem removes the Kafka dependency and provides an SDK for direct client access, shortening the data transmission path.
  3. Event notification, S3 API QoS, ObjectNode audit logging, cross-region replication, and QPS/bandwidth metering and billing capabilities.
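
For item 1, a minimal sketch of on-demand metadata caching: an LRU cache in front of a persistent key-value store. The `KVStore` interface is a stand-in for RocksDB (no real binding is shown), and every name here is illustrative rather than actual metanode code.

```go
package main

import (
	"container/list"
	"fmt"
)

// KVStore is a hypothetical persistent store interface; in the roadmap the
// real backend would be RocksDB, but any key-value engine fits the sketch.
type KVStore interface {
	Get(key string) (string, bool)
}

// memStore is an in-memory stand-in for the persistent engine.
type memStore map[string]string

func (m memStore) Get(key string) (string, bool) { v, ok := m[key]; return v, ok }

// MetaCache keeps only the hottest entries in memory (LRU) and falls back to
// the persistent store on a miss, instead of caching all metadata up front.
type MetaCache struct {
	capacity int
	order    *list.List               // front = most recently used
	items    map[string]*list.Element // key -> element holding [2]string{key, value}
	store    KVStore
}

func NewMetaCache(capacity int, store KVStore) *MetaCache {
	return &MetaCache{capacity: capacity, order: list.New(), items: make(map[string]*list.Element), store: store}
}

func (c *MetaCache) Get(key string) (string, bool) {
	if el, ok := c.items[key]; ok {
		c.order.MoveToFront(el)
		return el.Value.([2]string)[1], true
	}
	v, ok := c.store.Get(key)
	if !ok {
		return "", false
	}
	c.items[key] = c.order.PushFront([2]string{key, v})
	if c.order.Len() > c.capacity { // evict the coldest entry
		oldest := c.order.Back()
		c.order.Remove(oldest)
		delete(c.items, oldest.Value.([2]string)[0])
	}
	return v, true
}

func main() {
	store := memStore{"/a/inode": "1001", "/b/inode": "1002", "/c/inode": "1003"}
	cache := NewMetaCache(2, store)
	fmt.Println(cache.Get("/a/inode"))
	fmt.Println(cache.Get("/b/inode"))
	fmt.Println(cache.Get("/c/inode")) // evicts the least recently used entry
	fmt.Println(cache.Get("/a/inode")) // re-read from the store after eviction
}
```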

Describe an alternate solution.

No response

Anything else? (Additional Context)

No response

@leonrayang added the enhancement (New feature or request) label Feb 1, 2024
@leonrayang self-assigned this Feb 1, 2024
@xiaochunhe (Contributor) commented:

It is recommended to update this to roadmap.md with a more concise description


bladehliu commented Feb 2, 2024

The top 2 things in 2024:

  • Move CubeFS forward to be more cloud-native, so it can manage multiple data sources: its own private cloud storage and/or S3-like public cloud storage
  • Run more analytical/search databases on top of CubeFS to enable separation of storage and compute

@sejust pinned this issue Feb 2, 2024

bladehliu commented Feb 3, 2024

Let's make CubeFS the best solution for separation of storage and computing.

  • WAL optimization as a high priority

@guohao-rosicky commented:

Hi @leonrayang, thanks for working on this. I'm interested in these two features; is there a design document for them?

Hybrid cloud: Hybrid cloud projects support a unified namespace, provide the ability to use multiple storage systems in a mixed manner, and provide external S3 and HDFS capabilities. Support life cycle driven data flow between different media, storage types, and on and off the cloud, reducing costs and increasing efficiency. The first issue will be released soon.

Distributed cache: further optimize the distributed multi-level cache architecture to support cross-computer room and cross-cloud read and write acceleration capabilities to support AI training acceleration needs.

@tengallonhead-lv commented:

Let's make CubeFS the best solution for separation of storage and computing.

  • WAL optimization as high priority

Hi @bladehliu, thanks for working on this. I'm interested in this work; is there a design document for it?

@leonrayang unpinned this issue Apr 28, 2024