HDFSCachePool

Hadoop Distributed File System (HDFS) has slow serial read speed of the log stream file, it cannot meet the requirements of the stream processing system and can not fully exert the processing capability of the stream processing system, thus restricting the stream processing system. Improve performance. On the basis of HDFS, this software adopts parallel data transmission methods to fully utilize network bandwidth and effectively improve the throughput of distributed file system and meet the demand of stream processing system for data source throughput. At the same time, with respect to the highly ordered nature of the log stream, the software controls the timing of the data to ensure the timing of the log file and provides high-speed and orderly data input for the stream processing system.

License

HDFSCachePool is released under the Apache 2 license.

Project Committers

Jie Tan (@tjmaster)

Author and Copyright

HDFSCachePool is developed in Cluster and Grid Computing Lab, Services Computing Technology and System Lab, Big Data Technology and System Lab, School of Computer Science and Technology, Huazhong University of Science and Technology, Wuhan, China by Jie Tan([email protected])

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
.idea		.idea
hdfsCachePool-core		hdfsCachePool-core
hdfsCachePool-kafka		hdfsCachePool-kafka
hdfsCachePool-storm		hdfsCachePool-storm
README.md		README.md
hdfsCachePool.iml		hdfsCachePool.iml
pom.xml		pom.xml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

HDFSCachePool

License

Project Committers

Author and Copyright

About

Releases

Packages

Languages

fengruotj/HdfsCachePool

Folders and files

Latest commit

History

Repository files navigation

HDFSCachePool

License

Project Committers

Author and Copyright

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages