Resources

File vs. Block vs. Object Storage - Which is better choice for Alfresco ECM?

It was Dec 2019 and the organizations across the world were shocked with the rapid spread of COVID19 and resulting lockdowns. The employees were immediately asked to work-from-home for the unknown time period and unfortunately it cannot be estimated even today. As a result, several organizations around the world have acknowledged that their staff would continue to work from home, even after the pandemic is over. A separate survey from IDC revealed that the vast majority of U.S. enterprises—87 percent—expect that employees will still work from home at least three days a week even after offices fully reopen.

Majority of the organizations were unprepared to deal with the challenges associated with the culture of work from home. There were challenges to keep the staff virtually connected with each-other, drive communication across teams, departments and organization, make announcements, and even hold meetings virtually, rather than booking a meeting room.

We must appreciate the kind of immediate support provided by Social media platforms like WhatsApp during these tiring times equally to SMBs and large enterprises. Team / Department Chat groups were created on the fly and task updates were shared on daily basis to keep everyone synced while Audio/Video calling kept the employees connected. Now with the recent announcement by WhatsApp to force their new terms of service on its users to share their privacy data with Facebook. Users only have the option to either agree to it now or delete their accounts. Most organizations took a prompt action and have issued advisories to their employees to not to share any sensitive or confidential data at the same time they have started looking for alternatives. This has added to the list of challenges to be handled on priority for the organizations.

Organizations had communication platforms like WebEx, Microsoft Teams, Zoom, WhatsApp for Business, Google Meet etc. to bridge the connectivity gap but they offered following unique challenges:

Introducing CloudMeet – Your Self-Hosted alternative to Secure and Private communication.

CloudMeet can address the current market needs completely and offers a fully secure, private, end-to-end encrypted and self-hosted unified communication platform. CloudMeet is Singapore’s first and very own unified communication platform. CloudMeet offers the following features to customers:

Some of the key differentiators which makes CloudMeet as the no-brainer choice for enterprises looking for a private and secure unified communication platform:

Currently, CloudMeet can be evaluated / purchased by directly reaching out to Crest Sales Team, however pretty soon CloudMeet would also be available on various marketplaces of AWS, Microsoft Azure, Google Cloud and IBM Cloud.

Drop us a note, if you would like to know more / evaluate the solution, at kapil.jain@crestsolution.com

The commoditization of IT hardware, the rise of cloud platforms, the emergence of hyper-convergence and the lightning-fast speed of non-volatile memory express (NVMe) flash have perhaps masked some of the fundamentals of data storage.

But, at the root, all storage is categorised as either block, file or object, with those terms derived from how data is accessed in each mode. Files, blocks, and objects are storage formats that hold, organize, and present data in different ways, with each one of them having their own capabilities and limitations.

Block-vs-file-object-storage

Figure 1: Block vs File vs Object Storage

What is a block storage?

Block storage chops data into blocks, depending on block size defined and stores them as separate pieces. Each block of data is given a unique identifier, which allows a storage system to place the smaller pieces of data wherever is most convenient.

Block storage is often configured to decouple the data from the user’s environment and spread it across multiple environments that can better serve the data. And then, when data is requested, the underlying storage software reassembles the blocks of data from these environments and presents them back to the user. It is usually deployed in storage-area network (SAN) environments and must be tied to a functioning server.

Because block storage doesn’t rely on a single path to data—like file storage does—it can be retrieved quickly. Each block lives on its own and can be partitioned so it can be accessed in a different operating system, which gives the user complete freedom to configure their data. It’s an efficient and reliable way to store data and is easy to use and manage. It works well with enterprises performing big transactions and those that deploy huge databases, meaning the more data you need to store, the better off you’ll be with block storage.

Hence block storage is the go-to option for Alfresco database, irrespective of database choice, whether its MariaDB, MySQL, PostgreSQL, MS SQL or Oracle.

There are some downsides, though, especially when utilizing against content management system with large amount of data. Block storage can be expensive. It has limited capability to handle metadata, which means it needs to be dealt with in the application or database level—adding another thing for a developer or systems administrator to worry about.

 

What is file Storage:

File storage, also called file-level or file-based storage organizes and represents data as a hierarchy of files in folders. Data is stored as a single piece of information inside a folder, just like you’d organize pieces of papers inside a folder. When you need to access that piece of data, your computer needs to know the path to find it (Beware! it can be a really long, winding path). Data stored in files is organized and retrieved using a limited amount of metadata that tells the computer exactly where the file itself is kept.

Think of a closet full of file cabinets. Every document is arranged in some type of logical hierarchy—by cabinet, by drawer, by folder, then by piece of paper. This is where the term hierarchical storage comes from, and this is file storage. File storage has broad capabilities and can store just about anything. It’s great for storing an array of complex files and is fairly fast for users to navigate.

The problem is, just like with your filing cabinet, that virtual drawer can only open so far. File-based storage systems must scale out by adding more systems, rather than scale up by adding more capacity.


What is an object storage?

Object storage, also known as object-based storage, is a flat structure in which files are broken into pieces and spread out among hardware. In object storage, the data is broken into discrete units called objects and is kept in a single repository, instead of being kept as files in folders or as blocks on servers.

Object storage emerged as a rival to file-access storage for large quantities of unstructured data when scale-out NAS file systems started to creak under the sheer number of files being stored.

Where file-access storage with its hierarchical file structure can get cumbersome as it grows, object storage brings a “flat” structure with equal access to all objects held, making it eminently suitable for large volumes of unstructured data.

There are few things to keep in mind, to be sure.

Object storage in practice

Despite what some people suggest, object storage is not an emerging technology. Data stored as objects have already approached the exabyte scale (1000 petabytes) representing trillions of objects. Companies like Amazon (with S3) provide object storage via its public cloud platform at massive scale, while object storage can be implemented in on-premise data center using a variety of open source and commercial products like MinIO’s High Performance Object Storage, to cater to needs of Alfresco like CMS.

Conclusion:

Now that we’ve described what file storage, block storage and object storage are, we conclude that while block storage is a must for Alfresco’s database, Object storage is the ideal choice for Alfresco’s content store with following benefits:

Crest_Infosolutions_logo

ABOUT THE AUTHOR

Crest Infosolutions is best described as a team of as youthful, ambitious, passionate and committed technology professionals

Like this Article? Don't forget to share it!

Facebook
Twitter
LinkedIn