Howdy, Stranger!

It looks like you're new here. If you want to get involved, click one of these buttons!

Sign In with Google

Poll

No poll attached to this discussion.

In order to better support our growing community we've set up a new more powerful forum.


The new forum is at: http://community.covecube.com


The new forum is running IP.Board and will be our primary forum from now on.


This forum is being retired, but will remain online indefinitely in order to preserve its contents. This forum is now read only.


Thank you,

[Tech] File Duplication

edited February 2013 in DrivePool
I've seen a number of questions here on the forum and in my support tickets about how file duplication works in DrivePool.

I'd like to make a post here that describes the inner workings of file duplication, as it is in the latest versions of DrivePool (1.3, 2.0), which has evolved some since the original implementation.

For those of you who have been with DrivePool from the beginning, starting some years ago, you may recall that the original technique for determining whether a folder was duplicated or not was the dot suffix. It was literally as simple as appending a .1 or a .2 to the end of a folder name. This was very simple, reliable and worked great.

The problem was that the Windows Home Server 2011 enforces strict naming requirements for any shared folder, so appending strange digits to folder names was not very compatible with it.

Eventually DrivePool moved to using alternate streams in order to determine the duplication count for folders. You can think of alternate streams as hidden files that are attached to folders (or other files).

Today, we actually write the duplication count into a special stream that we call a "directory tag". You can see these special "tags" by issuing a dir /r command on a duplicated folder.

In particular, the stream that contains the duplication count is called DuplicationCount.Tag.CoveFs.

So what does this stream contain?

Well, at first, all it had was a number, either 1 or 2. That simply said how many copies of files should be maintained under this directory (much as the original system did).

But as we moved forward, we had the need to develop a system that would allow for arbitrary duplication counts on any sub-folder. It would also have to be done efficiently in memory, without querying the disk.

This is the system that is implemented today in DrivePool 1.X and 2.X.

The system works following these rules:
  • Any folder can have an explicit duplication count, or specify that it is inheriting the duplication count of the parent folder. Explicit duplication counts are specified using a number >= 1, and inheritance is indicated using an I.
  • In addition, each folder that specifies a duplication count, can indicate whether its sub-folders specify additional duplication properties or simply automatically inherit the current folder's duplication count (whether inherited from the parent or explicitly specified). We call this a Multiple flag, meaning that the sub-folders of this folder exhibit multiple duplication levels.
  • The root folder of a pool implicitly inherits the duplication count of 1, from an imaginary parent folder.
That's it. DrivePool just uses these rules to determine the duplication level of every file on the pool.

The rules allow us to build an efficient caching system to determine the duplication count of every file on the pool, without explicitly reading the duplication count from the disk, on each file open.

Now I know the rules are confusing the first time that you read them, so here's an example:

\ - MI - Multiple, Inhetit 
    (this folder inherits the duplication count from the parent (1) and specifies that its sub-folders have additional duplication counts)

\ServerFolders\ - MI Multiple, Inherit

\ServerFolders\Documents - 1 (one copy of every file under this folder)

\ServerFolders\Videos - 2 (two copies of every file under this folder)

\ServerFolders\... - (other folder would simply inherit the duplication count from the parent)

Comments

  • Member
    Hey Alex

    Is there any way to manipulate the inherit/multiple flags on a folder so that it will start inheriting again?

    It seems right now that once you've set an explicit duplication level on a folder there is no way to get it to start inheriting from the parent again. Or is this a planned feature?  (same function as ACL ~replace all child objects with inheritable permissions)


  • Covecube
    Henrik,

    There is currently no way to manipulate the "multiple" and "inherit" flags directly.

    We simply have a way to toggle duplication on and off per folder and those flags are set automatically for you.

    But I do realize that the system could be more flexible and I do have plans to improve it in the future.
  • Member
    Hello,

    Alex, you seem to indicate that the duplication count can be greater than 2.
    E.g. if my pool contain 4 disks, can I setup duplication to 4 so that a folder is fully available on my 4 disks?

    I do not find any option to do that in the drivepool UI; is it possible to do so using a command line?

    I am very intersted in this feature, and that would definitely make me jump from demo to a paid license ;)

    Thanks for the answer...
  • edited May 2013 Member
    Alex, you seem to indicate that the duplication count can be greater than 2.
    E.g. if my pool contain 4 disks, can I setup duplication to 4 so that a folder is fully available on my 4 disks?
    I do not find any option to do that in the drivepool UI; is it possible to do so using a command line?
    yes; you can do that with the "dpcmd" command line tool.

    dpcmd set-duplication <path-to-folder> #

    for example:
    dpcmd set-duplication P:\quad 4

    and check the status of any file/folder with:
    dpcmd get-duplication <path-to-folder>

    It will also show up as 4x in the GUI but you can only set 1x (disable duplication) and 2x (enable duplication) from the GUI.
  • edited May 2013 Member
    Thank you very much for the fast answer.
    This is great that it is supported, makes Drivepool a very good challenger to Greyhole under Windows.
    I will test that asap in my VM.

    That with the feeder plugin, that seem to act as the landing zone of Greyhole, fit all of my needs.
    Adding to that integration to Windows and a straightforward UI, and we have a winner ;)

    Why don't you make the duplication count setting available directly from the UI ? That's not such a big deal to propose a combo-box instead of an activate button...
Sign In or Register to comment.