Fwd: [jira] [Created] (CARBONDATA-1051) why sort_columns?


Liang Chen

---------- Forwarded message ----------
From: Liang Chen <[hidden email]>
Date: 2017-05-13 17:13 GMT+08:00
Subject: Re: [jira] [Created] (CARBONDATA-1051) why sort_columns?
To: [hidden email], [hidden email], Sehriff <[hidden email]>

Hi Sehriff

Good question.

First, please check this doc: http://carbondata.apache.org/useful-tips-on-carbondata.html and see if it helps you understand CarbonData's index usage.

As you mentioned, 1.2 will introduce the sort_columns feature, which lets users more easily specify which columns the index should be built on. For example: "create table(c1...c7)  tblproperties('sort_columns' = 'c7,c3')".

As you know, before 1.1 CarbonData by default builds the MDK index following the order of the columns in the create table statement. This behaviour will be kept in future versions, so you can still use "create table (c1,c2,...c7)" to build the index.
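
For illustration, the two options might be written out roughly as follows. This is only a sketch: the table names t1 and t2, the STRING column types, and the STORED BY 'carbondata' clause are assumptions added for the example and are not taken from this thread, so please check them against the DDL documentation for your version.

```sql
-- Option 1: rely on the default behaviour described above.
-- The index follows the column order of the create table statement,
-- so the most frequently filtered columns are declared first.
-- (Table name and column types are hypothetical.)
CREATE TABLE t1 (
  c7 STRING, c3 STRING,
  c1 STRING, c2 STRING, c4 STRING, c5 STRING, c6 STRING
)
STORED BY 'carbondata';

-- Option 2: keep the original column order from the existing script
-- and name the index columns explicitly with sort_columns (planned for 1.2).
CREATE TABLE t2 (
  c1 STRING, c2 STRING, c3 STRING, c4 STRING,
  c5 STRING, c6 STRING, c7 STRING
)
STORED BY 'carbondata'
TBLPROPERTIES ('sort_columns' = 'c7,c3');
```

With the second form the column list of an existing script stays untouched, and only the tblproperties clause is added.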



2017-05-13 13:55 GMT+08:00 sehriff (JIRA) <[hidden email]>:
sehriff created CARBONDATA-1051:

             Summary: why sort_columns?
                 Key: CARBONDATA-1051
                 URL: https://issues.apache.org/jira/browse/CARBONDATA-1051
             Project: CarbonData
          Issue Type: Improvement
            Reporter: sehriff

Since we already have many RDBMS SQL scripts, we don't want to change them too much when migrating to Carbon.
Suppose we already have a SQL script like:
"create table (c1,c2,...c7)"
If we want to change the column order to move the most frequently queried columns ahead, I think it's better to change it like this:
"create table(c1,c2,c7,c4..c6,c3)"
rather than:
"create table(c1...c7)  tblproperties('sort_columns' = 'c7,c3')"
because the former just reorders the columns without adding extra settings, which is more readable to those who are familiar with RDBMS but not with Carbon.

This message was sent by Atlassian JIRA