Extracts one column from the given CSV file and prints the frequencies of its values.
Version | 1.3 |
---|---|
Bundle | tools |
Categories | Analysis |
Authors | Marko Laakso (Marko.Laakso@Helsinki.FI) |
Requires | csbl-javatools.jar (jar) ; installer (bash) |
Source files | component.xml |

Inputs

Name | Type | Mandatory | Description |
---|---|---|---|
table1 | CSV | Optional | The first input relation |
table2 | CSV | Optional | The second input relation |
table3 | CSV | Optional | The third input relation |
table4 | CSV | Optional | The fourth input relation |
table5 | CSV | Optional | The fifth input relation |
table6 | CSV | Optional | The sixth input relation |
table7 | CSV | Optional | The seventh input relation |
table8 | CSV | Optional | The eighth input relation |
table9 | CSV | Optional | The ninth input relation |
array | Array<CSV> | Optional | An array of input files |

Outputs

Name | Type | Description |
---|---|---|
out | Matrix | IDs and their frequencies |
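
The core behaviour described above, picking one column and counting its values, can be sketched roughly as follows. This is a minimal Python illustration, not the component's own implementation: the function name `id_frequencies`, the tab delimiter, and the handling of whitespace and empty values are assumptions. Splitting cells on commas mirrors the documented default of `isList`, and an empty column name selects the first column as documented for `columnIn`.

```python
import csv
import io
from collections import Counter


def id_frequencies(csv_text, column="", is_list=True, delimiter="\t"):
    """Count how often each value occurs in one column of a delimited table.

    Hypothetical helper; the tab delimiter is an assumption.
    """
    reader = csv.reader(io.StringIO(csv_text), delimiter=delimiter)
    header = next(reader)
    # An empty column name refers to the first column of the file.
    index = header.index(column) if column else 0
    counts = Counter()
    for row in reader:
        cell = row[index]
        # With is_list set, a cell such as "A,B" contributes two values.
        values = cell.split(",") if is_list else [cell]
        # Assumption: surrounding whitespace and empty values are ignored.
        counts.update(v.strip() for v in values if v.strip())
    return counts


table = "name\tscore\nA,B\t1\nB\t2\nC,A,B\t3\n"
freq = id_frequencies(table, column="name")
for value, count in sorted(freq.items()):
    print(value, count)
```

With the sample table, `A` is counted twice, `B` three times, and `C` once. Passing `is_list=False` would instead count the whole cells `A,B`, `B`, and `C,A,B` once each.
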

Parameters

Name | Type | Default | Description |
---|---|---|---|
acceptMissing | boolean | false | Files with missing columnIn are accepted as empty if this is true. |
columnIn | string | "" | A comma-separated list of column names for the IDs of interest in each file. Empty values refer to the first column of the file. |
columnInArray | string | "" | A comma-separated list of array_key=column_name pairs for the IDs of interest in array files. Empty values refer to the first column of the file. |
columnOut | string | "" | Name of the identifier column of the output list. Empty input refers to the name of the input column. |
isList | boolean | true | True if the selected column contains a comma-separated list of values to be split. |
quotation | boolean | false | Indicator that can be used to disable quotation of the output values. |
regexp1 | string | "" | Regular expression for the row filtering in table1. A row is included in the result if this parameter is empty or if the values in the given columns match the given regular expressions. The parameter has the format COLNAME1=EXPRESSION1,COLNAME2=EXPRESSION2 where COLNAMEs are column names in table1 and EXPRESSIONs are regular expressions using Java syntax. For example, "col=a|b" includes rows where the column col has a value of "a" or "b". |
regexp2 | string | "" | Regular expression for the row filtering in table2. |
regexp3 | string | "" | Regular expression for the row filtering in table3. |
regexp4 | string | "" | Regular expression for the row filtering in table4. |
regexp5 | string | "" | Regular expression for the row filtering in table5. |
regexp6 | string | "" | Regular expression for the row filtering in table6. |
regexp7 | string | "" | Regular expression for the row filtering in table7. |
regexp8 | string | "" | Regular expression for the row filtering in table8. |
regexp9 | string | "" | Regular expression for the row filtering in table9. |
regexpArr | string | "" | Regular expression for the row filtering of array files. |
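
The COLNAME=EXPRESSION format of the regexp parameters can be illustrated with a short sketch. It is written in Python for brevity, whereas the component uses Java regular expression syntax; simple patterns such as `a|b` behave the same in both. The helper names `parse_filter` and `row_matches` are hypothetical. Two further assumptions: a value must match its expression in full, and expressions contain no commas, since the sketch splits the parameter naively on `,`.

```python
import re


def parse_filter(spec):
    """Split 'COL1=EXPR1,COL2=EXPR2' into a dict of compiled patterns."""
    patterns = {}
    if not spec:
        # An empty parameter means no filtering at all.
        return patterns
    # Assumption: expressions themselves contain no commas.
    for part in spec.split(","):
        column, _, expression = part.partition("=")
        patterns[column.strip()] = re.compile(expression)
    return patterns


def row_matches(row, patterns):
    """True if every filtered column fully matches its pattern."""
    # Assumption: whole-value matching; a missing column counts as "".
    return all(p.fullmatch(row.get(col, "")) for col, p in patterns.items())


rows = [{"col": "a"}, {"col": "b"}, {"col": "ab"}, {"col": "c"}]
patterns = parse_filter("col=a|b")
kept = [r for r in rows if row_matches(r, patterns)]
print(kept)
```

Here only the rows with `col` equal to `a` or `b` are kept. With an empty parameter, `parse_filter` returns no patterns and every row passes.
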
Test case | Parameters | IN table1 | IN table2 | IN table3 | IN table4 | IN table5 | IN table6 | IN table7 | IN table8 | IN table9 | IN array | OUT out |
---|---|---|---|---|---|---|---|---|---|---|---|---|
case1 | properties: columnIn=name, | table1 | (missing) | (missing) | (missing) | (missing) | (missing) | (missing) | (missing) | (missing) | (missing) | (missing) |
case2 | properties: columnIn=value,V,value, | table1 | table2 | table3 | (missing) | (missing) | (missing) | (missing) | (missing) | (missing) | (missing) | (missing) |
case3 | properties: columnInArray=tf1=value,tf2=V,tf3=value, | (missing) | (missing) | (missing) | (missing) | (missing) | (missing) | (missing) | (missing) | (missing) | array | (missing) |
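
The columnInArray value used in case3 pairs each array key with a column name. A minimal Python sketch of reading such a value is given below; the function name `parse_column_in_array` is hypothetical, and skipping the empty entry left by the trailing comma is an assumption about how that comma should be treated.

```python
def parse_column_in_array(spec):
    """Turn 'key1=col1,key2=col2' into a {key: column} mapping."""
    mapping = {}
    for part in spec.split(","):
        part = part.strip()
        if not part:
            # Assumption: empty entries (e.g. after a trailing comma) are skipped.
            continue
        key, _, column = part.partition("=")
        mapping[key.strip()] = column.strip()
    return mapping


mapping = parse_column_in_array("tf1=value,tf2=V,tf3=value,")
print(mapping)
```

For the case3 value this yields the column `value` for the keys `tf1` and `tf3`, and `V` for `tf2`.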