Compare multiple lists and return the intersection, difference, or union.
wrangles:
- compare.lists:
input:
- col1
- col2
output: Difference
method: difference
| → |
|
wrangles:
- compare.lists:
input:
- col1
- col2
output: Intersection
method: intersection
| → |
|
wrangles:
- compare.lists:
input:
- col1
- col2
output: Union
method: union
| → |
|
| Parameter | Required | Data Type | Notes |
|---|---|---|---|
| input | ✓ | list | List of input columns containing lists to compare. |
| output | ✓ | str | Name of the output column. |
| method | ✓ | str | Type of comparison to perform. |
| remove_duplicates | bool | Remove duplicates from the result. | |
| ignore_case | bool | Ignore case when comparing string items. |
Compare two strings and return the intersection or difference, or use overlap to find the matching characters between the two strings.
wrangles:
- compare.text:
input:
- col1
- col2
output: Difference
method: difference
| → |
|
wrangles:
- compare.text:
input:
- col1
- col2
output: Intersection
method: intersection
| → |
|
wrangles:
- compare.text:
input:
- Part Code1
- Part Code2
output: Overlap
method: overlap
| → |
|
| Parameter | Required | Data Type | Notes |
|---|---|---|---|
| input | ✓ | str, list | The columns to compare. First column is the base column |
| output | ✓ | str, list | The column to output the results to |
| method | str | The type of comparison to perform (difference, intersection, overlap), defaults to difference. | |
| char | str | split the strings on. Default is a space (difference and intersection) | |
| non_match_char | str | Character to use for non-matching characters (overlap) | |
| include_ratio | bool | Include the ratio of matching characters (overlap) | |
| decimal_places | number | Number of decimal places to round the ratio to (overlap) | |
| exact_match | str | Value to use for exact matches (overlap) | |
| empty_a | str | Value to use for empty input a (overlap) | |
| empty_b | str | Value to use for empty input b (overlap) | |
| all_empty | str | Value to use for both inputs (overlap) | |
| case_sensitive | bool | Whether the comparison is case sensitive. Default is True. |