filter_endpoints.Rd
Filters the log based on a provided set of start and end activities.
    filter_endpoints(eventlog, start_activities, end_activities, percentage, reverse, ...)

    # S3 method for eventlog
    filter_endpoints(eventlog, start_activities = NULL, end_activities = NULL,
      percentage = NULL, reverse = FALSE, ...)

    # S3 method for grouped_eventlog
    filter_endpoints(eventlog, start_activities = NULL, end_activities = NULL,
      percentage = NULL, reverse = FALSE, ...)

    ifilter_endpoints(eventlog)
Argument | Description |
---|---|
eventlog | The dataset to be used. Should be a (grouped) eventlog object. |
start_activities | A vector of activity identifiers, or NULL. |
end_activities | A vector of activity identifiers, or NULL. |
percentage | A percentage p to be used as percentile cut-off, given as a fraction (e.g. 0.9 for 90%). When this is used, the most common endpoint pairs are selected until at least p% of the cases are covered. |
reverse | Logical, indicating whether the selection should be reversed. |
... | Deprecated arguments. |
When given an eventlog, it will return a filtered eventlog. When given a grouped eventlog, the filter will be applied in a stratified way (i.e. separately for each group). The returned eventlog will be grouped on the same variables as the original event log.
The filter_endpoints method filters cases based on their first and last activity labels. It can be used in two ways: by specifying vectors of allowed start activities and/or allowed end activities, or by specifying a percentile. In the latter case, the percentile value is used as a cut-off. For example, when set to 0.9, it selects the most common endpoint pairs which together cover at least 90% of the cases.
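
The percentile rule can be illustrated with a small base R sketch. This is not the package's implementation, only one way to read the rule stated above; the toy data frame `endpoints` and the helper `select_endpoint_pairs()` are hypothetical names introduced for illustration.

```r
# Hypothetical toy data: one row per case with its first and last activity.
endpoints <- data.frame(
  case_id = 1:10,
  first   = c("A", "A", "A", "A", "A", "A", "B", "B", "B", "C"),
  last    = c("Z", "Z", "Z", "Z", "Z", "Z", "Z", "Z", "Z", "Y"),
  stringsAsFactors = FALSE
)

# Hypothetical helper (an assumption, not part of any package): keep the most
# common endpoint pairs until their cumulative share of cases is at least
# `percentage`, then return the cases belonging to those pairs.
select_endpoint_pairs <- function(endpoints, percentage) {
  pair <- paste(endpoints$first, endpoints$last, sep = " -> ")
  # Frequency of each endpoint pair, most common first.
  freq <- sort(table(pair), decreasing = TRUE)
  # Cumulative share of cases covered by the top pairs.
  cum_share <- cumsum(as.numeric(freq)) / nrow(endpoints)
  # Index of the first pair at which the cut-off is reached.
  n_keep <- which(cum_share >= percentage)[1]
  keep <- names(freq)[seq_len(n_keep)]
  endpoints[pair %in% keep, , drop = FALSE]
}

selected <- select_endpoint_pairs(endpoints, percentage = 0.8)
unique(paste(selected$first, selected$last, sep = " -> "))
```

In this toy data the pair A -> Z covers 60% of the cases, so a cut-off of 0.8 also pulls in B -> Z, and the selection ends up covering 90% of the cases. This is why the rule is phrased as "at least" p%.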
eventlog
: Filter event log
grouped_eventlog
: Filter grouped event log, stratified by group
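
A brief usage sketch of the two modes described on this page follows. It assumes that the edeaR and eventdataR packages are installed and that the `patients` example log from eventdataR contains the activities "Registration" and "Check-out"; both are assumptions, not statements made on this page. The log is passed positionally, and the block is skipped when the packages are unavailable.

```r
# Assumption: edeaR and eventdataR are installed; otherwise nothing is run.
if (requireNamespace("edeaR", quietly = TRUE) &&
    requireNamespace("eventdataR", quietly = TRUE)) {
  library(edeaR)
  patients <- eventdataR::patients

  # Mode 1: keep cases that start with "Registration" and end with "Check-out"
  # (activity labels assumed to exist in this example log).
  complete <- filter_endpoints(patients,
                               start_activities = "Registration",
                               end_activities = "Check-out")

  # Same criteria with reverse = TRUE: keep the cases that do NOT match.
  incomplete <- filter_endpoints(patients,
                                 start_activities = "Registration",
                                 end_activities = "Check-out",
                                 reverse = TRUE)

  # Mode 2: keep the most common endpoint pairs that together cover
  # at least 90% of the cases.
  common <- filter_endpoints(patients, percentage = 0.9)
}
```

Each call returns an event log, so the results can be passed on to further filters or metrics.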