Filters the log based on a provided set of start and end activities

filter_endpoints(eventlog, start_activities, end_activities, percentage,
  reverse, ...)

# S3 method for eventlog
filter_endpoints(eventlog, start_activities = NULL,
  end_activities = NULL, percentage = NULL, reverse = FALSE, ...)

# S3 method for grouped_eventlog
filter_endpoints(eventlog, start_activities = NULL,
  end_activities = NULL, percentage = NULL, reverse = FALSE, ...)

ifilter_endpoints(eventlog)

Arguments

eventlog

The dataset to be used. Should be a (grouped) eventlog object.

start_activities

A vector of activity identifiers allowed as the first activity of a case, or NULL to leave the start activity unrestricted.

end_activities

A vector of activity identifiers allowed as the last activity of a case, or NULL to leave the end activity unrestricted.

percentage

A percentile cut-off p, given as a fraction between 0 and 1 (for example 0.9). When this is used, the most common endpoint pairs will be selected until at least a fraction p of the cases is covered.

reverse

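Logical, indicating whether the selection should be reversed. If TRUE, the cases that would otherwise be selected are dropped and the remaining cases are returned.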

...

Deprecated arguments.
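
The arguments above can be combined as in the following sketch. It assumes that the edeaR package (which provides filter_endpoints) and the eventdataR package are installed, and that the patients example log from eventdataR contains the activities "Registration" and "Check-out". Those activity names are assumptions for illustration and should be replaced with labels from your own log.

```r
library(edeaR)      # provides filter_endpoints()
library(eventdataR) # assumed source of the `patients` example event log

# Keep only cases that start with "Registration" and end with "Check-out".
complete_cases <- filter_endpoints(
  patients,
  start_activities = "Registration",
  end_activities   = "Check-out"
)

# Reverse the selection: keep the cases that do NOT match the endpoints above.
incomplete_cases <- filter_endpoints(
  patients,
  start_activities = "Registration",
  end_activities   = "Check-out",
  reverse = TRUE
)

# Percentile variant: keep the most common endpoint pairs that together
# cover at least 90% of the cases.
frequent_endpoints <- filter_endpoints(patients, percentage = 0.9)
```

The log is passed positionally here, so the sketch does not depend on the name of the first argument.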

Value

When given an eventlog, it will return a filtered eventlog. When given a grouped eventlog, the filter will be applied in a stratified way (i.e. separately for each group). The returned eventlog will be grouped on the same variables as the original event log.

Details

The filter_endpoints method filters cases based on their first and last activity labels. It can be used in two ways: by specifying vectors of allowed start activities and/or allowed end activities, or by specifying a percentile. In the latter case, the percentile value is used as a cut-off. For example, when set to 0.9, it will select the most common endpoint pairs which together cover at least 90% of the cases.
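
The percentile cut-off can be illustrated with a minimal base R sketch. This is not the package's implementation: the helper select_endpoint_cases and the toy data frame are hypothetical, events are assumed to be ordered in time within each case, and ties between equally frequent endpoint pairs may be handled differently by the package.

```r
# Toy event data: one row per event, ordered in time within each case.
events <- data.frame(
  case     = c(1, 1, 2, 2, 3, 3, 4, 4, 5, 5),
  activity = c("A", "B", "A", "B", "A", "B", "A", "C", "D", "B"),
  stringsAsFactors = FALSE
)

# Hypothetical helper (not part of the package): return the ids of the cases
# whose endpoint pair is among the most common pairs that together cover
# at least `percentage` of all cases.
select_endpoint_cases <- function(events, percentage) {
  # First and last activity of every case.
  first <- tapply(events$activity, events$case, function(x) x[1])
  last  <- tapply(events$activity, events$case, function(x) x[length(x)])

  # One endpoint pair per case, named by case id.
  pairs <- paste(first, last, sep = " -> ")
  names(pairs) <- names(first)

  # Rank pairs by frequency and accumulate the share of cases they cover.
  freq     <- sort(table(pairs), decreasing = TRUE)
  coverage <- cumsum(as.numeric(freq)) / length(pairs)

  # Keep pairs up to and including the first one that reaches the cut-off.
  n_keep     <- which(coverage >= percentage)[1]
  kept_pairs <- names(freq)[seq_len(n_keep)]

  names(pairs)[pairs %in% kept_pairs]
}

# "A -> B" occurs in 3 of 5 cases (60%), which already reaches a 50% cut-off.
selected <- select_endpoint_cases(events, percentage = 0.5)
selected
# [1] "1" "2" "3"
```

Raising the cut-off in this sketch pulls in less frequent endpoint pairs until the requested share of cases is reached.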

Methods (by class)

  • eventlog: Filter event log

  • grouped_eventlog: Filter grouped event log in a stratified way

See also