Filter

Filter provides conditional processing that either executes a processor or passes data through unchanged based on a predicate function.

Overview

Filter creates a branch in your pipeline where processing is optional based on runtime conditions. Unlike Switch which routes to different processors, Filter either processes or skips. Unlike Mutate which only supports safe transformations, Filter can execute any Chainable including ones that may error.

filter := pipz.NewFilter(identity, condition, processor)

When to Use

Use Filter when:

Conditional processing needed (feature flags, A/B testing)
Skip expensive operations based on data state
Apply business rules to subset of data
Different users need different processing paths
You want clean separation of condition and logic
Performance optimization through selective processing

When NOT to Use

Don't use Filter when:

All data needs the same processing (just use the processor directly)
You need multiple branches (use Switch instead)
The condition is better expressed in the processor itself
You're just transforming conditionally (use Mutate for simpler cases)

Basic Usage

// Define identities upfront
var (
    BetaFeatureID      = pipz.NewIdentity("beta-feature", "Apply new algorithm for beta users with feature flag")
    PremiumValidID     = pipz.NewIdentity("premium-validation", "Perform enhanced validation for premium customers")
    PremiumChecksID    = pipz.NewIdentity("premium-checks", "Premium customer validation checks")
)

// Feature flag example
betaFeature := pipz.NewFilter(
    BetaFeatureID,
    func(ctx context.Context, user User) bool {
        return user.BetaEnabled && isFeatureEnabled(ctx, "new-algorithm")
    },
    newAlgorithmProcessor,
)

// Conditional validation
validatePremium := pipz.NewFilter(
    PremiumValidID,
    func(ctx context.Context, order Order) bool {
        return order.CustomerTier == "premium"
    },
    pipz.NewSequence(
        PremiumChecksID,
        validateCreditLimit,
        checkFraudScore,
        verifyIdentity,
    ),
)

Condition Function

The condition function determines whether processing should occur:

func(context.Context, T) bool

Returns true: Execute the processor
Returns false: Pass data through unchanged
Context aware: Can use context for timeouts, values, cancellation
Pure function: Should not have side effects

Condition Examples

// Simple data check
func(ctx context.Context, order Order) bool {
    return order.Amount > 1000
}

// Feature flag with context
func(ctx context.Context, user User) bool {
    return user.BetaEnabled && 
           featureFlags.IsEnabled(ctx, "experimental-feature")
}

// Time-based condition
func(ctx context.Context, data Data) bool {
    return time.Now().Hour() >= 9 && time.Now().Hour() < 17 // Business hours
}

// Complex business logic
func(ctx context.Context, payment Payment) bool {
    return payment.Method == "crypto" && 
           payment.Amount > 10000 &&
           payment.Customer.RiskScore < 0.3
}

Processor

Any Chainable can be used as the processor:

// Define identities upfront
var (
    DoubleID      = pipz.NewIdentity("double", "Double the value")
    ValidateID    = pipz.NewIdentity("validate", "Validate data")
    ComplexID     = pipz.NewIdentity("complex", "Validate, enrich, and transform data")
    ConditionalID = pipz.NewIdentity("conditional", "Conditionally apply complex flow")
)

// Simple processor
processor := pipz.Transform(DoubleID, func(ctx context.Context, n int) int {
    return n * 2
})

// Error-prone processor
validator := pipz.Apply(ValidateID, func(ctx context.Context, data Data) (Data, error) {
    return validateData(data)
})

// Complex pipeline
complexFlow := pipz.NewSequence(
    ComplexID,
    validate,
    enrich,
    transform,
)

filter := pipz.NewFilter(ConditionalID, condition, complexFlow)

Dynamic Behavior

Filter supports runtime updates for dynamic behavior:

// Define identity upfront
var DynamicID = pipz.NewIdentity("dynamic", "Filter with dynamic condition and processor")

filter := pipz.NewFilter(
    DynamicID,
    initialCondition,
    initialProcessor,
)

// Update condition at runtime
filter.SetCondition(func(ctx context.Context, data Data) bool {
    // New condition logic
    return data.Version >= 2
})

// Update processor at runtime
filter.SetProcessor(newProcessor)

// Access current values
currentCondition := filter.Condition()
currentProcessor := filter.Processor()

Error Handling

When the processor returns an error, Filter prepends its name to the error path:

// Define identities upfront
var (
    PaymentFilterID = pipz.NewIdentity("payment-filter", "Validate high-value payments over $100")
    ValidateID      = pipz.NewIdentity("validate", "Validate payment")
)

filter := pipz.NewFilter(
    PaymentFilterID,
    func(ctx context.Context, p Payment) bool { return p.Amount > 100 },
    pipz.Apply(ValidateID, failingValidator),
)

result, err := filter.Process(ctx, payment)
if err != nil {
    // err.Path will be ["payment-filter", "validate"]
    fmt.Printf("Failed at: %v\n", err.Path)
}

Thread Safety

Filter is thread-safe and can be safely used in concurrent scenarios:

// Define identity upfront
var ConcurrentSafeID = pipz.NewIdentity("concurrent-safe", "Thread-safe conditional processor")

filter := pipz.NewFilter(
    ConcurrentSafeID,
    condition,
    processor,
)

// Safe to call from multiple goroutines
go func() { filter.Process(ctx, data1) }()
go func() { filter.Process(ctx, data2) }()

// Safe to update from other goroutines
go func() { filter.SetCondition(newCondition) }()

Performance Characteristics

Filter has minimal overhead:

Condition false: ~5ns with zero allocations
Condition true: Processor overhead + ~10ns
No reflection: Direct function calls
Memory efficient: No intermediate allocations

Common Patterns

Feature Flag Processing

type FeatureFlags struct {
    flags map[string]bool
    mu    sync.RWMutex
}

func (f *FeatureFlags) IsEnabled(flag string) bool {
    f.mu.RLock()
    defer f.mu.RUnlock()
    return f.flags[flag]
}

// Define identity upfront
var FeatureGateID = pipz.NewIdentity("feature-gate", "Gate new feature for beta users with feature flag")

// Create feature flag filter
featureFilter := pipz.NewFilter(
    FeatureGateID,
    func(ctx context.Context, user User) bool {
        return user.BetaEnabled && flags.IsEnabled("new-feature")
    },
    newFeatureProcessor,
)

Conditional Enrichment

// Define identities upfront
var (
    EnrichPremiumID     = pipz.NewIdentity("enrich-premium", "Enrich premium and enterprise customers with additional data")
    PremiumEnrichmentID = pipz.NewIdentity("premium-enrichment", "Add personalized offers, loyalty points, and priority support")
)

// Only enrich premium customers
enrichPremium := pipz.NewFilter(
    EnrichPremiumID,
    func(ctx context.Context, customer Customer) bool {
        return customer.Tier == "premium" || customer.Tier == "enterprise"
    },
    pipz.NewSequence(
        PremiumEnrichmentID,
        addPersonalizedOffers,
        calculateLoyaltyPoints,
        addPrioritySupport,
    ),
)

Performance Optimization

// Define identity upfront
var CacheCheckID = pipz.NewIdentity("cache-check", "Skip expensive processing if data is cached")

// Skip expensive processing for cached data
skipIfCached := pipz.NewFilter(
    CacheCheckID,
    func(ctx context.Context, request Request) bool {
        _, exists := cache.Get(request.CacheKey())
        return !exists // Only process if not cached
    },
    expensiveProcessor,
)

Time-Based Processing

// Define identity upfront
var BusinessHoursID = pipz.NewIdentity("business-hours", "Process only during weekday business hours (9am-5pm)")

// Only process during business hours
businessHours := pipz.NewFilter(
    BusinessHoursID,
    func(ctx context.Context, task Task) bool {
        now := time.Now()
        hour := now.Hour()
        weekday := now.Weekday()

        return weekday >= time.Monday &&
               weekday <= time.Friday &&
               hour >= 9 &&
               hour < 17
    },
    businessProcessor,
)

Filter vs Other Connectors

Filter vs Switch

Filter: Execute or skip (binary choice)
Switch: Route to different processors (multiple choices)

// Define identities upfront
var (
    OptionalID = pipz.NewIdentity("optional", "Conditionally apply processor")
    RouterID   = pipz.NewIdentity("router", "Route to different processors")
)

// Filter: Optional processing
filter := pipz.NewFilter(OptionalID, condition, processor)

// Switch: Alternative processing
router := pipz.NewSwitch(RouterID, routingFunction)
router.AddRoute("path-a", processorA)
router.AddRoute("path-b", processorB)

Filter vs Mutate

Filter: Can use any Chainable, including error-prone ones
Mutate: Only safe transformations (no errors)

// Define identities upfront
var (
    ValidateIfNeededID = pipz.NewIdentity("validate-if-needed", "Conditionally validate data")
    ModifyIfNeededID   = pipz.NewIdentity("modify-if-needed", "Conditionally transform data")
)

// Filter: Can fail
filter := pipz.NewFilter(ValidateIfNeededID, condition, validator)

// Mutate: Cannot fail
mutate := pipz.Mutate(ModifyIfNeededID, transformer, condition)

Filter vs Conditional Logic

// Define identities upfront
var (
    MixedLogicID      = pipz.NewIdentity("mixed-logic", "Conditionally apply expensive operation")
    CleanSeparationID = pipz.NewIdentity("clean-separation", "Separate condition from expensive operation")
    ExpensiveID       = pipz.NewIdentity("expensive", "Expensive operation")
)

// Instead of embedding conditions
processor := pipz.Apply(MixedLogicID, func(ctx context.Context, data Data) (Data, error) {
    if shouldProcess(data) {
        return expensiveOperation(ctx, data)
    }
    return data, nil
})

// Use Filter for cleaner separation
filter := pipz.NewFilter(
    CleanSeparationID,
    shouldProcess,
    pipz.Apply(ExpensiveID, expensiveOperation),
)

Testing

Test Filter by verifying both condition paths:

func TestFilter(t *testing.T) {
    // Define identities upfront
    var (
        DoubleID   = pipz.NewIdentity("double", "Double the value")
        EvenOnlyID = pipz.NewIdentity("even-only", "Double only even numbers")
    )

    processor := pipz.Transform(DoubleID, func(ctx context.Context, n int) int {
        return n * 2
    })

    filter := pipz.NewFilter(
        EvenOnlyID,
        func(ctx context.Context, n int) bool { return n%2 == 0 },
        processor,
    )

    // Test condition true
    result, err := filter.Process(context.Background(), 4)
    assert.NoError(t, err)
    assert.Equal(t, 8, result) // 4 * 2

    // Test condition false
    result, err = filter.Process(context.Background(), 3)
    assert.NoError(t, err)
    assert.Equal(t, 3, result) // unchanged
}

Gotchas

❌ Don't have side effects in conditions

// Define identity upfront
var BadID = pipz.NewIdentity("bad", "Filter with side effects in condition")

// WRONG - Condition modifies state
filter := pipz.NewFilter(
    BadID,
    func(ctx context.Context, data Data) bool {
        counter++ // Side effect!
        log.Println("Checking...") // Side effect!
        return data.Important
    },
    processor,
)

✅ Keep conditions pure

// Define identity upfront
var GoodID = pipz.NewIdentity("good", "Filter with pure condition")

// RIGHT - Pure condition function
filter := pipz.NewFilter(
    GoodID,
    func(ctx context.Context, data Data) bool {
        return data.Important
    },
    processor,
)

❌ Don't use for simple true/false transforms

// Define identities upfront
var (
    OverkillID = pipz.NewIdentity("overkill", "Absolute value for positive numbers only")
    AbsID      = pipz.NewIdentity("abs", "Calculate absolute value")
)

// WRONG - Overkill for simple conditional
filter := pipz.NewFilter(
    OverkillID,
    func(ctx context.Context, n int) bool { return n > 0 },
    pipz.Transform(AbsID, math.Abs),
)

✅ Use Mutate for simple conditional transforms

// Define identity upfront
var AbsIfNegativeID = pipz.NewIdentity("abs-if-negative", "Negate negative numbers")

// RIGHT - Simpler with Mutate
mutate := pipz.Mutate(
    AbsIfNegativeID,
    func(ctx context.Context, n int) int { return -n },
    func(ctx context.Context, n int) bool { return n < 0 },
)

Best Practices

Keep conditions simple: Complex logic makes debugging difficult
Avoid side effects in conditions: Conditions should be pure functions
Use descriptive names: Names appear in error paths
Test both paths: Verify condition true and false scenarios
Consider caching: For expensive condition calculations
Use context: Leverage context for timeouts and values
Document behavior: Make condition logic clear to other developers
Monitor pass rates: Use metrics to understand filter effectiveness