Skip to main content
Solved

Identifying regressed features

  • December 1, 2020
  • 5 replies
  • 29 views

Forum|alt.badge.img

Hi There,

 

I'm having some trouble of figuring out an approach to detect features that have "regressed".

My data looks something like this:

<Fruit> <Batch>

Orange, 1

Apple, 1

Pear, 1

Orange, 2

Apple, 2

Pear, 2

Orange, 3

Pear, 3

 

So look at this as Batch 1 had all three fruit, batch 2 had all three fruit, but in batch 3, Apple is missing.

What I am interested in is knowing when a fruit went missing, specifically in a sequential view.

Meaning, I don't really care about the relationship between batch 1 and 3, but I want to know that between batch 2 and 3, Apple went missing. So even if I have 10 batches, I only really care if I have "pineapple" in batch 8, but missing in 9.

 

I hope my obscure example makes sense and please let me know in case any additional information is needed.

 

Thanks a lot!

 

Best answer by nielsgerrits

One way to do this is to assign a BatchCombination to all rows and check Fruit and BatchCombination with a ChangeDetector. If you have batches 1,2 and 3, you want to check BatchCombination 1,2 and BatchCombination 2,3.

FruitBatchChangeDetectionProbably faster with Python and lists, but this is one way to do it.

This post is closed to further activity.
It may be an old question, an answered question, an implemented idea, or a notification-only post.
Please check post dates before relying on any information in a question or answer.
For follow-up or related questions, please post a new question or idea.
If there is a genuine update to be made, please contact us and request that the post is reopened.

5 replies

nielsgerrits
VIP
Forum|alt.badge.img+62
  • Best Answer
  • December 1, 2020

One way to do this is to assign a BatchCombination to all rows and check Fruit and BatchCombination with a ChangeDetector. If you have batches 1,2 and 3, you want to check BatchCombination 1,2 and BatchCombination 2,3.

FruitBatchChangeDetectionProbably faster with Python and lists, but this is one way to do it.


ebygomm
Influencer
Forum|alt.badge.img+46
  • Influencer
  • December 1, 2020

Do you always know the number of batches? If so you should be able to sort then use adjacent attribute mapping in an AttributeCreator to get something like this (it also assumes your batch is always an integer)

CaptureCapture 


alexlynch3450
Contributor
Forum|alt.badge.img+16
  • Contributor
  • December 1, 2020

Do you know how many or max amount of items that are to be expected? Will the first batch always contain the max ingredients?


alexlynch3450
Contributor
Forum|alt.badge.img+16
  • Contributor
  • December 1, 2020

Do you always know the number of batches? If so you should be able to sort then use adjacent attribute mapping in an AttributeCreator to get something like this (it also assumes your batch is always an integer)

CaptureCapture 

you could run max & min on batch field using StatisticCalculator to determine batch count.


Forum|alt.badge.img

One way to do this is to assign a BatchCombination to all rows and check Fruit and BatchCombination with a ChangeDetector. If you have batches 1,2 and 3, you want to check BatchCombination 1,2 and BatchCombination 2,3.

FruitBatchChangeDetectionProbably faster with Python and lists, but this is one way to do it.

Thanks a million, this was indeed exactly what Iw as looking for.