www.seattlecoffeegear.com: let us help you find the right gear

TGP II: Brainstorming Session

Postby another_jim on Fri Oct 15, 2010 10:34 am

The grind samples are done, and Kendall will be running the analyses towards the end of next week. I thought in the meantime we could discuss what sort of things we should look for in the data (settling this before the data is produced allows for some types of testing that are precluded when one peeks at the data first)

Here's the breakdown of the 20 pairs of samples we received

----------------------------
GRINDER        HOPPER SINGLE
----------------------------
Ascaso I2..........1........           
Baratza Vario......2......1.
Cimbali Junior.....1......1.
Compak K10.........2......3.
Lelit PL53.........1........
Macap MXK.................1.
Macap M7K..........1......1.
Mazzer Kony........1......1.
Mazzer Robur.......2*.......
Versalab M3...............1.
----------------------------
* Both pairs from the same grinder


The bad news is that we are under-represented on conventional flat burr grinders; the good news is we have lots of duplication on some grinders and lots of hopper/no hopper sets.

Here's my rough take, please contradict or add -- the better we understand the issues the better we can analyze the data:

We will be able to nail the question whether hopper grinds differ from single dose grinds at the same dose and flow and how they differ.

We will be able to nail the question whether grind samples from the same grinder prepared at different homes can be adequately standardized by standardizing coffee, dose, basket and flow.

We may not be able to learn much about the greater consistency of large conicals; that is, whether there is a difference in how conicals change grind with how flats do it.

You can grind "finer" in two ways:
  • by producing more fines to restrict the flow without changing the size or way the coarse particles extract;
  • or by making the coarse particles smaller, and so both restricting the flow and increasing the extraction rate.
  • the forgiveness factor of conicals may be because they work by mostly increasing fines, leaving extraction rates more constant, so that this can be controlled more simply by the length of the shot
We'll have lucked out if the Vario, Cimbali and Versalab samples fall onto one group, and the conicals into another; otherwise, we may just end up with hints about the consistency factor.
User avatar
another_jim
Team HB
 
Posts: 7489
Joined: May 05, 2005
Location: Chicago

Postby TrlstanC on Fri Oct 15, 2010 11:56 am

Looking at the range of grinders (we ended up with a wide variety of conicals of different sizes and a hybrid) I'm really interested to see if there's a difference in the ratio of fines to coarse particles between the different types of burr sets.

For example, I'm thinking that the conicals might be more consistent in making the coarse particles the same size (the distribution for that size will be very small) and that they will have a relatively larger ratio of coarse to fine particles (there will be one big spike and one small spike, both narrow). Where the flats will produce a wider range of sizes of both coarse and fines and produce relatively more fines (there will be two spikes of similar size, and they'll be wider).

I'm also thinking that we might see a pattern in particle distributions that goes from big conicals -> small conicals -> flats with the hybrid flat/conical sample falling somewhere in that range. It's too bad we don't have more variation in the sizes of the flat burrs too see how size affects distribution there.

It would also be interesting to know the grind speed or RPMs of the different grinders so we can try to see if that has any impact?
User avatar
TrlstanC
 
Posts: 346
Joined: Jan 14, 2008
Location: Somerville, MA

Postby another_jim on Fri Oct 15, 2010 1:20 pm

That would be cool very ; but there's a but.

To compare the overall shape, you have to compare single samples. We might be able to do that; if all the Vario and K10 samples submitted by different people are really close, the differences between different brands of grinders will be meaningful. If the output of the same grinder from different sites is as different as the outputs of different brands of grinders, only the paired sample data will be meaningful; and that just tells us how the grind changes.

But we should do the quality control test. If the raw distributions from the same brand of grinder are really close, we have much higher quality data than we expected, and can look at the different grinders directly.
User avatar
another_jim
Team HB
 
Posts: 7489
Joined: May 05, 2005
Location: Chicago

Postby TrlstanC on Fri Oct 15, 2010 1:41 pm

Yes, lets keep our fingers crossed that there's some similarity between the same grinders used at different sites. There's not enough pairs for a really strong statistical statement if there's much noise, but if the Vario and K10 samples are all much more similar to each other then other grinders (for all we know now all the grinders could all come out looking the same at 15g) then it will open up some interesting inter-grinder comparisons.

Another thought I had is comparing the largest particle size as a way of comparing grinder settings, either between two of the same grinder (right now a 10 on one vario isn't likely to be the same as a 10 on a different one, and so isn't really a useful piece of info) or even between different brands of grinders. For example if it turns out that the biggest particle from all the K10s (or better yet, all the grinders) at the 15g dose is 230 µm and 210 µm at the 18g dose then that's a way to accurately record and share the grinder setting. Of course in practice it would require a bunch of expensive sieves to measure at home, so not really practical right now, but it would be nice to know it's an option.
User avatar
TrlstanC
 
Posts: 346
Joined: Jan 14, 2008
Location: Somerville, MA

Postby wookie on Sat Oct 16, 2010 1:46 pm

another_jim wrote:I thought in the meantime we could discuss what sort of things we should look for in the data (settling this before the data is produced allows for some types of testing that are precluded when one peeks at the data first)

I'm not sure what else we could look for in this data set. Tristan had some good ideas, but I believe that machine to machine comparisons will only be possible if the data turns out to be robust. Is there statistical merit to testing for expected results in advance? For example if it is presumed that the 15g doses will contain less coarse particles than the 18g doses?

Agreed that the data will be strongest for the cases that you've already framed. We should see some trends that speak to filled hopper vs. single dose grinding. And whether grind settings are consistent across the same model in different homes.

It will be interesting to see what "finer" actually represents when looking at 18g vs 15g doses. There is some evidence, for bulk grinders at least (Walsh, Marco) that "finer" is a combination of more fines and less coarse particles.
.
wookie
 
Posts: 249
Joined: Jul 25, 2005
Location: PNW

Postby akallio on Tue Oct 19, 2010 3:13 pm

wookie wrote:Is there statistical merit to testing for expected results in advance?


I'm not sure if I understood your question correctly, but you should decide what you test before seeing the data. Especially now that you have a small number of samples and a lot of questions to ask. 20 pairs of samples from 10 grinders will contain many clear patterns that are produced by coincidence, and eyeballing the data allows you to first pick them and next test them (and possibly get good, but false, p-values or similar).
akallio
 
Posts: 225
Joined: Feb 03, 2009
Location: Helsinki, Finland

Postby another_jim on Tue Oct 19, 2010 4:01 pm

Aleksi is right. If you test for patterns in the same data in which you've discovered them, success is both guaranteed and meaningless. It's equally meaningless to test for thousands of patterns and then declare ten or so significant at the 1% level. "Reanalyses" or "meta-analysis" of social survey or epidemiological data frequently produce pseudo-discoveries of this sort, since people forget to modify their significance levels to account for all the testing done by previous researchers.

The proper practice is to take a subset of the data, look for prominent patterns in it, then test the rest of the data to see if those patterns indeed hold. We have too few samples to do that; so we need to decide ahead of time what to test.

I'm looking closely at the samples previously produced by Kendall and John to get ideas.
User avatar
another_jim
Team HB
 
Posts: 7489
Joined: May 05, 2005
Location: Chicago

Postby jbell on Tue Oct 19, 2010 8:45 pm

Being more of a visual thinker, I'll just throw this out there.

I'd be interested in seeing some bar graphs that display percentage of total grinds vs particle size.
Maybe sample 100 individual grinds out of each batch- then plot the size...
Something like 10% of the grinds were between x.x microns, 30% between x.x, 30% were x.x, etc.

Heck, if i can get a copy of the data after Kendall is done, I'll put some together if I can.
I like graphs. :lol:
jbell
 
Posts: 107
Joined: Mar 17, 2010
Location: East Coast

Postby kmills on Tue Oct 19, 2010 9:19 pm

Hey jbell, what you describe is exactly how the machine works only right now its set to do a line graph. The Machine plots frequency (number of particles at that size) against the log of their size. The log part just makes the graph readable, the important information would look really compressed on a normal scale that ranged from 10nm to 3mm, which is what the machine is capable of. There should be no issue of making the data public, just not a very interesting read.
kmills
 
Posts: 98
Joined: Dec 07, 2009
Location: New Jersey

Postby jbell on Tue Oct 19, 2010 9:28 pm

Gotcha. I wasn't sure what type of equipment you had, but imagined it would (or should) work in that way.
jbell
 
Posts: 107
Joined: Mar 17, 2010
Location: East Coast

Next

Return to Grinders