Pawpaw breeding considerations

Richard · May 19, 2022, 11:48pm

This is an induction fallacy. The classic example is: A. Lincoln wore a beard, therefore all men wear beards.

An induction fallacy. In this case, correlation does not imply causality.

Yes you can, here it is: Pomper’s SSR data.
The zeroes are missing values. Only 3 of the tested cultivars have a full set. The number of markers (2x5) is grossly less than the number of specimens (41), consequently it is mathematically under-determined and any analysis will be highly speculative. Further, Pomper et al did not use a mathematically sound method (a metric) to compute “distances” between cultivar markers. Finally, they used a faulty clustering software (Camen and Sokal pair grouping) to create cultivar groupings - and then unjustifiably split one of the groups. So the results of Pomper et al cannot be viably compared to any other study - regardless of any similarities that exist.

jrd51 · May 20, 2022, 12:05am

@Richard –

It’s not induction fallacy. I’m not saying that my conclusion is the only one possible, only that it seems the most likely. Take a Bayesian approach. Whatever your priors, the likelihood that annonacin is inherited is dramatically higher if clusters of related varieties have similar levels. Give me an alternate explanation.
I’m not relying on Pomper’s genetic data, rather on his data re Brine Shrimp Mortality (aka pawpaw lethality). Why is that not OK?
I read the Peterson article that you suggested. He says that NC-1 was reportedly a product of Davis (female) x Overleese (male), hand pollinated. He says that genetic data suggests differently. The genetic data comes from Pomper, who you tell me not to trust. Really? And your own analysis shows that NC-1 is closely related to Overleese. So even if Overleese is not the male parent, it seems closely related to the male parent. All my other arguments stand.

Richard · May 20, 2022, 12:24am

Yes, that is clear.

Jujube · May 20, 2022, 1:18am

There is a more constructive and informative way to have a scientific conversation. Assumptions are a potential starting point and are present in all research.

Richard · May 20, 2022, 1:18am

This graph concerns distances (number of genetic marker mismatches) between Overleese and other cultivars tested by H. Huang. Values from 2 to 7 are considered close, from 8 to 11 are moderately close. Keep in mind the measurements are coarse - so they might only be accurate to +/- 2.

At the top right is “1-23”, whose only documented parent is C. Davis’s Taylor.
At 3 o’clock is C. Davis’s “Prolific” of unknown parentage.
Notice also the cultivars from J. Gordon who is known to have Overleese but also Zimmerman: PA-Golden, SAA-Zimmerman-1 and -2. The latter are crosses of Sweet Alice with Zimmerman. Unfortunately, H. Huang did not test Zimmerman or Davis.

Jujube · May 20, 2022, 1:22am

Not having read the paper how did they decide on the genetic markers to select?

Richard · May 20, 2022, 1:29am

@Jujube
H. Huang conducted 4 years of research into RAPD markers for Pawpaw: RAPD Inheritance and Diversity in Pawpaw (2000). The testing was performed in the following year: Molecular Characterization of Cultivated Pawpaw (2003).

Pomper et al did zero research. Instead they contracted with Genetic Identification Services of Chatsworth CA for selection of markers and production of marker data.

Jujube · May 20, 2022, 1:35am

Interesting but I personally wouldn’t be so quick to dismiss a lab for contracting out some experimentation.

Regardless this is interesting for ancestry purposes but phenotypic relation overlayed on this would really boost this researches reach

Richard · May 20, 2022, 1:50am

They did not know enough to reject the results.

Unfortunately the lab used by H. Huang only returned 49 of his markers free from error. I have re-analyzed that data and produced what I believe is a bias-free set of 45 markers - to be published by IJCSA next month.

Next I plan to apply all of H. Huang’s markers to the 100 available cultivars in circulation and incorporate morphology as well. At current pricing this would run about $40k for error-free results.

Richard · May 20, 2022, 3:35am

Here is another look at Sweet Alice. Values from 2 to 7 are considered close, from 8 to 11 are moderately close, 12 to 13 are sub-average. Keep in mind the measurements are coarse - so they might only be accurate to +/- 2.

Of interest here is the isolation of Sweet Alice. Only Overleese and Sunflower can be considered independent finds and the remainder are due to ancestral parentage.

Jujube · May 23, 2022, 5:45pm

Again interesting for ancestry purposes but how the markers correlate to phenotype would be even more interesting. Additionally the trends in which markers are shared or not may be informative if patterns emerge.

Additionally, I suggest adjusting the node distances to be proportional to the number of mismatches (more fibonacci looking) , make lines gray, and bold “Sweet Alice” to improve figure readability. Just a suggestion

Richard · May 23, 2022, 7:09pm

I agree. That is the purpose of this study.

It is a topological graph extracted from a 45-dimensional space. Note that the displayed rotational sequence of neighbors is arbitrary and likely has no relation to the orientation of any 2D projection.

I agree.

Richard · May 28, 2022, 8:38am

I’ve now worked out the primary ancestry group for each of the cultivars tested by H. Huang. Here’s a short table of how they compare with the questionable clades published by KSU.

[Updated 6/2/2022]

Cultivar	Year introduced	Primary association	KSU clade
Overleese	1950	A	V
NC-1	1976	A	V
Prolific	1985	A	II
Rappahannock	1990	A	III
Shenandoah	1990	A	V
Susquehanna	1990	A	II
Potomac	1994	A	III
Sweet Alice	1945	B	III
Taylor	1968	C	I
Taytwo	1968	C	V
Sunflower	1970	D	V
8-20	1994	D	II
Wabash	1994	D	III
Rebecca’s Gold	1974	E	V
Mitchell	1979	F	V
PA-Golden	1986	F	untested
Wells	1990	F	IV

Richard · June 3, 2022, 4:10am

Here’s a list of cultivars I believe should be in any pawpaw breeding program, plus any advanced specimens you wish to fine-tune:

Middletown
Sweet Alice
Taylor
Sunflower
Rebecca’s Gold
Mitchell

TrilobaTracker · June 3, 2022, 2:20pm

As a general observation only, I think some of those are hard to find readily as an end consumer.
They exist perhaps mainly at repositories and collectors’ orchards.
Though as an academic/research endeavor this would probably not matter.

Richard · June 3, 2022, 4:00pm

Each were offered at some point in the last 12 months at one or more retail nurseries.

Frost, R. Diversity of Pawpaw (Asimina triloba) cultivars in USDA repositories and selected retail nurseries c. 2022 (in preparation).

disc4tw · September 25, 2022, 1:30am

Richard, what is the reasoning behind your suggestion for including these varieties? Is it primarily based on genetic distance between each of them to improve the gene pool?

To me, the idea of searching for a new cultivar focuses on traits similar to what Neal Peterson was seeking- high flesh to seed ratio, low seed count, etc.

I would also consider freestone, the trait in Susquehanna limiting seeds full formation, and Allegheny’s high productivity to be important along with selecting for low acetogenins. Obviously there is a laundry list of other traits that could be selected for breeding but some are more important to different growers.

Richard · September 25, 2022, 3:55am

Yes.

Richard · September 25, 2022, 3:57am

For example, understanding what constitutes an induction fallacy.

Richard · September 25, 2022, 3:59am

Accepted for publication, January 2023.