Pawpaw breeding considerations

jrd51 · May 19, 2022, 6:37pm

You’re right that there is no direct measurement of annonacin. In this paper, for example, Brine Shrimp Mortality is used as a marker for annonacin content. From Table 3, it appears that it takes roughly 10 times as much pawpaw pulp from Zimmerman, Wells, and Sunflower to attain the same lethality as Middletown, Overleese, and NC-1.

You’re also right that these relative measures do not help us understand whether the absolute levels are safe. But if (a) we agree that annonacin is a risk, and (2) we still hope to consume pawpaws, then it seems we should prefer varieties from the relatively low groups.

Finally – as a “pawpaw breeding consideration” – your genetic clusters seem correlated with annonacin levels. It would seem that a breeder who wants to produce a low-annonancin variety should use parent plants from the low groups (i.e., 2 & 4).

Richard · May 19, 2022, 7:48pm

You do not know if the levels are safe and thus your conclusion is false. It could be true that all of them, some of them, or none of them are safe.

You are assuming:

levels can be assigned to the ancestry groups
annonacin concentration is an inherited trait
Pomper’s data is viable

If you were to apply the same concerns to other fruits, then there are quite a few you’d avoid e.g. Persimmons.

Richard · May 19, 2022, 9:26pm

On the topic of historical breeding, here is a look at distances (number of genetic marker mismatches) between Middletown, Overleese, and their close neighbors. Keep in mind the measurements are coarse - so they might only be accurate to +/- 2. The layout of cultivars in the image is arbitrary and the lines are (obviously) not to scale. It is a topological graph.

Notice the connection of Middletown to NC-1. From this I speculate that C. Davis used Middletown in the breeding of his “Davis” cultivar, which is a known parent of NC-1.

jrd51 · May 19, 2022, 10:14pm

Yes, it is an assumption. But the existence of a pattern seems to support this assumption.
Yes. Again, the similarity (high/low) of concentrations among ancestry groups seems to support this assumption. The alternative, I think, is that environmental conditions producing annonacin were accidentally correlated with ancestry.
I can’t judge Pomper’s data but the fact that it maps to your genetic groupings seems to lend it some credibility. Error tends to be random. These data are not random.

Richard · May 19, 2022, 11:48pm

This is an induction fallacy. The classic example is: A. Lincoln wore a beard, therefore all men wear beards.

An induction fallacy. In this case, correlation does not imply causality.

Yes you can, here it is: Pomper’s SSR data.
The zeroes are missing values. Only 3 of the tested cultivars have a full set. The number of markers (2x5) is grossly less than the number of specimens (41), consequently it is mathematically under-determined and any analysis will be highly speculative. Further, Pomper et al did not use a mathematically sound method (a metric) to compute “distances” between cultivar markers. Finally, they used a faulty clustering software (Camen and Sokal pair grouping) to create cultivar groupings - and then unjustifiably split one of the groups. So the results of Pomper et al cannot be viably compared to any other study - regardless of any similarities that exist.

jrd51 · May 20, 2022, 12:05am

@Richard –

It’s not induction fallacy. I’m not saying that my conclusion is the only one possible, only that it seems the most likely. Take a Bayesian approach. Whatever your priors, the likelihood that annonacin is inherited is dramatically higher if clusters of related varieties have similar levels. Give me an alternate explanation.
I’m not relying on Pomper’s genetic data, rather on his data re Brine Shrimp Mortality (aka pawpaw lethality). Why is that not OK?
I read the Peterson article that you suggested. He says that NC-1 was reportedly a product of Davis (female) x Overleese (male), hand pollinated. He says that genetic data suggests differently. The genetic data comes from Pomper, who you tell me not to trust. Really? And your own analysis shows that NC-1 is closely related to Overleese. So even if Overleese is not the male parent, it seems closely related to the male parent. All my other arguments stand.

Richard · May 20, 2022, 12:24am

Yes, that is clear.

Jujube · May 20, 2022, 1:18am

There is a more constructive and informative way to have a scientific conversation. Assumptions are a potential starting point and are present in all research.

Richard · May 20, 2022, 1:18am

This graph concerns distances (number of genetic marker mismatches) between Overleese and other cultivars tested by H. Huang. Values from 2 to 7 are considered close, from 8 to 11 are moderately close. Keep in mind the measurements are coarse - so they might only be accurate to +/- 2.

At the top right is “1-23”, whose only documented parent is C. Davis’s Taylor.
At 3 o’clock is C. Davis’s “Prolific” of unknown parentage.
Notice also the cultivars from J. Gordon who is known to have Overleese but also Zimmerman: PA-Golden, SAA-Zimmerman-1 and -2. The latter are crosses of Sweet Alice with Zimmerman. Unfortunately, H. Huang did not test Zimmerman or Davis.

Jujube · May 20, 2022, 1:22am

Not having read the paper how did they decide on the genetic markers to select?

Richard · May 20, 2022, 1:29am

@Jujube
H. Huang conducted 4 years of research into RAPD markers for Pawpaw: RAPD Inheritance and Diversity in Pawpaw (2000). The testing was performed in the following year: Molecular Characterization of Cultivated Pawpaw (2003).

Pomper et al did zero research. Instead they contracted with Genetic Identification Services of Chatsworth CA for selection of markers and production of marker data.

Jujube · May 20, 2022, 1:35am

Interesting but I personally wouldn’t be so quick to dismiss a lab for contracting out some experimentation.

Regardless this is interesting for ancestry purposes but phenotypic relation overlayed on this would really boost this researches reach

Richard · May 20, 2022, 1:50am

They did not know enough to reject the results.

Unfortunately the lab used by H. Huang only returned 49 of his markers free from error. I have re-analyzed that data and produced what I believe is a bias-free set of 45 markers - to be published by IJCSA next month.

Next I plan to apply all of H. Huang’s markers to the 100 available cultivars in circulation and incorporate morphology as well. At current pricing this would run about $40k for error-free results.

Richard · May 20, 2022, 3:35am

Here is another look at Sweet Alice. Values from 2 to 7 are considered close, from 8 to 11 are moderately close, 12 to 13 are sub-average. Keep in mind the measurements are coarse - so they might only be accurate to +/- 2.

Of interest here is the isolation of Sweet Alice. Only Overleese and Sunflower can be considered independent finds and the remainder are due to ancestral parentage.

Jujube · May 23, 2022, 5:45pm

Again interesting for ancestry purposes but how the markers correlate to phenotype would be even more interesting. Additionally the trends in which markers are shared or not may be informative if patterns emerge.

Additionally, I suggest adjusting the node distances to be proportional to the number of mismatches (more fibonacci looking) , make lines gray, and bold “Sweet Alice” to improve figure readability. Just a suggestion

Richard · May 23, 2022, 7:09pm

I agree. That is the purpose of this study.

It is a topological graph extracted from a 45-dimensional space. Note that the displayed rotational sequence of neighbors is arbitrary and likely has no relation to the orientation of any 2D projection.

I agree.

Richard · May 28, 2022, 8:38am

I’ve now worked out the primary ancestry group for each of the cultivars tested by H. Huang. Here’s a short table of how they compare with the questionable clades published by KSU.

[Updated 6/2/2022]

Cultivar	Year introduced	Primary association	KSU clade
Overleese	1950	A	V
NC-1	1976	A	V
Prolific	1985	A	II
Rappahannock	1990	A	III
Shenandoah	1990	A	V
Susquehanna	1990	A	II
Potomac	1994	A	III
Sweet Alice	1945	B	III
Taylor	1968	C	I
Taytwo	1968	C	V
Sunflower	1970	D	V
8-20	1994	D	II
Wabash	1994	D	III
Rebecca’s Gold	1974	E	V
Mitchell	1979	F	V
PA-Golden	1986	F	untested
Wells	1990	F	IV

Richard · June 3, 2022, 4:10am

Here’s a list of cultivars I believe should be in any pawpaw breeding program, plus any advanced specimens you wish to fine-tune:

Middletown
Sweet Alice
Taylor
Sunflower
Rebecca’s Gold
Mitchell

TrilobaTracker · June 3, 2022, 2:20pm

As a general observation only, I think some of those are hard to find readily as an end consumer.
They exist perhaps mainly at repositories and collectors’ orchards.
Though as an academic/research endeavor this would probably not matter.

Richard · June 3, 2022, 4:00pm

Each were offered at some point in the last 12 months at one or more retail nurseries.

Frost, R. Diversity of Pawpaw (Asimina triloba) cultivars in USDA repositories and selected retail nurseries c. 2022 (in preparation).