Forgive me, I am not any expert in statistics, I am just a biologist with some experience in phylogenetics. My opinion is that nobody can give you a good answer without knowing exactly what data set you are studying here. For one example, we know a lot about the history of Homo sapiens sapiens, Neanderthal, Denisova, Pan troglodytes troglodytes, Pan troglodytes verus, Pan paniscus, Gorilla gorilla and other great ape and human evolution, and it is very clear that some genes give very good statistical support for humans being closer to gorillas than to chimpanzees ((human gorilla)chimpanzee) while other genes give very good statistical support for ((human chimp) gorilla). There are many solid explanations for why this can happen, such as introgression and incomplete lineage sorting. By studying the fossil record, complete genomes of a few individuals or each species, and many genes and mitochondrial genomes from thousands of individuals from these species, we get a more clear picture.
At the other end of the spectrum of sequence diversity (humans, chimps and gorillas are very similar in DNA sequence, so homoplasies and other problems are infrequent) we can’t use complete mitochodrial genomes to clearly determine the order in which amphibians, snakes, turtles, birds, lizards, alligators, mammals, marsupials and other tetrapods diverged from some lineage of fish, because of too much genetic distance, altered rates of evolution in different lineages and other problems. But again we have some fossil evidence and other data to suggest how and why we can be certain that some phylogenies are wrong despite very high statistical support using the molecular data.
When it comes to studying bacteria, jellyfish, insects, viruses, and many other organisms, we do not have a solid fossil record or other data to support us. Statistics alone can be a huge help in deciding which hypotheses are more likely to be correct, but even very strong statistical support is not always solid evidence that a hypothesis is correct. The other tread you link to states that you are using very short sequences, roughly 250 sites. It does not say what types of distances are involved. For many types of comparisons, 250 sites is good. For example we can tell if mitochondrial DNA is from a mammal, marsupial, or bird with just 250 bases, and most likely all 250-bases regions of the mitochondrial genome would show mammals-marsupial closer than bird. But I would not trust any 250 base region to tell me if birds are closer to mammals than to amphibians, nor whether humans are closer to gorillas than to chimpanzees, no matter what the statistics were for the 250 base region.
In my opinion, it seems a little silly to argue about methods (neighbor-joining, ML, Bayesian, etc) when the real issue is most likely in the data set. But without any clue what the data set looks like, we can only guess about that.