I'm not sure if I agree with you. I was able to create my Meditation Scene using VAM in 5 minutes: a beautiful environment asset with a Guru "Person" who does the chants. Are you saying using Unity will be faster?
		
		
	 
From "teach them basic words or alphabets" to "create my Meditation Scene using VAM in 5 minutes" real quick.
It seems like you're throwing out ideas to imagine alternate uses, but you don't really have a concrete scope or a goal in mind.
Either you are making a new educational app which removes every character system and all sexual content, or you simply fork VaM, keeping everything and removing the sexual aspect.
When you're talking about your meditation scene, it is just VaM used as a non-sexual sandbox to produce content. You're just asking for a sandbox poser app with a few scripting abilities. It's not like it has "more potential" than before, it's just you using it differently. If you look at my content, at least half of it would work outside of a sexual context... and I never needed a new version of VaM to do that.
The sheer complexity of VaM's scripting and animation systems makes it hard to go further than putting a few assets together, setting up a few triggers and making a couple of animations for a couple of characters. As soon as you need really advanced and complex systems, you will have to rely on Unity and external software.
Stasis Lab is one of the best examples of that: from the enviro to the animated assets, including VFX and sounds, doing that inside VaM only would have been a nightmare.
To get back to your "teaching alphabets" example: making an advanced interactive system, VR or not, to teach kids words and letters is gonna be a long process and an extremely complex scene. If that was the main goal of a specific "new version" of VaM, I think that, yes... using Unity would be faster for this objective.
And to also get back to your meditation example, you're looking at this the wrong way. If it really took you 5 minutes:
- You already know VaM pretty well
- You have no animation, or almost none (or you used community animations)
- You used assets from the community (from the enviro to your sounds and so on...)
- The scene is most certainly extremely basic
Your "5 mins" are in reality the sum of a lot of time spent by other members of the community producing original content, for a scene that is most certainly not as complex as "teaching alphabets to the kids".
So, to summarize, you are either:
- Talking about a couple of concepts similar to VaM's (a bit of scripting and asset import abilities) constrained into an app with a specific goal (teaching alphabets). For this, making something from scratch would be the wise choice. As I was saying before, VaM has nothing extraordinary to offer if you remove the characters.
- Talking about just VaM with all its awesome features, including characters, posing and animation without its sexual aspect. And for this, I would agree, making a fork of it, and removing sexual organs and the ability to remove clothes is enough.
That said, in the second situation, VaM is far from user-friendly. If you wanted to bring it to a more casual / family-friendly audience, you would have to work a lot on the bugs, on the UI and UX, and make the scripting system way easier. Stability, optimization... and so on.
In the game industry, something that looks good for your project is not always the thing you should use. You may find the CryEngine beautiful, but it might not fit your networking needs, for instance... or you would have to code everything yourself, and your studio might not be able to afford it.
It's exactly the same when you look at VaM. Just because it's a sandbox with a lot of customization abilities doesn't mean it fits any goal you have in mind, or that it is the best choice to make.