How to separate out the context-free part of a language from the context-sensitive part?

Question 1

One of the simplest languages that are not context-free is one where the words are of the type aⁿbⁿcⁿ (a, b, and c repeated the same number of times, that is, abc, aabbcc, aaabbbccc, ...).

You can parse it using a grammar for the context-free language {aⁿbⁿc^m}, where the number of c's is not restricted. Once you have the parse tree, you check using a separate algorithm that the number of repetitions of c is equal to the number of repetitions of a and b.

Question 2

Generally filtering is done also to disambiguate over-approximations of languages. We write ambiguous but clear context-free grammars for programming languages, then use tree walkers or other mechanisms to remove the unwanted derivations.

one reference:

Using Filters for the Disambiguation of Context-free Grammars (1994), http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.51.9812

One the other hand, you could also consider a type checker that processes abstract syntax trees as such a filter. Type checkers reject trees produced by a parser based on non-local (context) information. For example:

1 + "1"

is accepted by the grammar because:

E ::= Int | String | E "+" E;

but the type checker says that addition does not work between integers and strings and rejects the sentence from the language. The type checker does this by traversing the tree after parsing and identifying the addition symbol, then possibly looking up valid combinations of operands in a table and if the combination is not a valid it starts complaining. I guess that is typically how compilers work. See the Aho et al. dragon book. It sounds more interesting if you talk about it abstractly :-)