A mass conserving mixed stress formulation for Stokes flow with weakly   imposed stress symmetry

Jay Gopalakrishnan; Philip L. Lederer; Joachim Sch\"oberl

arXiv:1901.04648·math.NA·December 20, 2024·SIAM J. Numer. Anal.

A mass conserving mixed stress formulation for Stokes flow with weakly imposed stress symmetry

Jay Gopalakrishnan, Philip L. Lederer, Joachim Sch\"oberl

PDF

TL;DR

This paper presents a novel finite element method for Stokes flow that weakly enforces stress symmetry, ensuring mass conservation, stability, and optimal convergence rates for velocity, pressure, and stress variables.

Contribution

It introduces a new discretization that directly approximates symmetric viscous stresses with weak enforcement, improving stability and accuracy over previous methods.

Findings

01

Achieves optimal convergence rates for pressure and stress.

02

Ensures exact mass conservation with $H(div)$-conforming velocity.

03

Method is pressure robust and stable.

Abstract

We introduce a new discretization of a mixed formulation of the incompressible Stokes equations that includes symmetric viscous stresses. The method is built upon a mass conserving mixed formulation that we recently studied. The improvement in this work is a new method that directly approximates the viscous fluid stress $σ$ , enforcing its symmetry weakly. The finite element space in which the stress is approximated consists of matrix-valued functions having continuous "normal-tangential" components across element interfaces. Stability is achieved by adding certain matrix bubbles that were introduced earlier in the literature on finite elements for linear elasticity. Like the earlier work, the new method here approximates the fluid velocity $u$ using $H (div)$ -conforming finite elements, thus providing exact mass conservation. Our error analysis shows optimal…

Tables3

Table 1. (a) The d = 2 𝑑 2 d=2 example.

$k = 1$
$\| 𝒯 \|$	${‖ \nabla u - \nabla u_{h}^{*} ‖}_{h}$ (	eoc )	$‖ u - u_{h}^{*} ‖$ (	eoc )	$‖ σ - σ_{h} ‖$ (	eoc )	$‖ p - p_{h} ‖$ (	eoc )	$‖ ω - ω_{h} ‖$ (	eoc )
20	$0.009 902 863 275 354 638$ (	– )	$0.000 839 222 993 903 479 4$ (	– )	$0.010 312 466 960 396 744$ (	– )	$0.034 413 032 282 143 27$ (	– )	$0.008 827 275 037 994 896$ (	– )
80	$0.003 530 562 976 211 743 5$ (	$1.487 947 469 755 284 4$ )	$0.000 165 089 480 411 018 7$ (	$2.345 806 013 025 177$ )	$0.003 579 183 155 647 400 5$ (	$1.526 687 220 159 966 8$ )	$0.009 362 753 882 968 158$ (	$1.877 950 180 095 254 7$ )	$0.003 224 753 279 108 695$ (	$1.452 779 363 160 478 4$ )
320	$0.000 950 392 241 687 551 7$ (	$1.893 303 286 799 296 3$ )	$2.394 430 301 750 303 7 ⋅ 10^{- 05}$ (	$2.785 493 847 265 070 8$ )	$0.000 940 814 144 360 854 3$ (	$1.927 648 716 739 806$ )	$0.002 377 404 964 960 747$ (	$1.977 545 263 060 428 9$ )	$0.000 924 059 071 519 572 6$ (	$1.803 131 800 777 962 6$ )
1280	$0.000 252 203 789 574 182 94$ (	$1.913 933 105 535 469 4$ )	$3.400 178 222 372 369 7 ⋅ 10^{- 06}$ (	$2.816 000 168 331 647 6$ )	$0.000 245 617 947 936 767 3$ (	$1.937 493 764 054 934 6$ )	$0.000 596 785 141 652 696$ (	$1.994 104 149 629 636 3$ )	$0.000 257 366 780 619 461 84$ (	$1.844 159 229 459 235 3$ )
5120	$6.533 973 632 118 895 ⋅ 10^{- 05}$ (	$1.948 557 415 638 314 7$ )	$4.609 940 623 237 435 3 ⋅ 10^{- 07}$ (	$2.882 790 294 319 957$ )	$6.295 522 096 878 068 ⋅ 10^{- 05}$ (	$1.964 018 053 173 128 8$ )	$0.000 149 350 809 893 709 41$ (	$1.998 506 555 748 312 8$ )	$6.863 349 183 158 698 ⋅ 10^{- 05}$ (	$1.906 841 190 109 466 6$ )
$k = 2$
20	$0.002 223 318 053 241 880 3$ (	– )	$0.000 100 386 852 219 448 23$ (	– )	$0.001 807 572 533 516 103 2$ (	– )	$0.003 723 134 021 148 019 4$ (	– )	$0.001 458 406 040 600 15$ (	– )
80	$0.000 503 214 615 429 471 5$ (	$2.143 468 615 824 571 7$ )	$1.058 543 152 620 989 4 ⋅ 10^{- 05}$ (	$3.245 418 341 613 661$ )	$0.000 372 262 220 944 538 2$ (	$2.279 662 423 470 582 2$ )	$0.000 531 157 628 153 550 4$ (	$2.809 305 580 825 953 3$ )	$0.000 276 873 697 536 222 5$ (	$2.397 092 529 929 767$ )
320	$6.655 823 868 997 209 ⋅ 10^{- 05}$ (	$2.918 484 663 139 137 7$ )	$7.744 534 306 321 447 ⋅ 10^{- 07}$ (	$3.772 757 783 686 894 4$ )	$5.063 416 161 924 362 6 ⋅ 10^{- 05}$ (	$2.878 136 242 351 472$ )	$6.747 790 680 282 963 ⋅ 10^{- 05}$ (	$2.976 652 937 345 178 4$ )	$4.142 590 806 807 659 5 ⋅ 10^{- 05}$ (	$2.740 622 779 780 947$ )
1280	$8.355 809 373 311 833 ⋅ 10^{- 06}$ (	$2.993 765 771 670 016 6$ )	$4.932 380 475 155 760 6 ⋅ 10^{- 08}$ (	$3.972 822 493 503 286$ )	$6.371 723 935 662 283 ⋅ 10^{- 06}$ (	$2.990 355 397 715 691 6$ )	$8.471 067 211 643 275 ⋅ 10^{- 06}$ (	$2.993 799 579 853 649$ )	$5.196 616 887 090 514 ⋅ 10^{- 06}$ (	$2.994 888 714 616 015 7$ )
5120	$1.043 295 015 579 778 6 ⋅ 10^{- 06}$ (	$3.001 632 408 970 303 3$ )	$3.081 002 595 084 804 7 ⋅ 10^{- 09}$ (	$4.000 812 288 339 548 5$ )	$7.958 407 260 327 208 ⋅ 10^{- 07}$ (	$3.001 132 127 398 458$ )	$1.060 037 618 288 193 3 ⋅ 10^{- 06}$ (	$2.998 428 272 644 474$ )	$6.445 731 725 877 459 ⋅ 10^{- 07}$ (	$3.011 156 653 191 416 7$ )
$k = 3$
20	$0.000 414 611 108 409 708 84$ (	– )	$1.443 204 168 859 798 8 ⋅ 10^{- 05}$ (	– )	$0.000 237 647 329 826 083 42$ (	– )	$7.196 988 373 849 738 ⋅ 10^{- 05}$ (	– )	$0.000 224 032 839 106 052 26$ (	– )
80	$4.783 867 967 564 853 ⋅ 10^{- 05}$ (	$3.115 509 292 653 894 8$ )	$8.436 872 051 505 858 ⋅ 10^{- 07}$ (	$4.096 423 378 271 388$ )	$2.697 639 439 434 633 4 ⋅ 10^{- 05}$ (	$3.139 052 752 968 86$ )	$5.700 689 004 084 194 ⋅ 10^{- 06}$ (	$3.658 185 123 986 187$ )	$2.627 106 790 817 884 ⋅ 10^{- 05}$ (	$3.092 163 468 681 002 2$ )
320	$2.956 026 421 551 080 7 ⋅ 10^{- 06}$ (	$4.016 446 502 653 18$ )	$2.606 241 817 111 470 4 ⋅ 10^{- 08}$ (	$5.016 665 368 596 337$ )	$1.729 596 788 053 222 4 ⋅ 10^{- 06}$ (	$3.963 189 879 575 17$ )	$3.647 565 093 356 667 ⋅ 10^{- 07}$ (	$3.966 130 669 308 764$ )	$1.711 084 861 445 255 ⋅ 10^{- 06}$ (	$3.940 491 629 291 239$ )
1280	$1.861 719 667 682 021 2 ⋅ 10^{- 07}$ (	$3.988 951 407 085 007 6$ )	$8.299 954 785 406 924 ⋅ 10^{- 10}$ (	$4.972 723 661 372 874$ )	$1.115 257 717 319 600 6 ⋅ 10^{- 07}$ (	$3.954 986 712 664 551 7$ )	$2.292 767 525 473 536 3 ⋅ 10^{- 08}$ (	$3.991 771 738 493 527$ )	$1.133 819 783 070 296 ⋅ 10^{- 07}$ (	$3.915 648 060 245 93$ )
5120	$1.172 497 914 249 385 2 ⋅ 10^{- 08}$ (	$3.988 978 590 986 12$ )	$2.631 934 860 145 417 5 ⋅ 10^{- 11}$ (	$4.978 907 789 079 624$ )	$7.097 711 004 954 45 ⋅ 10^{- 09}$ (	$3.973 879 487 149 761$ )	$1.435 012 754 522 159 8 ⋅ 10^{- 09}$ (	$3.997 954 615 797 718 6$ )	$7.320 933 788 024 666 ⋅ 10^{- 09}$ (	$3.953 019 860 065 994$ )

Table 2. (a) The d = 2 𝑑 2 d=2 example.

$k = 1$
$\| 𝒯 \|$	${‖ \nabla u - \nabla u_{h}^{*} ‖}_{h}$ (	eoc )	$‖ u - u_{h}^{*} ‖$ (	eoc )	$‖ σ - σ_{h} ‖$ (	eoc )	$‖ p - p_{h} ‖$ (	eoc )	$‖ ω - ω_{h} ‖$ (	eoc )
20	$0.009 902 863 275 354 638$ (	– )	$0.000 839 222 993 903 479 4$ (	– )	$0.010 312 466 960 396 744$ (	– )	$0.034 413 032 282 143 27$ (	– )	$0.008 827 275 037 994 896$ (	– )
80	$0.003 530 562 976 211 743 5$ (	$1.487 947 469 755 284 4$ )	$0.000 165 089 480 411 018 7$ (	$2.345 806 013 025 177$ )	$0.003 579 183 155 647 400 5$ (	$1.526 687 220 159 966 8$ )	$0.009 362 753 882 968 158$ (	$1.877 950 180 095 254 7$ )	$0.003 224 753 279 108 695$ (	$1.452 779 363 160 478 4$ )
320	$0.000 950 392 241 687 551 7$ (	$1.893 303 286 799 296 3$ )	$2.394 430 301 750 303 7 ⋅ 10^{- 05}$ (	$2.785 493 847 265 070 8$ )	$0.000 940 814 144 360 854 3$ (	$1.927 648 716 739 806$ )	$0.002 377 404 964 960 747$ (	$1.977 545 263 060 428 9$ )	$0.000 924 059 071 519 572 6$ (	$1.803 131 800 777 962 6$ )
1280	$0.000 252 203 789 574 182 94$ (	$1.913 933 105 535 469 4$ )	$3.400 178 222 372 369 7 ⋅ 10^{- 06}$ (	$2.816 000 168 331 647 6$ )	$0.000 245 617 947 936 767 3$ (	$1.937 493 764 054 934 6$ )	$0.000 596 785 141 652 696$ (	$1.994 104 149 629 636 3$ )	$0.000 257 366 780 619 461 84$ (	$1.844 159 229 459 235 3$ )
5120	$6.533 973 632 118 895 ⋅ 10^{- 05}$ (	$1.948 557 415 638 314 7$ )	$4.609 940 623 237 435 3 ⋅ 10^{- 07}$ (	$2.882 790 294 319 957$ )	$6.295 522 096 878 068 ⋅ 10^{- 05}$ (	$1.964 018 053 173 128 8$ )	$0.000 149 350 809 893 709 41$ (	$1.998 506 555 748 312 8$ )	$6.863 349 183 158 698 ⋅ 10^{- 05}$ (	$1.906 841 190 109 466 6$ )
$k = 2$
20	$0.002 223 318 053 241 880 3$ (	– )	$0.000 100 386 852 219 448 23$ (	– )	$0.001 807 572 533 516 103 2$ (	– )	$0.003 723 134 021 148 019 4$ (	– )	$0.001 458 406 040 600 15$ (	– )
80	$0.000 503 214 615 429 471 5$ (	$2.143 468 615 824 571 7$ )	$1.058 543 152 620 989 4 ⋅ 10^{- 05}$ (	$3.245 418 341 613 661$ )	$0.000 372 262 220 944 538 2$ (	$2.279 662 423 470 582 2$ )	$0.000 531 157 628 153 550 4$ (	$2.809 305 580 825 953 3$ )	$0.000 276 873 697 536 222 5$ (	$2.397 092 529 929 767$ )
320	$6.655 823 868 997 209 ⋅ 10^{- 05}$ (	$2.918 484 663 139 137 7$ )	$7.744 534 306 321 447 ⋅ 10^{- 07}$ (	$3.772 757 783 686 894 4$ )	$5.063 416 161 924 362 6 ⋅ 10^{- 05}$ (	$2.878 136 242 351 472$ )	$6.747 790 680 282 963 ⋅ 10^{- 05}$ (	$2.976 652 937 345 178 4$ )	$4.142 590 806 807 659 5 ⋅ 10^{- 05}$ (	$2.740 622 779 780 947$ )
1280	$8.355 809 373 311 833 ⋅ 10^{- 06}$ (	$2.993 765 771 670 016 6$ )	$4.932 380 475 155 760 6 ⋅ 10^{- 08}$ (	$3.972 822 493 503 286$ )	$6.371 723 935 662 283 ⋅ 10^{- 06}$ (	$2.990 355 397 715 691 6$ )	$8.471 067 211 643 275 ⋅ 10^{- 06}$ (	$2.993 799 579 853 649$ )	$5.196 616 887 090 514 ⋅ 10^{- 06}$ (	$2.994 888 714 616 015 7$ )
5120	$1.043 295 015 579 778 6 ⋅ 10^{- 06}$ (	$3.001 632 408 970 303 3$ )	$3.081 002 595 084 804 7 ⋅ 10^{- 09}$ (	$4.000 812 288 339 548 5$ )	$7.958 407 260 327 208 ⋅ 10^{- 07}$ (	$3.001 132 127 398 458$ )	$1.060 037 618 288 193 3 ⋅ 10^{- 06}$ (	$2.998 428 272 644 474$ )	$6.445 731 725 877 459 ⋅ 10^{- 07}$ (	$3.011 156 653 191 416 7$ )
$k = 3$
20	$0.000 414 611 108 409 708 84$ (	– )	$1.443 204 168 859 798 8 ⋅ 10^{- 05}$ (	– )	$0.000 237 647 329 826 083 42$ (	– )	$7.196 988 373 849 738 ⋅ 10^{- 05}$ (	– )	$0.000 224 032 839 106 052 26$ (	– )
80	$4.783 867 967 564 853 ⋅ 10^{- 05}$ (	$3.115 509 292 653 894 8$ )	$8.436 872 051 505 858 ⋅ 10^{- 07}$ (	$4.096 423 378 271 388$ )	$2.697 639 439 434 633 4 ⋅ 10^{- 05}$ (	$3.139 052 752 968 86$ )	$5.700 689 004 084 194 ⋅ 10^{- 06}$ (	$3.658 185 123 986 187$ )	$2.627 106 790 817 884 ⋅ 10^{- 05}$ (	$3.092 163 468 681 002 2$ )
320	$2.956 026 421 551 080 7 ⋅ 10^{- 06}$ (	$4.016 446 502 653 18$ )	$2.606 241 817 111 470 4 ⋅ 10^{- 08}$ (	$5.016 665 368 596 337$ )	$1.729 596 788 053 222 4 ⋅ 10^{- 06}$ (	$3.963 189 879 575 17$ )	$3.647 565 093 356 667 ⋅ 10^{- 07}$ (	$3.966 130 669 308 764$ )	$1.711 084 861 445 255 ⋅ 10^{- 06}$ (	$3.940 491 629 291 239$ )
1280	$1.861 719 667 682 021 2 ⋅ 10^{- 07}$ (	$3.988 951 407 085 007 6$ )	$8.299 954 785 406 924 ⋅ 10^{- 10}$ (	$4.972 723 661 372 874$ )	$1.115 257 717 319 600 6 ⋅ 10^{- 07}$ (	$3.954 986 712 664 551 7$ )	$2.292 767 525 473 536 3 ⋅ 10^{- 08}$ (	$3.991 771 738 493 527$ )	$1.133 819 783 070 296 ⋅ 10^{- 07}$ (	$3.915 648 060 245 93$ )
5120	$1.172 497 914 249 385 2 ⋅ 10^{- 08}$ (	$3.988 978 590 986 12$ )	$2.631 934 860 145 417 5 ⋅ 10^{- 11}$ (	$4.978 907 789 079 624$ )	$7.097 711 004 954 45 ⋅ 10^{- 09}$ (	$3.973 879 487 149 761$ )	$1.435 012 754 522 159 8 ⋅ 10^{- 09}$ (	$3.997 954 615 797 718 6$ )	$7.320 933 788 024 666 ⋅ 10^{- 09}$ (	$3.953 019 860 065 994$ )

Table 3. (b) The d = 3 𝑑 3 d=3 example.

$k = 1$
$\| 𝒯 \|$	${‖ \nabla u - \nabla u_{h}^{*} ‖}_{h}$ (	eoc )	$‖ u - u_{h}^{*} ‖$ (	eoc )	$‖ σ - σ_{h} ‖$ (	eoc )	$‖ p - p_{h} ‖$ (	eoc )	$‖ ω - ω_{h} ‖$ (	eoc )
28	$0.001 534 912 937 852 505 1$ (	– )	$0.000 135 530 363 657 652 82$ (	– )	$0.001 462 006 690 774 445 7$ (	– )	$0.074 504 930 749 633 17$ (	– )	$0.001 057 151 897 040 475 4$ (	– )
224	$0.000 811 178 217 056 950 2$ (	$0.920 066 009 862 804 3$ )	$5.416 915 520 176 647 ⋅ 10^{- 05}$ (	$1.323 072 606 710 871 6$ )	$0.000 814 666 608 353 736 9$ (	$0.843 668 232 532 464 5$ )	$0.031 059 445 731 353 21$ (	$1.262 303 822 084 343 9$ )	$0.000 669 755 494 396 761 3$ (	$0.658 476 268 982 101 3$ )
1792	$0.000 316 883 727 194 630 5$ (	$1.356 065 336 034 114 8$ )	$1.315 926 622 703 861 1 ⋅ 10^{- 05}$ (	$2.041 392 546 054 836 6$ )	$0.000 316 187 110 879 376 8$ (	$1.365 431 217 497 968 8$ )	$0.009 518 865 089 765 845$ (	$1.706 170 604 451 079$ )	$0.000 316 872 489 687 682 9$ (	$1.079 732 098 478 432 9$ )
14336	$9.195 769 681 796 024 ⋅ 10^{- 05}$ (	$1.784 911 337 740 342 7$ )	$1.925 387 034 845 000 5 ⋅ 10^{- 06}$ (	$2.772 858 659 503 397$ )	$8.982 613 775 685 127 ⋅ 10^{- 05}$ (	$1.815 571 349 622 178 4$ )	$0.002 533 417 862 807 191$ (	$1.909 704 519 612 27$ )	$9.052 034 989 304 395 ⋅ 10^{- 05}$ (	$1.807 588 346 657 694 5$ )
114688	$2.384 372 161 076 711 ⋅ 10^{- 05}$ (	$1.947 360 898 397 387 8$ )	$2.484 982 674 290 340 4 ⋅ 10^{- 07}$ (	$2.953 840 782 451 363 3$ )	$2.314 273 133 948 463 8 ⋅ 10^{- 05}$ (	$1.956 576 160 320 721 5$ )	$0.000 643 709 867 099 229 3$ (	$1.976 602 566 825 686 1$ )	$2.343 623 606 510 043 4 ⋅ 10^{- 05}$ (	$1.949 501 274 240 323 3$ )
$k = 2$
28	$0.000 501 389 016 562 933 6$ (	– )	$4.301 511 717 938 578 ⋅ 10^{- 05}$ (	– )	$0.000 576 322 010 437 499 2$ (	– )	$0.006 749 483 578 828 532$ (	– )	$0.000 488 384 051 841 904 2$ (	– )
224	$0.000 207 588 342 353 322 32$ (	$1.272 204 965 020 100 8$ )	$9.650 726 925 770 082 ⋅ 10^{- 06}$ (	$2.156 134 247 570 403 5$ )	$0.000 157 950 833 908 662 6$ (	$1.867 399 565 173 022 6$ )	$0.001 550 192 727 864 627 2$ (	$2.122 329 532 047 871 7$ )	$0.000 135 236 582 940 444 73$ (	$1.852 530 621 120 108 5$ )
1792	$5.695 178 986 801 719 5 ⋅ 10^{- 05}$ (	$1.865 912 339 340 529 7$ )	$1.511 987 759 247 539 ⋅ 10^{- 06}$ (	$2.674 191 155 366 064$ )	$3.868 754 763 647 32 ⋅ 10^{- 05}$ (	$2.029 534 368 029 918$ )	$0.000 261 902 515 269 784 25$ (	$2.565 345 769 574 497 7$ )	$3.568 187 228 628 573 ⋅ 10^{- 05}$ (	$1.922 222 245 907 205 4$ )
14336	$7.872 544 751 941 488 ⋅ 10^{- 06}$ (	$2.854 839 224 759 451 6$ )	$1.058 019 998 719 267 ⋅ 10^{- 07}$ (	$3.837 007 657 155 479 6$ )	$5.419 399 168 937 890 5 ⋅ 10^{- 06}$ (	$2.835 664 462 060 319 7$ )	$3.530 495 399 346 918 ⋅ 10^{- 05}$ (	$2.891 087 373 290 317 5$ )	$5.243 883 677 105 361 ⋅ 10^{- 06}$ (	$2.766 483 729 494 196$ )
114688	$1.037 903 593 272 394 ⋅ 10^{- 06}$ (	$2.923 157 609 740 990 4$ )	$6.995 959 166 944 021 ⋅ 10^{- 09}$ (	$3.918 701 218 581 107 4$ )	$7.146 729 465 521 086 ⋅ 10^{- 07}$ (	$2.922 777 831 362 845 6$ )	$4.496 196 263 433 237 ⋅ 10^{- 06}$ (	$2.973 093 719 646 426$ )	$7.018 157 184 226 596 ⋅ 10^{- 07}$ (	$2.901 471 518 250 042$ )
$k = 3$
28	$0.000 175 916 286 437 657 45$ (	– )	$1.275 662 376 869 369 6 ⋅ 10^{- 05}$ (	– )	$0.000 166 840 404 921 817 86$ (	– )	$0.002 367 494 226 736 861 4$ (	– )	$0.000 126 738 793 025 565 37$ (	– )
224	$5.753 000 188 877 87 ⋅ 10^{- 05}$ (	$1.612 502 631 884 296 8$ )	$2.418 206 577 408 277 ⋅ 10^{- 06}$ (	$2.399 237 150 351 099 4$ )	$4.426 498 914 232 318 6 ⋅ 10^{- 05}$ (	$1.914 230 746 038 455 6$ )	$0.000 251 405 425 694 573 35$ (	$3.235 273 216 944 435$ )	$2.983 442 817 873 932 7 ⋅ 10^{- 05}$ (	$2.086 808 150 588 493$ )
1792	$6.806 454 821 350 378 ⋅ 10^{- 06}$ (	$3.079 339 054 092 517$ )	$1.682 402 106 077 241 1 ⋅ 10^{- 07}$ (	$3.845 343 027 033 511 6$ )	$4.953 349 067 192 706 ⋅ 10^{- 06}$ (	$3.159 689 869 863 922 4$ )	$2.980 735 803 728 853 5 ⋅ 10^{- 05}$ (	$3.076 275 372 113 862 7$ )	$3.622 045 410 075 866 ⋅ 10^{- 06}$ (	$3.042 101 586 980 125 6$ )
14336	$5.743 685 772 195 84 ⋅ 10^{- 07}$ (	$3.566 854 829 473 553$ )	$7.312 587 954 186 67 ⋅ 10^{- 09}$ (	$4.523 996 678 859 669$ )	$4.111 428 734 556 125 ⋅ 10^{- 07}$ (	$3.590 692 565 376 722$ )	$2.051 198 458 900 885 7 ⋅ 10^{- 06}$ (	$3.861 129 520 062 394 5$ )	$3.015 281 520 871 401 5 ⋅ 10^{- 07}$ (	$3.586 440 023 341 312$ )
114688	$3.981 013 944 264 474 6 ⋅ 10^{- 08}$ (	$3.850 768 993 929 522$ )	$2.464 760 636 887 164 4 ⋅ 10^{- 10}$ (	$4.890 862 619 675 43$ )	$2.756 289 648 924 011 2 ⋅ 10^{- 08}$ (	$3.898 840 413 207 374$ )	$1.310 788 376 662 412 ⋅ 10^{- 07}$ (	$3.967 960 392 998 584 5$ )	$2.034 661 277 284 434 ⋅ 10^{- 08}$ (	$3.889 432 160 032 821$ )

Equations304

⎩ ⎨ ⎧ - div (2 \tilde{ν} ε (u)) + \nabla p div (u) u = f = 0 = 0 in Ω, in Ω, on Γ,

⎩ ⎨ ⎧ - div (2 \tilde{ν} ε (u)) + \nabla p div (u) u = f = 0 = 0 in Ω, in Ω, on Γ,

\frac{1}{ν} dev (σ) - ε (u)

\frac{1}{ν} dev (σ) - ε (u)

div (σ) - \nabla p

div (u)

u

curl (ϕ)

curl (ϕ)

curl (ϕ)

curl (ϕ)

curl (ϕ)

curl (ϕ)

[H_{0} (div, Ω)]^{*} = H^{- 1} (curl, Ω) = {ϕ \in H^{- 1} (Ω, R^{d}) : curl (ϕ) \in H^{- 1} (Ω, R^{\tilde{d}})} .

[H_{0} (div, Ω)]^{*} = H^{- 1} (curl, Ω) = {ϕ \in H^{- 1} (Ω, R^{d}) : curl (ϕ) \in H^{- 1} (Ω, R^{\tilde{d}})} .

H (curl div, Ω)

H (curl div, Ω)

V

V

dev (ν^{- 1} σ) = dev (ε (u)) = ε (u) - \frac{ν}{d} tr (ε (u)) Id = ε (u) - \frac{1}{d} div (u) Id = ε (u) .

dev (ν^{- 1} σ) = dev (ε (u)) = ε (u) - \frac{ν}{d} tr (ε (u)) Id = ε (u) - \frac{1}{d} div (u) Id = ε (u) .

Σ^{sym} := {τ \in H (curl div, Ω) : tr (τ) = 0, τ = τ^{T}} .

Σ^{sym} := {τ \in H (curl div, Ω) : tr (τ) = 0, τ = τ^{T}} .

\int_{Ω} ε (u) : τ d x

\int_{Ω} ε (u) : τ d x

= \frac{1}{2} \int_{Ω} \nabla u : τ d x + \frac{1}{2} \int_{Ω} \nabla u : τ d x = \int_{Ω} \nabla u : τ d x .

(ν^{- 1} dev (σ), dev (τ)) + ⟨ div (τ), u ⟩_{H_{0} (div, Ω)}

(ν^{- 1} dev (σ), dev (τ)) + ⟨ div (τ), u ⟩_{H_{0} (div, Ω)}

⎩ ⎨ ⎧ (ν^{- 1} dev (σ), dev (τ)) + ⟨ div (τ), u ⟩_{H_{0} (div, Ω)} ⟨ div (σ), v ⟩_{H_{0} (div, Ω)} + (div (v), p) (div (u), q) = 0 = - (f, v) = 0 for all τ \in Σ^{sym}, for all v \in V, for all p \in Q .

⎩ ⎨ ⎧ (ν^{- 1} dev (σ), dev (τ)) + ⟨ div (τ), u ⟩_{H_{0} (div, Ω)} ⟨ div (σ), v ⟩_{H_{0} (div, Ω)} + (div (v), p) (div (u), q) = 0 = - (f, v) = 0 for all τ \in Σ^{sym}, for all v \in V, for all p \in Q .

κ (v) = \frac{1}{2} (0 v - v 0) if d = 2, κ (v) = \frac{1}{2} 0 v_{3} - v_{2} - v_{3} 0 v_{1} v_{2} - v_{1} 0 if d = 3.

κ (v) = \frac{1}{2} (0 v - v 0) if d = 2, κ (v) = \frac{1}{2} 0 v_{3} - v_{2} - v_{3} 0 v_{1} v_{2} - v_{1} 0 if d = 3.

\frac{1}{ν} dev (σ) - \nabla u + ω

\frac{1}{ν} dev (σ) - \nabla u + ω

div (σ) - \nabla p

σ - σ^{T}

div (u)

u

H^{m} (T_{h}) := T \in T_{h} \prod H^{m} (T), P^{k} (T_{h}) := T \in T_{h} \prod P^{k} (T) .

H^{m} (T_{h}) := T \in T_{h} \prod H^{m} (T), P^{k} (T_{h}) := T \in T_{h} \prod P^{k} (T) .

V_{h} := V \cap R T^{k}, Q_{h} := Q \cap P^{k} (T_{h}), W_{h} := P^{k} (T_{h}, K),

V_{h} := V \cap R T^{k}, Q_{h} := Q \cap P^{k} (T_{h}), W_{h} := P^{k} (T_{h}, K),

Q (q_{h}) = \overset{q}{^}_{h} \circ ϕ^{- 1}, P (\overset{v}{^}_{h}) := det (F)^{- 1} F (\overset{v}{^}_{h} \circ ϕ^{- 1}), W (\overset{η}{^}_{h}) := F^{- T} (\overset{η}{^}_{h} \circ ϕ^{- 1}) F^{- 1},

Q (q_{h}) = \overset{q}{^}_{h} \circ ϕ^{- 1}, P (\overset{v}{^}_{h}) := det (F)^{- 1} F (\overset{v}{^}_{h} \circ ϕ^{- 1}), W (\overset{η}{^}_{h}) := F^{- T} (\overset{η}{^}_{h} \circ ϕ^{- 1}) F^{- 1},

⟨ div (σ), v ⟩_{H_{0} (div, Ω)} = T \in T_{h} \sum [(div (σ), v)_{T} - ⟨ v_{n}, σ_{nn} ⟩_{H^{1/2} (\partial T)}]

⟨ div (σ), v ⟩_{H_{0} (div, Ω)} = T \in T_{h} \sum [(div (σ), v)_{T} - ⟨ v_{n}, σ_{nn} ⟩_{H^{1/2} (\partial T)}]

Σ_{h}

Σ_{h}

B

B

B

δ Σ_{h} := {dev (curl (curl (r_{h}) B)) : r_{h} \in P_{⊥}^{k} (T_{h}, K)},

δ Σ_{h} := {dev (curl (curl (r_{h}) B)) : r_{h} \in P_{⊥}^{k} (T_{h}, K)},

Σ_{h}^{+} := Σ_{h} \oplus δ Σ_{h}, k \geq 1.

Σ_{h}^{+} := Σ_{h} \oplus δ Σ_{h}, k \geq 1.

M (\overset{σ}{^}_{h}) := \frac{1}{det ( F )} F^{- T} (\overset{σ}{^}_{h} \circ ϕ^{- 1}) F^{T} .

M (\overset{σ}{^}_{h}) := \frac{1}{det ( F )} F^{- T} (\overset{σ}{^}_{h} \circ ϕ^{- 1}) F^{T} .

B := F^{- T} (\hat{B} \circ ϕ^{- 1}) F^{- 1} .

B := F^{- T} (\hat{B} \circ ϕ^{- 1}) F^{- 1} .

a : L^{2} (Ω, M) \times L^{2} (Ω, M) \to R,

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

A mass conserving mixed stress formulation for Stokes flow with weakly imposed stress symmetry

Jay Gopalakrishnan

Portland State University, PO Box 751, Portland OR 97207,USA

[email protected]

,

Philip L. Lederer

Institute for Analysis and Scientific Computing, TU Wien, Wiedner Hauptstraße 8-10, 1040 Wien, Austria

[email protected]

and

Joachim Schöberl

Institute for Analysis and Scientific Computing, TU Wien, Wiedner Hauptstraße 8-10, 1040 Wien, Austria

[email protected]

Abstract.

We introduce a new discretization of a mixed formulation of the incompressible Stokes equations that includes symmetric viscous stresses. The method is built upon a mass conserving mixed formulation that we recently studied. The improvement in this work is a new method that directly approximates the viscous fluid stress $\sigma$ , enforcing its symmetry weakly. The finite element space in which the stress is approximated consists of matrix-valued functions having continuous “normal-tangential” components across element interfaces. Stability is achieved by adding certain matrix bubbles that were introduced earlier in the literature on finite elements for linear elasticity. Like the earlier work, the new method here approximates the fluid velocity $u$ using $H(\operatorname{div})$ -conforming finite elements, thus providing exact mass conservation. Our error analysis shows optimal convergence rates for the pressure and the stress variables. An additional post processing yields an optimally convergent velocity satisfying exact mass conservation. The method is also pressure robust.

Key words and phrases:

mixed finite element methods; incompressible flows; Stokes equations; weak symmetry

Philip L. Lederer has been funded by the Austrian Science Fund (FWF) through the research program “Taming complexity in partial differential systems” (F65) - project “Automated discretization in multiphysics” (P10).

1. Introduction

In this work we introduce a new method for the discretization of steady incompressible Stokes system that includes symmetric viscous stresses. Let $\Omega\subset\mathbb{R}^{d}$ be a bounded domain with $d=2$ or $3$ having a Lipschitz boundary $\Gamma:=\partial\Omega$ . Let $u$ and $p$ be the velocity and the pressure, respectively. Given an external body force $f:\Omega\to\mathbb{R}^{d}$ and kinematic viscosity $\tilde{\nu}:\Omega\to\mathbb{R}$ , the velocity-pressure formulation of the Stokes system is given by

[TABLE]

where $\varepsilon({{u}})=(\nabla u+(\nabla u)^{\textrm{T}})/2$ . By introducing a new variable $\sigma=\nu\varepsilon(u)$ where $\nu:=2\tilde{\nu}$ , equation (1) can be reformulated to

[TABLE]

We shall call formulation (2) the mass conserving mixed formulation with symmetric stresses, or simply the MCS formulation. Although formulations (1) and (2) are formally equivalent, the MCS formulation (2) demands less regularity of the velocity field $u$ . Many authors have studied this formulation previously [15, 14, 13, 12], including us [18]. In [18], following the others, we introduced a new variable $\sigma=\nu\nabla u$ , which is in general nonsymmetric, and considered an analogous formulation (which was also called an MCS formulation). The main novelty in [18] was that $\sigma=\nu\nabla u$ was set in a new function space $H(\operatorname{curl}\operatorname{div},\Omega)$ of matrix-valued functions whose divergence can continuously act on elements of $H_{0}(\operatorname{div},\Omega)$ . Accordingly, the appropriate velocity space there was $H_{0}(\operatorname{div},\Omega),$ not $H_{0}^{1}(\Omega,\mathbb{R}^{2})$ as in the classical velocity-pressure formulation.

In contrast to [18], in this work we set $\sigma=\nu\varepsilon(u)$ , not $\nu\nabla u$ . Our goal is to apply what we learnt in [18] to produce a new method that provides a direct approximation to the symmetric matrix function $\sigma=\nu\varepsilon(u)$ . Being the viscous stress, this $\sigma$ is of more direct practical importance (than $\nu\nabla u$ ). We shall seek $\sigma$ in the same function space $H(\operatorname{curl}\operatorname{div},\Omega)$ that we considered in [18]. We have shown in [18] that matrix-valued finite element functions with “normal-tangential” continuity across element interfaces are natural for approximationg solutions in $H(\operatorname{curl}\operatorname{div},\Omega).$ We shall continue to use such finite elements here. It is interesting to note that in the HDG (hybrid discontinuous Galerkin) literature [11, 16] the potential importance of such normal-tangential continuity was noted and arrived at through a completely different approach.

The main point of departure in this work, stemming from that fact that $H(\operatorname{curl}\operatorname{div},\Omega)$ contains non-symmetric matrix-valued functions, is that we impose the symmetry of stress approximations weakly using Lagrange multipliers. This technique of imposing symmetry weakly is widely used in finite elements for linear elasticity [1, 2, 3, 14]. In particular, our analysis is inspired by the early work of Stenberg [30], who enriched the stress space by curls of local element bubbles. (In fact, this idea was even used in a Stokes mixed method [15], but their resulting method is not pressure robust.) These enrichment curls lie in the kernel of the divergence operator and are only “seen” by the weak-symmetry constraint allowing them to be used to prove discrete inf-sup stability. While in two dimensions – assuming a triangulation into simplices – this technique only increases the local polynomial order by $1$ , this is not the case in three dimensions. Years later [8, 17], it was realized that it is possible to retain the good convergence properties of Stenberg’s construction and yet reduce the enrichment space. Introducing a “matrix bubble,” these works added just enough extra curls needed to prove stability.

We shall see in later sections that the matrix bubble can also be used to enrich our discrete fluid stress space. This might seem astonishing at first. Indeed, an enrichment space for fluid stresses must map well when using a specific map that is natural to ensure normal-tangential continuity of the discrete stress space. Moreover, the enrichment functions must lie in the kernel of a realization of the distributional row-wise divergence used in MCS formulations (displayed in (11) below). It turns out that these properties are all fulfilled by an enrichment using a double curl involving matrix bubbles. Hence we are able to prove the discrete inf-sup condition. Stability then follows in the same type of norms used in [30] and is a key result of this work.

Some comments on the choice of the discrete velocity space and its implications are also in order here. As mentioned above, the velocity space within the MCS formulation is $V=H_{0}(\operatorname{div},\Omega)$ . One of the main features of the first MCS method [18], as well the new version with weakly imposed symmetry of this paper, is that we can choose a discrete velocity space $V_{h}\subset V$ using $H(\operatorname{div})$ -conforming finite elements. Therefore, our method is tailored to approximate the incompressibility constraint exactly, leading to pointwise and exactly divergence-free discrete velocity fields. The use of such $H(\operatorname{div})$ -conforming velocities in Stokes flow is by no means new: for the standard velocity-pressure formulation, once can find it in [9, 10], and for the Brinkman Problem in [20]. Therein, and also in the more recent works of [25, 24], the $H^{1}$ -conformity is treated in a weak sense and a (hybrid) discontinuous Galerkin method is constructed. When employing $H(\operatorname{div})$ -conforming finite elements, one has the luxury of choice. In [18], we used the ${\mathcal{BDM}}^{k+1}$ space [6] and added several local stress bubbles in order to guarantee stability. In contrast, in this paper, we have chosen to take the smaller Raviart-Thomas space [26] of order $k$ , denoted by ${\mathcal{RT}}^{k}$ . A similar choice was made also in the work of [16], where they presented a hybrid method for solving the Brinkman problem based off the work of [11]. Our current choice of the smaller space ${\mathcal{RT}}^{k}$ leads to a less accurate velocity approximation (compared to ${\mathcal{BDM}}^{k+1}$ ), so in order to retain the optimal convergence order of the velocity (measured in a discrete $H^{1}$ -norm), we introduce a local element-wise post processing. Using the reconstruction operator of [21, 22] this post processing can be done retaining the exact divergence-free property.

The remainder of this paper is organized as follows. In Section 2, we define notation for common spaces used throughout this work and introduce an undiscretized formulation. Section 3 presents the MCS method for Stokes flow including symmetric viscous stresses. In Section 4, we present the new discrete method including the introduction of the matrix bubble. Section 5 proves a discrete inf-sup condition and develops a complete a priori error analysis of the discrete MCS system. In Section 6, we introduce a postprocessing for the discrete velocity. The concluding section (Section 7) reports various numerical experiments we performed to illustrate the theory.

2. Preliminaries

In this section, we introduce notation and present a weak formulation for Stokes flow that includes symmetric viscous stresses.

Let $\mathcal{D}(\Omega)$ or $\mathcal{D}(\Omega,\mathbb{R})$ denote the set of infinitely differentiable compactly supported real-valued functions on $\Omega$ and let $\mathcal{D}^{*}(\Omega)$ denote the space of distributions. To differentiate between scalar, vector and matrix-valued functions on $\Omega$ , we include the co-domain in the notation, e.g., $\mathcal{D}(\Omega,\mathbb{R}^{d})=\{u:\Omega\to\mathbb{R}^{d}|\;u_{i}\in\mathcal{D}(\Omega)\}$ . Let $\mathbb{M}$ denote the vector space of real $d\times d$ matrices. This notation scheme is similarly extended to other function spaces as needed. Thus, $L^{2}(\Omega)=L^{2}(\Omega,\mathbb{R})$ denotes the space of square integrable $\mathbb{R}$ -valued functions on $\Omega$ , while analogous vector and matrix-valued function spaces are defined by $L^{2}(\Omega,\mathbb{R}^{d}):=\left\{u:\Omega\to\mathbb{R}^{d}\big{|}\;u_{i}\in L^{2}(\Omega)\right\}$ and $L^{2}(\Omega,{\mathbb{M}}):=\left\{\sigma:\Omega\to{\mathbb{M}}\big{|}\;\sigma_{ij}\in L^{2}(\Omega)\right\}$ , respectively. Let $\mathbb{K}$ denote the vector space of $d\times d$ skew symmetric matrices, i.e., $\mathbb{K}=\mathop{\text{missing}}{skw}(\mathbb{M})$ , and let $L^{2}(\Omega,\mathbb{K}):=\left\{\sigma:\Omega\to\mathbb{K}\big{|}\;\sigma_{ij}\in L^{2}(\Omega)\right\}$ .

Recall that the dimension $d$ in this work is either 2 or 3. Accordingly, depending on the context, certain differential operators have different meanings. The “curl” operator, depending on the context, denotes one of the differential operators below.

[TABLE]

where $(\cdot)^{\textrm{T}}$ denotes the transpose and $\partial_{i}$ abbreviates $\partial/\partial x_{i}.$ For matrix-valued functions in both $d=2$ and $3$ cases, i.e., $\phi\in\mathcal{D}^{*}(\Omega,\mathbb{M}),$ by $\operatorname{curl}(\phi)$ we mean the matrix obtained by taking $\operatorname{curl}$ row wise. Unfortunately, this still does not exhaust all the curl cases. In the $d=2$ case, there are two possible definitions of $\operatorname{curl}(\phi)$ for $\phi\in\mathcal{D}^{*}(\Omega,\mathbb{R}^{2})$ ,

[TABLE]

and we shall have occasion to use both. The latter will not be used until (14) below, so until then, the reader may continue assuming we mean (3) whenever we consider curl of vector functions in $\mathbb{R}^{2}$ . The operator $\nabla$ is to be understood from context as an operator that results in either a vector whose components are $[\nabla\phi]_{i}=\partial_{i}\phi$ for $\phi\in\mathcal{D}^{*}(\Omega,\mathbb{R}),$ or a matrix whose entries are $[\nabla\phi]_{ij}=\partial_{j}\phi_{i}$ for $\phi\in\mathcal{D}^{*}(\Omega,\mathbb{R}^{d}),$ or a third-order tensor whose entries are $[\nabla\phi]_{ijk}=\partial_{k}\phi_{ij}$ for $\phi\in\mathcal{D}^{*}(\Omega,\mathbb{K}).$ Finally, in a similar manner, we understand $\operatorname{div}(\phi)$ as either $\sum_{i=1}^{d}\partial_{i}\phi_{i}$ for vector-valued $\phi\in\mathcal{D}^{*}(\Omega,\mathbb{R}^{d}),$ or the row-wise divergence $\sum_{j=1}^{d}\partial_{j}\phi_{ij}$ for matrix-valued $\phi\in\mathcal{D}(\Omega,\mathbb{M})^{*}$ .

Let ${\tilde{d}}=d(d-1)/2$ (so that ${\tilde{d}}=1$ and $3$ for $d=2$ and 3, respectively). In addition to the standard Sobolev space $H^{m}(\Omega)$ for any $m\in\mathbb{R},$ we shall use the well-known space ${H}(\operatorname{div},\Omega)=\{{{u}}\in L^{2}(\Omega,\mathbb{R}^{d}):\operatorname{div}({{u}})\in L^{2}(\Omega)\}.$ By its trace theorem, $H_{0}(\operatorname{div},\Omega)=\{u\in H(\operatorname{div},\Omega):u\cdot n|_{\Gamma}=0\}$ is a well-defined closed subspace, where $n$ denotes the outward unit normal on $\Gamma$ . Its dual space $[H_{0}(\operatorname{div},\Omega)]^{*}$ , as proved in [18, Theorem 2.1], satisfies

[TABLE]

In this work, the following space is important:

[TABLE]

where the name results from (5): indeed a function $\sigma\in H(\operatorname{curl}\operatorname{div},\Omega)$ fulfills $\operatorname{curl}\operatorname{div}(\sigma)\in H^{-1}(\Omega,\mathbb{R}^{{\tilde{d}}})$ .

Next, let us derive a variational formulation of the system (2), which is based on the mixed stress formulation (MCS) introduced in chapter 3 in the work [18]. The method is based on a weaker regularity assumption of the velocity as compared to the standard velocity-pressure formulation (1). The velocity $u$ and the pressure $p$ now belong, respectively, to the spaces

[TABLE]

Multiplying (2c) with a pressure test function $q\in Q$ and integrating over the domain $\Omega$ ends up in the familiar equation $(\operatorname{div}(u),q)_{L^{2}(\Omega)}=0,$ which we write as the last equation of the final Stokes system (7) written below. Here and throughout, the inner product of a space $X$ is denoted by $(\cdot,\cdot)_{X}$ . When $X$ is the space of functions whose components are square integrable functions on $\Omega$ , we abbreviate $(\cdot,\cdot)_{X}$ to simply $(\cdot,\cdot)$ , as done in (7) below. Similarly, while we generally denote the norm and seminorm on a Sobolev space $X$ by $\|\cdot\|_{X}$ and $|\cdot|_{X}$ , respectively, to simplify notation, we set $\|f\|^{2}_{D}:=(f,f)_{D}$ , where $(f,g)_{D}$ denotes $L^{2}(D,\mathbb{V})$ inner product for any $\mathbb{V}\in\{\mathbb{R},\mathbb{R}^{d},\mathbb{K},\mathbb{M}\}$ and any subset $D\subseteq\Omega$ . Moreover, when $D=\Omega$ , we omit the subscript and simply write $\|f\|$ for $\|f\|$ .

To motivate the remaining equations of (7), let the deviatoric part of a matrix $\sigma$ be defined by $\operatorname{dev}{\!(\sigma)}:=\sigma-d^{-1}\textrm{tr}({\sigma})\operatorname{Id},$ where $\operatorname{Id}$ denotes the identity matrix and $\textrm{tr}({\sigma}):=\sum_{i=1}^{d}\sigma_{ii}$ denotes the matrix trace. Since $\nu^{-1}\sigma=\varepsilon(u)$ , due to the incompressibility constraint $\operatorname{div}(u)=0,$ we have the identity

[TABLE]

Since $\textrm{tr}({\sigma})=0$ and $\sigma=\sigma^{\textrm{T}}$ , we define the stress space as the following closed subspace of $H(\operatorname{curl}\operatorname{div},\Omega)$ :

[TABLE]

Testing equations (2a) with a test functions $\tau\in\Sigma^{\operatorname{sym}}$ and integrating over the domain, we have for the term including $\varepsilon(u)$ the identity

[TABLE]

Using the knowledge that the velocity $u$ should be in $H_{0}^{1}(\Omega),$ we obtain

[TABLE]

which is the first equation in the system (7) below. Here and throughout, when working with elements $f$ of the dual space $X^{*}$ of a topological space $X$ , we denote the action of $f$ on an element $x\in X$ by $\langle{f,x}\rangle_{X}$ , where we may omit the subscript $X$ when its obvious from context. Finally we also test (2b) with $v\in V$ and integrate the pressure term by parts. This results in the remaining equation of (7).

Summarizing, the weak problem is to find $(\sigma,u,p)\in\Sigma^{\operatorname{sym}}\times V\times Q$ such that

[TABLE]

In the ensuing section, we shall focus on a discrete analysis of a nonconforming scheme based on (7). Although wellposedness of (7) is an interesting question, we shall not comment further on it here since it is of no direct use in a nonconforming analysis.

3. The new method

In [18], we introduced an MCS method where $\sigma$ was an approximation to (the generally non-symmetric) $\nu\nabla u$ instead of (the symmetric) $\nu\varepsilon(u)$ considered above. Since there was no symmetry requirement in [18], there we worked with the space $\Sigma:=\{\tau\in H(\operatorname{curl}\operatorname{div},\Omega):\textrm{tr}({\tau})=0\}$ instead of $\Sigma^{\operatorname{sym}}$ . The finite element space for $\Sigma$ designed there can be reutilized in the current symmetric case (with some modifications), once we reformulate the symmetry requirement as a constraint in a weak form.

To do so, we need further notation. Let $\kappa:\mathbb{R}^{{\tilde{d}}}\to\mathbb{K}$ be defined by

[TABLE]

When $u$ represents the Stokes velocity, $\omega=\kappa(\operatorname{curl}(u))$ represents the vorticity. Since $\nabla u=\varepsilon(u)+\omega$ , introducing $\omega$ as a new variable, and the symmetry condition $\sigma-\sigma^{\textrm{T}}=0$ as a new constraint, we obtain the boundary value problem

[TABLE]

In the remainder of this section, we introduce a discrete formulation approximating (9).

The method will be described on a subdivision (triangulation) $\mathcal{T}_{h}$ of $\Omega$ consisting of triangles in two dimensions and tetrahedra in three dimensions. For the analysis later, we shall assume that the $\mathcal{T}_{h}$ is quasiuniform. By $h$ we denote the maximum of the diameters of all elements $T\in\mathcal{T}_{h}$ . Quasiuniformity implies that $h\sim\operatorname{diam}(T)$ for all mesh elements $T$ . Here and throughout, by $A\sim B$ we indicate that there exist two constants $c,C>0$ independent of the mesh size $h$ as well as the viscosity $\nu$ such $cA\leq B\leq cA$ . Similarly, we use the notation $A\lesssim B$ if there exists a constant $C\neq C(h,\nu)$ such that $A\leq CB$ . All element interfaces and element boundaries on $\Gamma$ are called facets and are collected into a set $\mathcal{F}_{h}$ . This set is partitioned into facets on the boundary $\mathcal{F}_{h}^{\text{ext}}$ and interior facets $\mathcal{F}_{h}^{\text{int}}$ . On each facet we denote by ${[\![\cdot]\!]}$ the standard jump operator. On a boundary facet the jump operator is just the identity. On all facets we denote by $n$ a unit normal vector. When integrating over boundaries of $d$ -dimensional domains, the orientation of $n$ is assumed to be outward. On a facet with normal $n$ adjacent to an mesh element $T$ , the normal and tangential traces of a smooth function $\phi:T\rightarrow\mathbb{R}^{d}$ are defined by $\phi_{n}:=\phi\cdot n$ and $\phi_{t}=\phi-\phi_{n}n,$ respectively. Similarly, for a smooth $\psi:T\rightarrow{\mathbb{M}}$ , the (scalar-valued) “normal-normal” and the (vector-valued) “normal-tangential” components are defined by $\psi_{nn}=\psi:(n\otimes n)=n^{{\textrm{T}}}\psi n$ and $\psi_{nt}=\psi n-\psi_{nn}n,$ respectively.

For any integers $m,k\geq 0$ , the following “broken spaces” are viewed as consisting of functions on $\Omega$ without any continuity constraints across element interfaces:

[TABLE]

For $D\subset\Omega$ we use the notation $(\cdot,\cdot)_{D}$ for the inner product of $L^{2}(D)$ or its vector and tensor analogues such as $L^{2}(D,\mathbb{R}^{d}),L^{2}(D,\mathbb{M}),L^{2}(D,\mathbb{K}).$ Also let $\|\cdot\|^{2}_{D}=(\cdot,\cdot)_{D}$ . Next for each element $T\in\mathcal{T}_{h}$ let ${\mathbb{P}}^{k}(T)\equiv{\mathbb{P}}^{k}(T,\mathbb{R})$ denote the set of polynomials of degree at most $k$ on $T$ . The vector and tensor analogues such as ${\mathbb{P}}^{k}(T,\mathbb{R}^{d}),{\mathbb{P}}^{k}(T,{\mathbb{M}}),{\mathbb{P}}^{k}(T,{\mathbb{K}})$ have their components in ${\mathbb{P}}^{k}(T)$ . The broken spaces ${\mathbb{P}}^{k}(\mathcal{T}_{h},\mathbb{R}^{d}),{\mathbb{P}}^{k}(\mathcal{T}_{h},{\mathbb{M}}),$ and ${\mathbb{P}}^{k}(\mathcal{T}_{h},{\mathbb{K}})$ are defined similarly. We shall also use the conforming Raviart-Thomas space (see [4, 27]), ${\mathcal{RT}}^{k}:=\{u_{h}\in H(\operatorname{div},\Omega):u_{h}|_{T}\in{\mathbb{P}}^{k}(T,\mathbb{R}^{d})+x{{\mathbb{P}}}^{k}(T,\mathbb{R})\textrm{ for all }T\in\mathcal{T}_{h}\}.$

3.1. Velocity, pressure, and vorticity spaces

For any $k\geq 1$ , our method uses

[TABLE]

for approximating the velocity, pressure, and vorticity, respectively.

Standard finite element mappings apply for these spaces. Let $\hat{T}$ be the unit simplex (for $d=2$ and $3$ ), which we shall refer to as the reference element, and let $T\in\mathcal{T}_{h}$ . Let $\phi:\hat{T}\rightarrow T$ be an affine homeomorphism and set $F:=\phi^{\prime}$ . By quasiuniformity, $\|F\|_{\ell^{\infty}}\sim h,$ $\|F^{-1}\|_{\ell^{\infty}}\sim h^{-1},$ and $|\det{(F)}|\sim h^{d},$ estimates that we shall use tacitly in our scaling arguments later. Such arguments proceed by mapping functions on $\hat{T}$ to and from $\hat{T}$ . Given a scalar-valued $\hat{q}_{h}$ , a vector-valued $\hat{v_{h}}$ , and a skew-symmetric matrix-valued $\hat{\eta}_{h}$ on the reference element $\hat{T}$ , we map them to $T$ using

[TABLE]

respectively, i.e., these are our mappings for functions in the pressure, velocity, and vorticity spaces, respectively. The first is the inverse of the standard pullback, the second is the standard Piola map, and the third is designed to preserve skew symmetry.

3.2. Stress space

The definition of our stress space is motivated by the following result, proved in [18, Section 4].

Theorem 1.

Suppose ${{\sigma}}$ is in $H^{1}(\mathcal{T}_{h},{\mathbb{M}})$ and ${{\sigma}}_{{{n}}{{n}}}|_{\partial T}\in H^{1/2}(\partial T)$ for all elements $T\in\mathcal{T}_{h}$ . Assume that the normal-tangential trace ${{\sigma}}_{nt}$ is continuous across element interfaces. Then $\sigma$ is in $H(\operatorname{curl}\operatorname{div},\Omega)$ and moreover

[TABLE]

for all $v\in H_{0}(\operatorname{div},\Omega).$

Clearly, matrix finite element subspaces having normal-tangential continuity are suggested by Theorem 1. Technically, the theorem’s sufficient conditions for full conformity also include the condition ${{\sigma}}_{{{n}}{{n}}}|_{\partial T}\in H^{1/2}(\partial T).$ This condition is very restrictive as it would enforce continuity at vertices and edges in two and three dimensions respectively. If this constraint is relaxed, much simpler, albeit nonconforming, elements can be constructed. This was the approach we adopted in [18]. We continue in the same vein here and define the nonconforming stress space

[TABLE]

As mentioned in the introduction, we must enrich the above stress space ${{{\Sigma}}_{h}}$ to guarantee solvability of the resulting discrete system due to the additional weak symmetry constraints. We follow the approach of [30] and its later improvements [8, 17] to construct the needed enrichment space.

Define a cubic matrix-valued “bubble” function as follows. On a $d$ -simplex $T$ with vertices $a_{0},\ldots,a_{d}$ , let $F_{i}$ denote the face opposite to $a_{i}$ , and let $\lambda_{i}$ denote the unique linear function that vanishes on $F_{i}$ and equals one on $a_{i}$ , i.e., the $i$ th barycentric coordinate of $T$ . Following [8, 17], we define $B\in{\mathbb{P}}^{3}(T,\mathbb{M})$ by

[TABLE]

where the indices on the barycentric coordinates are calculated mod $4$ in (13a). Let ${\mathbb{P}}_{\perp}^{k}(T,\mathbb{V})$ denote the $L^{2}$ -orthogonal complement of ${\mathbb{P}}^{k-1}(T,\mathbb{V})$ in ${\mathbb{P}}^{k}(T,\mathbb{V})$ for $\mathbb{V}\in\{\mathbb{R},\mathbb{K}\}$ , and let ${\mathbb{P}}_{\perp}^{k}(\mathcal{T}_{h},\mathbb{V})=\prod_{T\in\mathcal{T}_{h}}{\mathbb{P}}_{\perp}^{k}(T,\mathbb{V}).$ For any $k\geq 1$ , define

[TABLE]

for $d=2$ and $3$ , with the understanding that in $d=2$ case, the outer curl is defined by (4), not (3). The total stress space is given by

[TABLE]

That functions in this space have normal-tangential continuity is a consequence of the following property proved in [8, Lemma 2.3].

Lemma 2.

Let $q\in\mathbb{M}$ and $T\in\mathcal{T}_{h}$ . The products $qB$ and $Bq$ have vanishing tangential trace on $\partial T$ , so the function $\operatorname{curl}(qB)$ has vanishing normal trace on $\partial T$ .

Lemma 3.

Any $\sigma\in\delta\Sigma_{h}$ has vanishing $\sigma_{nt}$ and ${[\![\sigma_{nt}]\!]}$ on all facets $F\in\mathcal{F}_{h}.$

Proof.

Since $(\operatorname{dev}{\!(\sigma)})_{nt}=\sigma_{nt}$ , this is a direct consequence of Lemma 2. ∎

We also need a proper mapping for functions in $\Sigma_{h}^{+}$ that preserves normal-tangential continuity. We shall continue to use the following map, first introduced in [18]:

[TABLE]

As shown in [18, Lemma 5.3], on each facet, $(\mathcal{M}(\hat{\sigma}_{h}))_{nt}$ is a scalar multiple of $(\hat{\sigma}_{h})_{nt}$ and $\textrm{tr}({\hat{\sigma}_{h}})=0$ if and only if $\textrm{tr}({\mathcal{M}(\hat{\sigma}_{h})})=0.$ Degrees of freedom are discussed in §3.4.

*Remark 4**.*

Note that in (13), $B$ was given using barycentric coordinates as an expression that holds on any simplex. Let $\hat{B}$ denote the function on the reference element $\hat{T}$ obtained by replacing $\lambda_{i}$ by reference element barycentric coordinates $\hat{\lambda}_{i}$ . Considering the obvious map that transforms $\hat{\nabla}\hat{\lambda}_{i}\otimes\hat{\nabla}\hat{\lambda}_{i}$ to $\nabla\lambda_{i}\otimes\nabla\lambda_{i}$ , we find that the matrix bubble $B$ on any simplex is given by

[TABLE]

3.3. Equations of the method

For the derivation of the discrete variational formulation we turn our attention back to the weak formulation (7) and identify these forms:

[TABLE]

The definition of the remaining bilinear form is motivated by the definition of the “distributional divergence” given by (11). To this end we define $b_{2}:\{\tau\in H^{1}(\mathcal{T}_{h},\mathbb{M}):{[\![\tau_{nt}]\!]}=0\}\times\left(\{v\in H^{1}(\mathcal{T}_{h},\mathbb{R}^{d}):{[\![v_{n}]\!]}=0\}\times L^{2}(\Omega,\mathbb{M})\right)\to\mathbb{R}$ by

[TABLE]

Integrating the first integral by parts, we find the equivalent representation

[TABLE]

Using these forms, we state the method. For any $k\geq 1$ , the discrete MCS method with weakly imposed symmetry finds $\sigma_{h},u_{h},\omega_{h},p_{h}\in\Sigma_{h}^{+}\times V_{h}\times W_{h}\times Q_{h}$ such that

[TABLE]

Since $V_{h}$ and $Q_{h}$ fulfills $\operatorname{div}(V_{h})=Q_{h}$ , the discrete velocity solution component $u_{h}$ satisfies $\operatorname{div}(u_{h})=0$ point wise, providing exact mass conservation.

3.4. Degrees of freedom of the new stress space

We need degrees of freedom (d.o.f.s) for the stress space that are well-suited for imposing normal-tangential continuity across element interfaces. Since the bubbles in $\delta\Sigma_{h}$ have zero normal-tangential continuity, we ignore them for this discussion and focus on d.o.f.s that control $\Sigma_{h}$ .

Consider $\Sigma_{T}=\{\tau|_{T}:\tau\in\Sigma_{h}\}$ on any mesh element $T$ . Letting $\mathbb{D}$ denote the subspace of matrices $M\in{\mathbb{M}}$ satisfying $M:\operatorname{Id}=0,$ we may identify $\Sigma_{T}$ with ${\mathbb{P}}^{k}(T,\mathbb{D})$ . Let us recall a basis for $\mathbb{D}$ that was given in [18]. Define the following two sets of constant matrix functions, for $d=2$ and $d=3$ cases, respectively, by

[TABLE]

taking the indices mod 3 and mod 4, respectively. We proved in [18, Lemma 5.1] that the sets $\{S^{i}:i=0,1,2\}$ and $\{S^{i}_{q}:i=0,1,2,3,\;q=0,1\}$ form a basis of $\mathbb{D}$ when $d=2$ and $3$ , respectively.

Our d.o.fs for $\Sigma_{T}\equiv{\mathbb{P}}^{k}(T,\mathbb{D})$ are grouped into two. The first group is associated to the set of element facets ( $d-1$ subsimplices of $T$ ), namely, for each facet $F\in\partial T$ , we define the set of d.o.f.s

[TABLE]

for each $r$ in any fixed basis for ${\mathbb{P}}^{k}(F,\mathbb{R}^{d-1})$ . The next group is the set of interior d.o.f.s, defined by

[TABLE]

for all $\varsigma$ in any basis of ${\mathbb{P}}^{k-1}(T,\mathbb{D})$ . We proceed to prove that the set of these d.o.f.s, $\Phi(T):=\Phi^{0}(\tau)\cup\{\Phi^{F}:F\subset\partial T\}$ , is unisolvent.

Theorem 5.

The set $\Phi(T)$ is a set of unisolvent d.o.f.s for $\Sigma_{T}\equiv{\mathbb{P}}^{k}(T,\mathbb{D})$ .

Proof.

Suppose $\tau\in\Sigma_{T}$ satisfies $\phi(\tau)=0$ for all d.o.f.s $\phi\in\Phi(T)$ . We need to show that $\tau=0$ . From the facet d.o.f.s we conclude that $\tau_{nt}$ vanishes on $\partial T$ . By [18, Lemma 5.2], $\tau$ may be expressed as

[TABLE]

when $d=2$ or $3$ , respectively, where $\mu_{i},\mu^{0}_{i},\mu^{1}_{i}\in{\mathbb{P}}^{k-1}(T)$ . The interior d.o.f.s imply that $\int_{T}\tau:s\mathop{~{}\mathrm{d}{{x}}}=0$ for any $s\in{\mathbb{P}}^{k-1}(\hat{T},\mathbb{D})$ . Choosing for $s$ the expression on the right hand side in (21) omitting the $\lambda_{i}$ , say for the $d=2$ case, we obtain

[TABLE]

yielding $\mu_{i}=0$ , and thus $\tau=0$ . A similar argument in $d=3$ case yields the same conclusion that $\tau=0$ .

To complete the proof, it now suffices to prove that $\dim(\Sigma_{T})$ equals the number of d.o.f.s, i.e., $\#\Phi(T)$ . Obviously, $\dim(\Sigma_{T})=\dim{\mathbb{P}}^{k}(T,\mathbb{D})=(d^{2}-1)\dim{\mathbb{P}}^{k}(T)$ . The cardinality of $\Phi(T)$ equals the sum of the number of facet d.o.f.s $(d+1)(d-1)\dim{\mathbb{P}}^{k}(T)$ and the number of interior d.o.f.s $(d^{2}-1)\dim{\mathbb{P}}^{k-1}(T)$ , which simplifies to $(d^{2}-1)\big{(}\dim{\mathbb{P}}^{k-1}(T)+\dim{\mathbb{P}}^{k}(F)\big{)}$ , equalling $\dim(\Sigma_{T})$ . ∎

Using these d.o.f.s, a canonical local interpolant $I_{T}(\tau)$ in $\Sigma_{T}$ can be defined as usual, by requiring that $\psi(\tau-I_{T}\tau)=0,$ for all $\psi\in\Phi(T).$

Lemma 6.

For any $\tau\in H^{1}(T,\mathbb{D}),$ we have $\mathcal{M}^{-1}(I_{T}\tau)=I_{\hat{T}}(\mathcal{M}^{-1}(\tau)).$

Proof.

This proceeds along the same lines as the proof of [18, Lemma 5.4]. ∎

The global interpolant $I_{\Sigma_{h}}$ is also defined as usual. On each element $T\in\mathcal{T}_{h}$ the global interpolant $(I_{\Sigma_{h}}\tau)|_{T}$ coincides with the local interpolant $I_{T}(\tau|_{T})$ .

Theorem 7.

For any $m\geq 1$ and any $\sigma\in\{\tau\in H^{m}(\mathcal{T}_{h},\mathbb{D}):{[\![\tau_{nt}]\!]}=0\}$ , the global interpolation operator $I_{\Sigma_{h}}$ satisfies

[TABLE]

for all $s\leq\min(k+1,m)$ .

Proof.

This follows from a standard Bramble-Hilbert argument using Lemma 6. ∎

4. A priori error analysis

In this section we first show the stability of the MCS method with weakly imposed symmetry by proving a discrete inf-sup condition (Theorem 21). We then prove consistency (Theorem 25), optimal error estimates (Theorem 26), and pressure robustness (Theorem 28). For simplicity, the analysis from now on assumes that $\nu$ is a constant.

4.1. Norms

In addition to the previous notation for norms (established in Section 2), hereon we also use $\|\cdot\|_{h}^{2}$ to abbreviate $\sum_{T\in\mathcal{T}_{h}}\|\cdot\|_{T}^{2}$ , a notation that also serves to indicate that certain seminorms are defined using differential operators applied element by element, not globally, e.g.,

[TABLE]

for $v\in H^{1}(\mathcal{T}_{h},\mathbb{R}^{d})$ and $\gamma\in H^{1}(\mathcal{T}_{h},\mathbb{M})$ . Recall that $U_{h}=V_{h}\times W_{h}$ . Our analysis is based on norms of the type used in [30]. Accordingly, we will need to use the following norms for $v_{h}\in V_{h}$ and $\eta_{h}\in W_{h}$ :

[TABLE]

Lemma 15 below will show that the latter is indeed a norm.

On the discrete space $U_{h}$ , we will also need another norm defined using the following projections. On any mesh element $T$ , let $\Pi^{k-1}_{T}$ denote the $L^{2}(T,\mathbb{V})$ orthogonal projection onto ${\mathbb{P}}^{k}(T,\mathbb{V})$ where $\mathbb{V}$ is determined from context to be an appropriate vector space such as $\mathbb{R}^{d},$ or $\mathbb{M}$ . When the element $T$ is clear from context, we shall drop the subscript $T$ in $\Pi^{k-1}_{T}$ and simply write $\Pi^{k-1}$ . Also, on each facet $F\in\mathcal{F}_{h},$ we introduce a projection onto the tangent plane $n_{F}^{\perp}$ : for any $v\in L^{2}(F,n_{F}^{\perp})$ , the projection $\Pi_{F}^{1}v\in{\mathbb{P}}^{1}(F,n_{F}^{\perp})$ is defined by $(\Pi_{F}^{1}v,r)_{F}=(v,r)_{F}$ for all $r\in{\mathbb{P}}^{1}(F,n_{F}^{\perp})$ . Using these, define

[TABLE]

Lemma 14 below will help us go between this norm and $\|(v_{h},\eta_{h})\|_{U_{h}}$ .

The remaining spaces ${\Sigma_{h}^{+}}$ and $Q_{h}$ are simply normed by the $L^{2}$ norm $\|\cdot\|$ . The full discrete space is normed by

[TABLE]

for any $(v_{h},\eta_{h},\tau_{h},q_{h})\in V_{h}\times W_{h}\times\Sigma_{h}^{+}\times Q_{h}$ .

4.2. Norm equivalences

Next, we use the finite element mappings introduced earlier –see (10) and (15)– to show several norm equivalences.

Lemma 8.

Let $\tau_{h}\in\Sigma_{h}^{+}$ . Then

[TABLE]

Proof.

The first two follow by a simple scaling argument. For the third, see the proof of [18, Lemma 6.1]. ∎

In the proof of the next lemma, we use the space of rigid displacements ${\mathbb{E}}={\mathbb{P}}^{0}(T,\mathbb{R}^{d})+{\mathbb{P}}^{0}(T,\mathbb{K})\,x.$ For each element $T\in\mathcal{T}_{h}$ , let $\Pi^{{{\mathbb{E}}}}:H^{1}(T)\to{\mathbb{E}}$ denote the projector defined in [5]. Then, for any $v_{h}\in V_{h},$ the projection $\Pi^{{{\mathbb{E}}}}v_{h}\in{\mathbb{E}}$ fulfills the properties (see [5, eq. (3.3), (3.11)])

[TABLE]

We shall also use a global discrete Korn inequality, implied by [5, Theorem 3.1]. Namely, there is an $h$ -independent constant $c_{K}$ such that

[TABLE]

Lemma 9.

For all $(v_{h},\eta_{h})\in U_{h}$ ,

[TABLE]

Proof.

One side of the equivalence is obvious by the continuity of the $\Pi_{F}^{1}$ . For the other direction first note that $h^{-1}\|{[\![{(v_{h})_{{t}}}]\!]}\|^{2}_{F}\leq 2h^{-1}\|\Pi_{F}^{1}{[\![(v_{h})_{{t}}]\!]}\|^{2}_{F}+2h^{-1}\|{[\![{(v_{h}-\Pi_{F}^{1}v_{h})_{{t}}}]\!]}\|^{2}_{F}.$ As $\Pi^{{{\mathbb{E}}}}v_{h}\in{\mathbb{P}}^{1}(T,\mathbb{R}^{d})$ we have again by the continuity of $\Pi_{F}^{1},$

[TABLE]

We conclude the proof using (28). ∎

The following well-known property of Raviart-Thomas spaces (see, e.g., [7, Lemma 3.1]) is needed at several points.

Lemma 10.

*Let $v\in{\mathbb{P}}^{k}(T,\mathbb{R}^{d})+x{\mathbb{P}}^{k}(T,\mathbb{R})$ and $\operatorname{div}(v)=0$ . Then $v$ is in ${\mathbb{P}}^{k}(T,\mathbb{R}^{d})$ . *

Lemma 11.

For all $T\in\mathcal{T}_{h}$ and $v_{h}\in V_{h},$

[TABLE]

Proof.

One side of the equivalence of (30) is obvious by the continuity of the $\Pi^{k-1}$ . For the other direction, we use the following equivalence on the reference element $\hat{T}$ :

[TABLE]

This follows by finite dimensionality, because by the Euler identity if any one of the above two terms is zero, then $\hat{q}=0$ (see e.g., [23]). Consequently, given any $v_{h}\in V_{h}$ , setting $\hat{v}_{h}=\mathcal{P}^{-1}(v_{h}|_{T})$ , the following problem is uniquely solvable: find $\hat{b}\in{\mathbb{P}}^{k}(\hat{T},\mathbb{R})$ such that

[TABLE]

Since $\hat{\operatorname{div}}(\hat{x}{\mathbb{P}}^{k}(\hat{T},\mathbb{R}))={\mathbb{P}}^{k}(\hat{T},\mathbb{R})$ , (34) implies that $\hat{\operatorname{div}}(\hat{x}\hat{b})=\hat{\operatorname{div}}(\hat{v}_{h})$ . Put $r=\mathcal{P}^{-1}(\hat{x}\hat{b})$ . Then, due to the properties of the Piola map $\mathcal{P}$ , $r$ is a function in ${\mathbb{P}}^{k}(T,\mathbb{R}^{d})+x{\mathbb{P}}^{k}(T,\mathbb{R})$ satisfying $\operatorname{div}(r)=\operatorname{div}(v_{h})$ in $T$ , and a scaling argument using (33) implies

[TABLE]

Let $a=v_{h}-r\in{\mathbb{P}}^{k}(T,\mathbb{R}^{d})+x{\mathbb{P}}^{k}(T,\mathbb{R})$ . Then $\operatorname{div}(a)=0$ and $v_{h}=a+r$ in $T$ . In particular, the former implies, by Lemma 10, that $a\in{\mathbb{P}}^{k}(T,\mathbb{R}^{d})$ . Then we have

[TABLE]

This proves (30).

To prove (31), first note that due to the definition of $\kappa(\cdot),$ we have $\|\kappa(\operatorname{curl}v_{h})\|_{T}\sim\|\operatorname{curl}(v_{h})\|_{T}$ . Thus, using the same decomposition as above, namely, $v_{h}|_{T}=a+r$ ,

[TABLE]

As $\operatorname{curl}(a)\in{\mathbb{P}}^{k-1}(T,\mathbb{R}^{\tilde{d}})$ , the first term on the right vanishes. The last term satisfies

[TABLE]

due to (35). Hence (31) is proved.

The proof of (32) uses the same technique:

[TABLE]

where we have used that $a\in{\mathbb{P}}^{k}(T,\mathbb{R}^{d})$ and (35). ∎

*Remark 12**.*

The same technique shows that $\|\nabla v_{h}\|_{T}^{2}\sim\|\Pi^{k-1}[\operatorname{dev}{\!(\nabla v_{h})}]\|_{T}^{2}+\|\operatorname{div}(v_{h})\|_{T}^{2}$ for all Raviart-Thomas functions $v_{h}\in V_{h}$ . The technique allows one to control the gradient of the highest order terms of a velocity $v_{h}$ in the Raviart-Thomas space by $\operatorname{div}(v_{h})$ . A similar estimate does not hold for $v_{h}$ in ${\mathcal{BDM}}^{k+1}\!:=H_{0}(\operatorname{div},\Omega)\cap{\mathbb{P}}^{k+1}(\mathcal{T}_{h},\mathbb{R}^{d}).$

Lemma 13.

For all $T\in\mathcal{T}_{h}$ and $\eta_{h}\in W_{h}$ ,

[TABLE]

Proof.

The proof is based on a scaling argument and equivalence of norms on finite dimensional spaces on the reference element. Recall the map $\phi$ and $F=\phi^{\prime}$ . Calculations using the chain rule yield

[TABLE]

We continue with the $d=3$ case only (since $d=2$ case proceeds using (36b) analogously). With $\hat{\eta}_{h}=F^{\textrm{T}}(\eta_{h}\circ\phi)F$ , standard estimates for $F$ yield

[TABLE]

Let $\hat{v}\in{\mathbb{P}}^{k}(\hat{T},\mathbb{R}^{d})$ and $v\in{\mathbb{P}}^{k}(T,\mathbb{R}^{d})$ be such that $\hat{\eta}_{h}=\kappa(\hat{v})$ and $\eta_{h}=\kappa(v)$ , where $\kappa$ is as defined in (8). Then,

[TABLE]

In view of (37) and (38), to complete the proof, it suffices to establish the reference element estimate

[TABLE]

by proving that one side is zero if and only if the other side is zero. Note these two identities: $\hat{\operatorname{curl}}\,\kappa(\hat{v})=(\hat{\nabla}\hat{v})^{\textrm{T}}-\hat{\operatorname{div}}(\hat{v})\operatorname{Id},$ and $\hat{\operatorname{curl}}\kappa(\hat{v}):\operatorname{Id}=-2\operatorname{div}(\hat{v})$ . If $\hat{\operatorname{curl}}\,\kappa(\hat{v})=0$ , then the latter identity implies $\hat{\operatorname{div}}(\hat{v})=0$ , which when used in the former identity, yields $\hat{\nabla}\hat{v}=0$ . Combined with the obvious converse, we have established (39). ∎

Lemma 14.

For all $T\in\mathcal{T}_{h}$ and $(v_{h},\eta_{h})\in U_{h}$ ,

[TABLE]

Proof.

Since the decomposition $\nabla v_{h}=\varepsilon(v_{h})+\kappa(\operatorname{curl}(v_{h}))$ is orthogonal in the Frobenius inner product, so is $\nabla v_{h}-\eta_{h}=\varepsilon(v_{h})+[\kappa(\operatorname{curl}(v_{h})-\eta_{h}].$ Application of the deviatoric and $\Pi^{k-1}$ preserves this orthogonality. Hence, by Pythagoras theorem,

[TABLE]

We shall now prove the result using (40) and Lemma 11.

Proof of “ $\lesssim$ ”: Since

[TABLE]

it suffices to prove that

[TABLE]

which we do next. Since the projection $r_{1}=\Pi^{k-1}(\kappa(\operatorname{curl}(v_{h}))-\eta_{h})$ can be bounded using (40), we focus on the remainder $r_{2}=(\operatorname{Id}-\Pi^{k-1})(\kappa(\operatorname{curl}(v_{h}))-\eta_{h})$ .

[TABLE]

When this estimate for $r_{2}$ is used in $\|\kappa(\operatorname{curl}(v_{h}))-\eta_{h}\|^{2}_{T}=\|r_{1}\|_{T}^{2}+\|r_{2}\|_{T}^{2}$ and $r_{1}$ is bounded using (40), we obtain (41).

Proof of “ $\gtrsim$ ”: The last term of the lemma obviously satisfies $\|\operatorname{div}(v_{h})\|_{T}^{2}\lesssim\|\varepsilon(v_{h})\|_{T}^{2}$ , while the first term satisfies $\|\Pi^{k-1}\operatorname{dev}{\!(\nabla v_{h}-\eta_{h})}\|_{T}^{2}\leq\|\varepsilon(v_{h})\|_{T}^{2}+\|\kappa(\operatorname{curl}(v_{h}))-\eta_{h}\|_{T}^{2}$ by (40). It remains to bound $h^{2}\|\operatorname{curl}(\eta_{h})\|_{T}^{2}$ . As $\operatorname{curl}[\kappa(\operatorname{curl}(\Pi^{{\mathbb{E}}}v_{h}))]=0$ , we obtain using an inverse inequality for polynomials

[TABLE]

where we used (27) in the last step. ∎

Lemma 15.

For any $v_{h}\in V_{h}$ and $\gamma_{h}\in W_{h},$

[TABLE]

While the first estimate in (42) involves only the local constants from Lemmas 13 and 14, using the global constant $c_{K},$ we also have

[TABLE]

Proof.

To prove the first estimate of (42),

[TABLE]

Taking infimum over $v_{h}\in V_{h}$ , we obtain the lower estimate of (42). The upper bound of the first infimum obviously follows by choosing $v_{h}=0$ .

To prove the equality in (42), observe that the infimum over $\eta_{h}\in W_{h}$ cannot be larger than $\|v_{h}\|_{1,h,\varepsilon}$ because we may choose $\eta_{h}=\kappa(\operatorname{curl}v_{h})$ . The reverse inequality also holds since $\|(v_{h},\eta_{h})\|_{U_{h}}\geq\|v_{h}\|_{1,h,\varepsilon}$ for any $\eta_{h}\in W_{h}$ , so the equality must hold.

Finally, to prove (43), we use triangle inequality to get

[TABLE]

Applying the Korn inequality (29) and noting that the jump of the normal components are zero for functions in $v_{h}\in H_{0}(\operatorname{div},\Omega)$ , the proof is complete. ∎

4.3. Stability analysis

The next three lemmas lead us to a discrete inf-sup condition.

Lemma 16.

Let $\mu\in{\mathbb{P}}^{k}(T,\mathbb{M})$ for some $T\in\mathcal{T}_{h}$ and $\tau=(\det F)\operatorname{dev}{\!(\operatorname{curl}(\operatorname{curl}(\mu)B))}$ . Then for $d=3,2$ ,

[TABLE]

Proof.

If $\operatorname{curl}\mu=0$ , then obviously $\tau=0$ . We claim that the converse is also true. Indeed, if $\tau=0$ , then putting $s=d^{-1}\textrm{tr}({\operatorname{curl}(\operatorname{curl}(\mu)B)})$ , we have

[TABLE]

Taking divergence on both sides, we find that $\nabla s=0$ , so $s$ must be a constant on $T$ . Then, taking normal components of both sides of (44) on each facet, we find that $sn=0$ , so $s=0$ . Hence $\operatorname{curl}(\operatorname{curl}(\mu)B)=0$ , which in turn implies that $0=(\operatorname{curl}(\operatorname{curl}(\mu)B,\mu)_{T}=(\operatorname{curl}(\mu)B,\operatorname{curl}(\mu))_{T}=0$ . Therefore, by [8, Lemma 2.2], $\operatorname{curl}(\mu)=0$ .

Applying this on the reference element $\hat{T}$ for $\hat{\mu}=F^{\textrm{T}}(\mu\circ\phi)F\in{\mathbb{P}}^{k}(T,\mathbb{M})$ and $\hat{\tau}=\operatorname{dev}{\!(\hat{\operatorname{curl}}(\hat{\operatorname{curl}}(\hat{\mu})\hat{B}))}$ where $\hat{B}$ is in Remark 4, by finite dimensionality, we have

[TABLE]

We will now show that $\tau=(\det F)\operatorname{dev}{\!(\operatorname{curl}(\operatorname{curl}(\mu)B))}$ is related to $\hat{\tau}$ by

[TABLE]

By the definition of $\mathcal{M}$ ,

[TABLE]

as trace is preserved under similarity transformations. Focusing on the part of the last term inside the deviatoric, in the $d=3$ case,

[TABLE]

This proves that

[TABLE]

when $d=3$ . The same identity holds in the $d=2$ case: the argument is similar after changing the definitions of the curls and the mapping of $B$ appropriately. Thus, $\mathcal{M}(\hat{\tau})\circ\phi=(\det{F})\operatorname{dev}{\!(\operatorname{curl}(\operatorname{curl}(\mu)B))}\circ\phi$ and (46) is proved.

Finally, the result follows from (46) by scaling arguments: indeed (45) implies, by (24) and (36) that

[TABLE]

from which the result follows. ∎

Lemma 17.

For any $\gamma_{h}\in W_{h}$ , there is a $\tau_{h}\in{\Sigma_{h}^{+}}$ such that

[TABLE]

Furthermore, for any $v_{h}\in V_{h},$ the same $\gamma_{h},\tau_{h}$ pair satisfies

[TABLE]

Proof.

Given a $\gamma_{h}\in W_{h}$ , set $\tau_{h}$ element by element by

[TABLE]

Clearly, $\operatorname{dev}{\!(\operatorname{curl}(\operatorname{curl}(\Pi^{k-1}\gamma_{h})B))}$ is in $\Sigma_{h}.$ Since $\operatorname{dev}{\!(\operatorname{curl}(\operatorname{curl}(\gamma_{h}-\Pi^{k-1}\gamma_{h})B))}$ is in $\delta\Sigma_{h},$ we conclude that $\tau_{h}\in\Sigma_{h}^{+}$ . Since $\gamma_{h}$ is trace-free, $(\tau_{h},\gamma_{h})_{T}=(\operatorname{curl}(\operatorname{curl}(\gamma_{h}|_{T})B),\gamma_{h})_{T}$ $\det F,$ which in turn implies, after integrating by parts and applying Lemma 2, $(\tau_{h},\gamma_{h})_{T}=(\operatorname{curl}(\gamma_{h}){B},\operatorname{curl}\gamma_{h})_{T}$ $\det F$ .

In the $d=3$ case, this yields

[TABLE]

Noting that $\nabla\lambda_{i}=-n_{i}/h_{i}$ , where $h_{i}$ is the distance from the $i$ th vertex to the facet of the simplex opposite to it, and that the $\ell^{2}$ -norm of any matrix $m\in\mathbb{M}$ is equivalent to the sum of $\ell^{2}$ -norms of $mn_{i}$ , a local scaling argument with $m=\operatorname{curl}(\gamma_{h})$ and (49) imply

[TABLE]

Therefore, $(\tau_{h},\gamma_{h})_{\Omega}\gtrsim h\|\operatorname{curl}(\gamma_{h})\|_{h}^{2}\gtrsim h\|\operatorname{curl}(\gamma_{h})\|_{h}\,\|\tau_{h}\|$ , by Lemma 16. This proves (47) in the $d=3$ case. In the $d=2$ case, the analogue of (49) gives $(\tau_{h},\gamma_{h})_{T}\gtrsim$ $(\det F)$ $\|\operatorname{curl}(\gamma_{h})\|_{T}^{2}$ $\gtrsim h^{2}\|\operatorname{curl}(\gamma_{h})\|_{T}^{2}\geq h\|\operatorname{curl}(\gamma_{h})\|_{T}\,\|\tau_{h}\|,$ where we have used Lemma 16 again. This completes the proof of (47).

To prove (48), we use (18). The last sum in

[TABLE]

vanishes due to Lemma 3. Hence by (47),

[TABLE]

To handle the last term, note that

[TABLE]

because $(\operatorname{curl}(\operatorname{curl}(\gamma_{h})B),\nabla v_{h})_{T}=0$ . This follows by integrating one of the curls by parts, observing that the resulting volume term is zero (since $\operatorname{curl}(\nabla v_{h})=0$ ) and so is the resulting boundary term (due to Lemma 2). Continuing, we apply Cauchy-Schwarz inequality and an inverse inequality to get

[TABLE]

by Lemma 16. Returning to (50) and using this estimate, the proof is complete. ∎

*Remark 18**.*

The message of Lemmas 16 and 17 is that it is possible to choose a $\tau_{h}$ in the form of a deviatoric of a curl of a bubble to bound (from below) the term arising from the weak symmetry constraint. If $\tau_{h}$ was just a curl, it would not be seen by the equilibrium equation and the bound in (48) would not have the $\|\operatorname{div}(v_{h})\|$ -term, but our $\tau_{h}$ is a deviatoric (of a curl), thus necessitating this term.

Lemma 19.

For any $(v_{h},\gamma_{h})\in U_{h},$ there is a $\tau_{h}\in\Sigma_{h}$ such that

[TABLE]

Proof.

We only present the proof in two dimensions, as the three dimensional case is similar. From the local element basis exhibited in (20) (see also [18, §5.5] for a more detailed discussion), its clear that on any facet $F\in\mathcal{F}_{h}$ , there exists a constant trace-free function $S^{F}$ with the property that $S^{F}_{nt}\in{\mathbb{P}}^{0}(F,n_{F}^{\perp})$ , $\|S^{F}_{nt}\|_{2}=1$ on the facet $F,$ and $S^{F}_{nt}$ equals $(0,0)$ on all other facets in $\mathcal{F}_{h}$ . Given any $(v_{h},\gamma_{h})\in U_{h}$ , define

[TABLE]

where $\lambda_{T}^{F}$ is the unique barycentric coordinate function on the element $T$ opposite to the facet $F$ (so that $\lambda_{T}^{F}S^{F}$ is an $nt$ -bubble). Clearly, $\tau_{h}^{0}$ and $\tau_{h}^{1}$ are in $\Sigma_{h}$ . Using the norm equivalences stated in (26) and the mappings for $v_{h}$ and $\gamma_{h}$ given in (10), a scaling argument yields

[TABLE]

Setting $\tau_{h}=\alpha_{0}\tau_{h}^{0}+\alpha_{1}\tau_{h}^{1}$ and selecting the constants $\alpha_{0},\alpha_{1}$ appropriately, the rest of the proof proceeds along the same lines as the proof of [18, Lemma 6.5]. ∎

*Remark 20**.*

It is interesting to contrast Lemma 19 with [18, Lemma 6.5]. The latter gives a similar LBB-condition. The differences are (i) the velocity space in [18] is ${\mathcal{BDM}}^{k+1}$ (defined in Remark 12), (ii) the velocity norm is a discrete $H^{1}$ -norm defined using $\nabla$ in place of $\varepsilon(\cdot)$ , (iii) there is no weak symmetry constraint and no associated space $W_{h}$ , and (iv) the stress space in [18] equals the $\Sigma_{h}$ in (12) plus certain $nt$ -bubbles of degree $k+1$ (different from our $\delta\Sigma_{h}$ here). Lemma 19 shows that the inf-sup condition in [18, Lemma 6.5] continues to hold even if the $nt$ -bubbles there are removed and ${\mathcal{BDM}}^{k+1}$ is replaced by our Raviart-Thomas velocity space $V_{h}$ . This observation can be extended to prove the convergence of the MCS formulation in [18] with so modified spaces.

Theorem 21 (Discrete LBB-condition).

Let $v_{h}\in V_{h}$ and $\gamma_{h}\in W_{h}$ . Then,

[TABLE]

If $v_{h}$ is in the divergence-free subspace $V_{h}^{0}:=\{z_{h}\in V_{h}:\operatorname{div}(z_{h})=0\},$ then

[TABLE]

Proof.

By Lemmas 17 and 19, for any given $(v_{h},\gamma_{h})\in U_{h}$ , there are $\tau_{h}^{1},\tau_{h}^{2}\in\Sigma_{h}^{+}$ satisfying

[TABLE]

Clearly, the same inequalities hold when $\tau^{1}_{h}$ and $\tau^{2}_{h}$ are scaled by any nonzero factor, so we may assume without loss of generality, that they have been scaled so that $\|\tau_{h}^{1}\|=h\|\operatorname{curl}\gamma_{h}\|_{h}$ and $\|\tau^{2}_{h}\|=\|(v_{h},\gamma_{h})\|_{U_{h},*}.$ Set $\tau_{h}=\alpha\tau_{h}^{1}+\tau^{2}_{h},$ where $\alpha\in\mathbb{R}$ is to be chosen shortly. It follows from (53) and (54) that

[TABLE]

Next, we choose $q_{h}\in Q_{h}$ so that $q_{h}=\beta\operatorname{div}(v_{h})$ , where $\beta\in\mathbb{R}$ is another constant to be chosen shortly. Then (55) implies

[TABLE]

Choose any $\alpha>1$ and $\beta>\alpha^{2}/2$ . Then, using Young’s inequality for the last term,

[TABLE]

we can now conclude the proof of (51) using the norm equivalence of Lemma 14. The proof of (52) is similar (and in fact simpler since all terms involving $\operatorname{div}(v_{h})$ vanish). ∎

4.4. Error estimates

In this subsection we show that the error in the discrete MCS solution converges at optimal order. As we have chosen polynomials of degree $k$ for the stress space $\Sigma_{h}$ , the optimal rate of convergence for $\|\sigma-\sigma_{h}\|$ is $\mathcal{O}(h^{k+1})$ . However, the optimal rate for the velocity error in our discrete $H^{1}$ -like norm, namely, $\|u-u_{h}\|_{1,h,\varepsilon}$ is only $\mathcal{O}(h^{k})$ (since the Raviart-Thomas velocity space $V_{h}$ only contains ${\mathbb{P}}^{k}(T,\mathbb{R}^{d})$ within each mesh element $T$ ). Nevertheless, we are still able to prove optimal convergence rate of the stress error by using an appropriate interpolation operator and deducing that the stress error is independent of the velocity error. Another important property we shall conclude in this subsection is the pressure-robustness of the method.

Lemma 22 (Continuity).

The bilinear forms $a,b_{1}$ and $b_{2}$ are continuous:

[TABLE]

Proof.

The continuity of $a$ and $b_{1}$ follow by the Cauchy Schwarz inequality. For $b_{2},$ we use (18) and $\nabla v_{h}=\varepsilon(v_{h})+\kappa(\operatorname{curl}v_{h})$ to get

[TABLE]

Now, Cauchy-Schwarz inequality and (26) of Lemma 8 finishes the proof. ∎

Lemma 23 (Coercivity in the kernel).

For all $(\tau_{h},q_{h})$ in the kernel

[TABLE]

we have $\nu^{-1}\big{(}\,\left\|\tau_{h}\right\|+\|q_{h}\|\big{)}^{2}\lesssim\;a(\tau_{h},\tau_{h}).$

Proof.

By [24, Theorem 2.2], for any $q_{h}\in Q_{h},$ there is a $v_{h}\in V_{h}$ such that $\|q_{h}\|^{2}\lesssim(\operatorname{div}({v}_{h}),q_{h})$ and a discrete $H^{1}$ -norm of $v_{h}$ is bounded by $\|q_{h}\|$ . The latter bound implies, in particular, that $\|v_{h}\|_{1,h,\varepsilon}\lesssim\|q_{h}\|,$ and also that $\eta_{h}=\kappa(\operatorname{curl}v_{h})$ satisfies $\|(v_{h},\eta_{h})\|_{U_{h}}\lesssim\|q_{h}\|.$ This together with Lemma 22 implies

[TABLE]

yielding the needed bound for $\|q_{h}\|$ . ∎

We are now ready to conclude an inf-sup condition for $B(v_{h},\eta_{h},\tau_{h},q_{h};\tilde{v}_{h},\tilde{\eta}_{h},\tilde{\tau}_{h},\tilde{q}_{h}):=a(\tau_{h},\tilde{\tau}_{h})+b_{1}(v_{h},\tilde{q}_{h})+b_{1}(\tilde{v}_{h},q_{h})+b_{2}(\tau_{h},(\tilde{v}_{h},\tilde{\eta}_{h}))+b_{2}(\tilde{\tau}_{h},(v_{h},\eta_{h})).$

Corollary 24.

Let $\tau_{h}\in\Sigma_{h}^{+}$ , $v_{h}\in V_{h}$ , $\eta_{h}\in W_{h}$ , and $q_{h}\in Q_{h}$ . There holds

[TABLE]

so, in particular, there is a unique solution for the discrete MCS system (19). Moreover, if $v_{h}$ is restricted to $V_{h}^{0}$ , we also have

[TABLE]

Proof.

The first inf-sup condition follows from the standard theory of mixed methods [4], using Theorem 21 (the inf-sup condition for $b_{1}$ and $b_{2}$ given by (51)), Lemma 22 (continuity of forms), and Lemma 23 (coercivity in the kernel).

The second inf-sup condition also follows in a similar fashion, but now using the other inequality (52) of Theorem 21. ∎

Theorem 25 (Consistency).

The MCS method with weakly imposed symmetry (19) is consistent in the following sense. If the exact solution of the Stokes problem (9) is such that ${{u}}\in H^{1}(\Omega,\mathbb{R}^{d})$ , $\omega\in L^{2}(\Omega,{\mathbb{M}})$ , ${{\sigma}}\in H^{1}(\Omega,\mathbb{D})$ and ${p}\in L^{2}_{0}(\Omega,\mathbb{R})$ , then

[TABLE]

*for all ${v}_{h}\in{{{V}}_{h}},\eta_{h}\in W_{h},{q_{h}}\in{{Q}_{h}},$ and ${\tau}_{h}\in{{{\Sigma}}_{h}}.$ *

The proof of Theorem 25 is easy (see, e.g., the similar proof of [18, Theorem 6.2]), so we omit it. We now have all the ingredients to prove the following convergence result. Let $I_{V_{h}}$ denote the standard Raviart-Thomas interpolator (see, e.g., [4]) and let $\|(u,\omega,\sigma,p)\|_{\nu,s}=\nu^{-1}\|\sigma\|_{H^{s}(\mathcal{T}_{h},\mathbb{D})}+\nu^{-1}\|p\|_{H^{s}(\mathcal{T}_{h},\mathbb{R})}+\|\omega\|_{H^{s}(\mathcal{T}_{h},\mathbb{K})}+\|u\|_{H^{s+1}(\mathcal{T}_{h},\mathbb{R}^{d})}.$

Theorem 26 (Optimal convergence).

Let ${{u}}\in H^{1}(\Omega,\mathbb{R}^{d})\cap H^{m}(\mathcal{T}_{h},\mathbb{R}^{d})$ , ${{\sigma}}\in H^{1}(\Omega,\mathbb{D})\cap H^{m-1}(\mathcal{T}_{h},\mathbb{D})$ , ${p}\in L^{2}_{0}(\Omega,\mathbb{R})\cap H^{m-1}(\mathcal{T}_{h},\mathbb{R})$ and $\omega\in L^{2}(\Omega,\mathbb{K})\cap H^{m-1}(\mathcal{T}_{h},\mathbb{K})$ be the exact solution of the mixed Stokes problem (9), let $u_{h}$ , ${\sigma}_{h}$ , $\omega_{h}$ and $p_{h}$ solve (19) and let $s=\min(m-1,k+1)$ . Then,

[TABLE]

Proof.

Let $e_{h}^{\sigma}=I_{\Sigma_{h}}\sigma-\sigma_{h}$ , $e_{h}^{u}=I_{V_{h}}u-u_{h}$ , $e_{h}^{\omega}=\Pi^{k}\omega-\omega_{h}$ , $e_{h}^{p}=\Pi^{k}p-p_{h}$ (where the two occurrences of $\Pi^{k}$ represent projections onto two different discrete spaces per our prior notation). Denoting the analogous approximation errors by $a^{\sigma}=I_{\Sigma_{h}}\sigma-\sigma$ , $a^{u}=I_{V_{h}}u-u$ , $a^{\omega}=\Pi^{k}\omega-\omega$ , and $a^{p}=\Pi^{k}p-p$ , observe that Theorem 25 implies

[TABLE]

for any $v_{h}\in V_{h},\eta_{h}\in W_{h},\tau_{h}\in{\Sigma_{h}^{+}},$ and $q_{h}\in Q_{h}$ . The right hand side above is a sum of five terms $(\nu^{-1}a^{\sigma},\tau_{h})+b_{1}(a^{u},q_{h})+b_{1}(v_{h},a^{p})+b_{2}(\tau_{h},(a^{u},a^{\omega}))+b_{2}(a^{\sigma},(v_{h},\eta_{h})).$ The second term vanishes: $b_{1}(a^{u},q_{h})=(\operatorname{div}(I_{V_{h}}u-u),q_{h})=(\Pi^{k}\operatorname{div}(u)-\operatorname{div}(u),q_{h})=0$ as $\operatorname{div}(u)=0$ . The third term also vanishes: $b_{1}(v_{h},a^{p})=(\operatorname{div}(v_{h}),\Pi^{k}p-p)=0$ since $\operatorname{div}(v_{h})\in{\mathbb{P}}^{k}(\mathcal{T}_{h})$ . The fourth term, due to (17), is

[TABLE]

where the last two terms vanish by the properties of the Raviart-Thomas d.o.f.s that define $I_{V_{h}}$ , i.e., $b_{2}(\tau_{h},(a^{u},a^{\omega}))=(\tau_{h},a^{\omega}).$ The fifth term, due to (18), is

[TABLE]

Writing $(a^{\sigma},\eta_{h}-\nabla v_{h})=(a^{\sigma},\eta_{h})+(a^{\sigma},(\Pi^{k-1}-\operatorname{Id})\nabla v_{h})-(a^{\sigma},\Pi^{k-1}\nabla v_{h}),$ note that by the d.o.f.s of Theorem 5, the last term $(a^{\sigma},\Pi^{k-1}\nabla v_{h})$ is zero, and moreover, $(a^{\sigma},\eta_{h})=(a^{\sigma},\eta_{h}-\Pi^{0}\eta_{h})$ . Incorporating these observations on each term into (59), we obtain

[TABLE]

We now proceed to estimate the right hand side of (60). By (42) and Lemma 11,

[TABLE]

Using these after an application of the Cauchy-Schwarz inequality, (60) yields

[TABLE]

where we have used Theorem 7 and the approximation property of $\Pi^{k}$ in the last step.

To complete the proof, we apply triangle inequality starting from the left hand side of (58), to get

[TABLE]

again using Theorem 7. Bounding the last term above using (56) and (61),

[TABLE]

the proof is complete. ∎

*Remark 27** (Convergence in standard norms).*

Using also Lemma 15’s estimate (43), a consequence of the global discrete Korn inequality, (58) implies

[TABLE]

under the assumptions of Theorem 26 for a sufficiently smooth solution. Note that even though the optimal rate for $\|u-u_{h}\|_{1,h,\varepsilon}$ is only $\mathcal{O}(h^{k})$ , (63) gives a superconvergent rate of $\mathcal{O}(h^{k+1})$ for $\|u_{h}-I_{V_{h}}u\|_{1,h,\varepsilon}$ .

Theorem 28 (Pressure robustness).

Under the same assumptions as Theorem 26,

[TABLE]

Proof.

Proceeding along the lines of the proof of Theorem 26, omitting the pressure error, we obtain, instead of (62),

[TABLE]

We may now complete the proof as before by using (57) instead of (56). ∎

5. Postprocessing

In this section we describe and analyze a postprocessing for the discrete velocity. While for the raw solution $u_{h}$ , we may only expect $\|u-u_{h}\|_{1,h,\varepsilon}$ to go to zero at the rate $\mathcal{O}(h^{k}),$ we will show that a locally postprocessed velocity $u_{h}^{*}$ has error $\|u-u_{h}^{*}\|_{1,h,\varepsilon}$ that converges to zero at the higher rate $\mathcal{O}(h^{k+1})$ for sufficiently regular solutions. The key to obtain this enhanced accuracy, as in [30], is the $O(h^{k+1})$ -superconvergence of $\|u_{h}-I_{V_{h}}u\|_{1,h,\varepsilon}$ – see Remark 27. Finally, we shall also show that $u_{h}^{*}$ retains the prized structure preservation properties of exact mass conservation and pressure robustness.

The crucial ingredient is a reconstruction operator (see [21, 22]) whose properties are summarized in the next lemma. Let

[TABLE]

denote the BDM space (one order higher) and its “relaxed” analogue, respectively. The next result is a consequence of [21, Lemmas 3.3 and 4.8] and the Korn inequality (29).

Lemma 29.

There exists an operator $\mathcal{R}:V_{h}^{*,-}\rightarrow V_{h}^{*},$ whose application is computable element-by-element, satisfying

(1)

$\|\mathcal{R}v_{h}\|_{1,h,\varepsilon}\lesssim\|v_{h}\|_{1,h,\varepsilon},$ * for al $v_{h}\in V_{h}^{*,-}$ ,* 2. (2)

$\mathcal{R}v_{h}^{*}=v_{h}^{*}$ * for all $v_{h}^{*}\in V_{h}^{*}$ , and* 3. (3)

whenever the local (element-wise) property $\operatorname{div}(v_{h}|_{T})=0$ holds for all $T\in\mathcal{T}_{h}$ and all $v_{h}\in V_{h}^{*,-}$ , the global property $\operatorname{div}(\mathcal{R}v_{h})=0$ holds.

A simple choice of $\mathcal{R}$ is given by the classical BDM intepolant. This was used in [19]. Another choice of $\mathcal{R}$ , given in [21], based on a simple averaging of coefficients, is significantly less expensive for high orders.

The postprocessed solution $u_{h}^{*}\in V_{h}^{*}$ is given in two steps as follows. First, using the computed $\sigma_{h}$ and $u_{h}$ , solve the local (see Remark 31) minimization problem

[TABLE]

Second, apply the reconstruction and set $u_{h}^{*}:=\mathcal{R}(u_{h}^{*,-})$ .

Theorem 30.

Suppose the assumptions of Theorem 26 hold. Then $u_{h}^{*}\in V_{h}^{*},$ $\operatorname{div}(u_{h}^{*})=0,$ and for $s=\min(m-1,k+1)$ we have the pressure-robust error estimate

[TABLE]

Proof.

On any $T\in\mathcal{T}_{h}$ , the condition $I_{V_{h}}(u_{h}^{*,-})=u_{h}$ implies that the Raviart-Thomas d.o.f.s applied to $u_{h}^{*,-}$ and $u_{h}$ coincide. Hence, for all $q_{h}\in{\mathbb{P}}^{k}(T,\mathbb{R})$ ,

[TABLE]

as $\operatorname{div}(u_{h})=0$ . Thus, Lemma 29 implies that $u_{h}\in V^{*}_{h}$ and $\operatorname{div}(u_{h}^{*})=0$ .

It only remains to prove the error estimate. Let $I_{V_{h}^{*}}$ be the standard ${\mathcal{BDM}}^{k+1}$ interpolator. Then, $u_{h}^{*}=\mathcal{R}u_{h}^{*,-}$ satisfies

[TABLE]

Since standard approximation estimates yield $\|u-I_{V_{h}^{*}}u\|_{1,h,\varepsilon}\lesssim h^{s}\|(u,0,0,0)\|_{\nu,s}$ , we focus on the last term. A triangle inequality (where we add and subtract different functions in the element and facet terms) yields

[TABLE]

Naming the four sums on the right as $s_{1},s_{2},s_{3}$ and $s_{4}$ , respectively, we proceed to estimate each. Obviously $s_{1}=\nu^{-1}\|\sigma-\sigma_{h}\|\lesssim h^{s}\|(0,\omega,\sigma,0)\|_{\nu,s}$ by Theorem 28.

To bound $s_{2}$ , note that for any $w_{h}$ in the admissible set of the minimization problem (64), we have $s_{2}\leq\nu^{-2}\|\sigma_{h}-\nu\varepsilon(w_{h})\|^{2}$ . We choose $w_{h}=I_{V_{h}^{*}}u+u_{h}-I_{V_{h}}u\in V_{h}^{*}\subset V_{h}^{*,-}$ . Since $I_{V_{h}}I_{V_{h}^{*}}u=I_{V_{h}}u$ implies $I_{V_{h}}w_{h}=u_{h}$ , the chosen $w_{h}$ is in the admissible set. Hence,

[TABLE]

so a standard approximation estimate and Theorem 28 yield $s_{2}\lesssim h^{s}\|(u,\omega,\sigma,0)\|_{\nu,s}.$

The same standard approximation estimate for $I_{V_{h}^{*}}$ also gives $s_{3}\leq\|u-I_{V_{h}^{*}}u\|_{1,h,\varepsilon}\lesssim h^{s}\|(u,\omega,\sigma,0)\|_{\nu,s}$ . Hence it only remains to bound $s_{4}$ . Observe that $I_{V_{h}^{*}}u-u_{h}^{*,-}=I_{V_{h}}(I_{V_{h}^{*}}u-u_{h}^{*,-})+(\operatorname{Id}-I_{V_{h}})(I_{V_{h}^{*}}u-u_{h}^{*,-})=(I_{V_{h}}u-u_{h})+(\operatorname{Id}-I_{V_{h}})(I_{V_{h}^{*}}u-u_{h}^{*,-}),$ because $I_{V_{h}}I_{V_{h}^{*}}u=I_{V_{h}}u$ and $I_{V_{h}}u_{h}^{*,-}=u_{h}$ . This implies, letting $a=(\operatorname{Id}-I_{V_{h}})(\operatorname{Id}-\Pi^{{{\mathbb{E}}}})(I_{V_{h}^{*}}u-u_{h}^{*,-})$ , the identity $I_{V_{h}^{*}}u-u_{h}^{*,-}=(I_{V_{h}}u-u_{h})+a$ holds because $(\operatorname{Id}-I_{V_{h}}){\mathbb{E}}=0$ (as $k\geq 1$ ). Hence

[TABLE]

Since the first term can be bounded by Theorem 28, let us consider the last term. On any facet $F$ adjacent to a mesh element $T$ , a trace inequality yields $h^{-1}\big{\|}{[\![a_{t}]\!]}\big{\|}_{F}^{2}\leq h^{-1}\|a_{t}\|_{\partial T}^{2}\lesssim\|\nabla a\|_{T}^{2}+h^{-2}\|a\|_{T}^{2}.$ Hence,

[TABLE]

where we have used the continuity properties of $I_{V_{h}}$ , scaling arguments, (27), and an estimate analogous to (28). Using triangle inequality and returning to (66),

[TABLE]

The last two terms are $s_{1}$ and $s_{2}$ , respectively. Hence the prior estimates, the standard approximation estimate for $I_{V_{h}^{*}}$ , and Theorem 28 shows $s_{4}\lesssim h^{s}\|(u,\omega,\sigma,0)\|_{\nu,s}.$ ∎

*Remark 31**.*

The restriction of the minimizer of (64) to an element $T$ , namely $u_{T}^{*,-}:=u_{h}^{*,-}|_{T},$ can be computed using the following Euler-Lagrange equations. Letting $\Lambda_{h}^{*}(T)=\{\lambda:\lambda|_{F}\in{\mathbb{P}}^{k}(F,\mathbb{R})$ on all facets $F\subset\partial T\}$ , the function $u_{T}^{*,-}$ is the unique function in ${\mathbb{P}}^{k+1}(T,\mathbb{R}^{d})$ , which together with $\ell_{h}^{*}\in{\mathbb{P}}^{k-1}(T,\mathbb{R}^{d})$ and $\lambda_{h}^{*}\in\Lambda_{h}^{*}(T)$ , satisfies

[TABLE]

for all $v\in{\mathbb{P}}^{k+1}(T,\mathbb{R}^{d}),$ $\wp\in{\mathbb{P}}^{k-1}(T,\mathbb{R}^{d})$ and $\mu\in\Lambda_{h}^{*}(T)$ . The last two equations are another way to express the constraint $I_{V_{h}}u_{h}^{*,-}=u_{h}$ in (64).

6. Numerical exampels

In this last section we present two numerical examples to verify our method. All examples were implemented within the finite element library NGSolve/Netgen, see [28, 29] and on www.ngsolve.org. The computational domain is given by $\Omega=[0,1]^{d}$ and the velocity field is driven by the volume force determined by $f=-\operatorname{div}({{\sigma}})+\nabla{p}$ with the exact solution given by

[TABLE]

Here $\psi_{2}:=x^{2}(x-1)^{2}y^{2}(y-1)^{2}$ and $\psi_{3}:=x^{2}(x-1)^{2}y^{2}(y-1)^{2}z^{2}(z-1)^{2}$ defines a given potential in two and three dimensions respectively and we choose the viscosity $\nu=10^{-3}$ .

In Tables 1(a) and 1(b) we report the errors in all the computed solution components for varying polynomial orders $k=1,2,3$ in the two and the three dimensional cases, respectively. As predicted by Theorem 26 and Theorem 30 the corresponding errors converge at optimal order. Furthermore, the $L^{2}$ -norm of error of the (postprocessed) velocity error converges at one order higher. Note that in three dimensions the errors are already quite small already on the coarsest mesh. It appears that to get out of the preasymptotic regime and see the proper convergence rate, it takes several steps.

Bibliography30

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] D. N. Arnold, F. Brezzi, and J. Douglas, Jr. , PEERS: a new mixed finite element for plane elasticity , Japan J. Appl. Math., 1 (1984), pp. 347–367.
2[2] D. N. Arnold, R. S. Falk, and R. Winther , Mixed finite element methods for linear elasticity with weakly imposed symmetry , Math. Comp., 76 (2007), pp. 1699–1723.
3[3] D. Boffi, F. Brezzi, and M. Fortin , Reduced symmetry elements in linear elasticity , Commun. Pure Appl. Anal., 8 (2009), pp. 95–121.
4[4] , Mixed Finite Element Methods and Applications , Springer Science & Business Media, 2013.
5[5] S. C. Brenner , Korn’s inequalities for piecewise H 1 superscript 𝐻 1 H^{1} vector fields , Math. Comp., 73 (2004), pp. 1067–1087.
6[6] F. Brezzi, J. Douglas Jr., and L. D. Marini , Two families of mixed finite elements for second order elliptic problems , Numerische Mathematik, 47 (1985), pp. 217–235.
7[7] B. Cockburn and J. Gopalakrishnan , A characterization of hybridized mixed methods for the Dirichlet problem , SIAM J. Numer. Anal., 42 (2004), pp. 283–301.
8[8] B. Cockburn, J. Gopalakrishnan, and J. Guzmán , A new elasticity element made for enforcing weak stress symmetry , Math. Comp., 79 (2010), pp. 1331–1349.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

A mass conserving mixed stress formulation for Stokes flow with weakly imposed stress symmetry

Abstract.

Key words and phrases:

1. Introduction

2. Preliminaries

3. The new method

3.1. Velocity, pressure, and vorticity spaces

3.2. Stress space

Theorem 1**.**

Lemma 2**.**

Lemma 3**.**

Proof.

Remark 4*.*

3.3. Equations of the method

3.4. Degrees of freedom of the new stress space

Theorem 5**.**

Proof.

Lemma 6**.**

Proof.

Theorem 7**.**

Proof.

4. A priori error analysis

4.1. Norms

4.2. Norm equivalences

Lemma 8**.**

Proof.

Lemma 9**.**

Proof.

Lemma 10**.**

Lemma 11**.**

Proof.

Remark 12*.*

Lemma 13**.**

Proof.

Lemma 14**.**

Proof.

Lemma 15**.**

Proof.

4.3. Stability analysis

Lemma 16**.**

Proof.

Lemma 17**.**

Proof.

Remark 18*.*

Lemma 19**.**

Proof.

Remark 20*.*

Theorem 21** (Discrete LBB-condition).**

Proof.

4.4. Error estimates

Lemma 22** (Continuity).**

Proof.

Lemma 23** (Coercivity in the kernel).**

Proof.

Corollary 24**.**

Proof.

Theorem 25** (Consistency).**

Theorem 26** (Optimal convergence).**

Proof.

Remark 27* (Convergence in standard norms).*

Theorem 28** (Pressure robustness).**

Proof.

5. Postprocessing

Lemma 29**.**

Theorem 30**.**

Proof.

Remark 31*.*

6. Numerical exampels

Theorem 1.

Lemma 2.

Lemma 3.

*Remark 4**.*

Theorem 5.

Lemma 6.

Theorem 7.

Lemma 8.

Lemma 9.

Lemma 10.

Lemma 11.

*Remark 12**.*

Lemma 13.

Lemma 14.

Lemma 15.

Lemma 16.

Lemma 17.

*Remark 18**.*

Lemma 19.

*Remark 20**.*

Theorem 21 (Discrete LBB-condition).

Lemma 22 (Continuity).

Lemma 23 (Coercivity in the kernel).

Corollary 24.

Theorem 25 (Consistency).

Theorem 26 (Optimal convergence).

*Remark 27** (Convergence in standard norms).*

Theorem 28 (Pressure robustness).

Lemma 29.

Theorem 30.

*Remark 31**.*